Full Text Available

Access Repository Access Repository

Note: Clicking the button above will open the full text document at the original institutional repository in a new window.

Intonation modelling for the Nguni languages

Dissertation (MSc (Computer Science))--University of Pretoria, 2006.

Saved in:

Bibliographic Details
Other Authors:	Barnard, E.
Format:	Thesis
Published:	University of Pretoria 2013
Subjects:	Intonation corpus Intensity Intonation modelling Pitch tracking Autocorrelation Classification Tone Fundamental frequency Prosody Nguni languages UCTD
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1867613582780268544
access_status_str	Open Access
author2	Barnard, E.
author_browse	Barnard, E.
author_facet	Barnard, E.
collection	Thesis
dc_rights_str_mv	© University of Pretor
description	Dissertation (MSc (Computer Science))--University of Pretoria, 2006.
format	Thesis
id	oai:repository.up.ac.za:2263/28847
institution	University of Pretoria (South Africa)
last_indexed	2026-06-10T12:38:26.847Z
license_str	Other — see source repository
provenance_str_mv	Harvested via OAI-PMH from UPSpace — University of Pretoria Institutional Repository
publishDate	2013
publishDateRange	2013
publishDateSort	2013
publisher	University of Pretoria
publisherStr	University of Pretoria
record_format	dspace
source_str	UPSpace — University of Pretoria Institutional Repository
spelling	oai:repository.up.ac.za:2263/28847 Intonation modelling for the Nguni languages Barnard, E. ngovender@csir.co.za Govender, Natasha Intonation corpus Intensity Intonation modelling Pitch tracking Autocorrelation Classification Tone Fundamental frequency Prosody Nguni languages UCTD Dissertation (MSc (Computer Science))--University of Pretoria, 2006. Although the complexity of prosody is widely recognised, there is a lack of widely-accepted descriptive standards for prosodic phenomena. This situation has become particularly noticeable with the development of increasingly capable text-to-speech (TTS) systems. Such systems require detailed prosodic models to sound natural. For the languages of Southern Africa, the deficiencies in our modelling capabilities are acute. Little work of a quantitative nature has been published for the languages of the Nguni family (such as isiZulu and isiXhosa), and there are significant contradictions and imprecisions in the literature on this topic. We have therefore embarked on a programme aimed at understanding the relationship between linguistic and physical variables of a prosodic nature in this family of languages. We then use the information/knowledge gathered to build intonation models for isiZulu and isiXhosa as representatives of the Nguni languages. Firstly, we need to extract physical measurements from the voice recordings of the Nguni family of languages. A number of pitch tracking algorithms have been developed; however, to our knowledge, these algorithms have not been evaluated formally on a Nguni language. In order to decide on an appropriate algorithm for further analysis, evaluations have been performed on two stateof- the-art algorithms namely the Praat pitch tracker and Yin (developed by Alain de Cheveingn´e). Praat’s pitch tracker algorithm performs somewhat better than Yin in terms of gross and fine errors and we use this algorithm for the rest of our analysis.<./p> For South African languages the task of building an intonation model is complicated by the lack of intonation resources available. We describe the methodology used for developing a generalpurpose intonation corpus and the various methods implemented to extract relevant features such as fundamental frequency, intensity and duration from the spoken utterances of these languages. In order to understand how the ‘expected’ intonation relates to the actual measured characteristics extracted, we developed two different statistical approaches to build intonation models for isiZulu and isiXhosa. The first is based on straightforward statistical techniques and the second uses a classifier. Both intonation models built produce fairly good accuracy for our isiZulu and isiXhosa sets of data. The neural network classifier used produces slightly better results for both sets of data than the statistical method. The classification model is also more robust and can easily learn from the training data. We show that it is possible to build fairly good intonation models for these languages using different approaches, and that intensity and fundamental frequency are comparable in predictive value for the ascribed tone. Computer Science MSc unrestricted 2013-09-07T14:22:12Z 2007-11-08 2013-09-07T14:22:12Z 2007-04-25 2006 2007-10-19 Dissertation Govender, N 2006, Intonation modelling for the Nguni languages, MSc Dissertation, University of Pretoria, Pretoria, viewed yymmdd <http://hdl.handle.net/2263/28847> Pretoria http://hdl.handle.net/2263/28847 http://upetd.up.ac.za/thesis/available/etd-10192007-145737/ © University of Pretor application/pdf University of Pretoria
spellingShingle	Intonation corpus Intensity Intonation modelling Pitch tracking Autocorrelation Classification Tone Fundamental frequency Prosody Nguni languages UCTD Intonation modelling for the Nguni languages
title	Intonation modelling for the Nguni languages
title_full	Intonation modelling for the Nguni languages
title_fullStr	Intonation modelling for the Nguni languages
title_full_unstemmed	Intonation modelling for the Nguni languages
title_short	Intonation modelling for the Nguni languages
title_sort	intonation modelling for the nguni languages
topic	Intonation corpus Intensity Intonation modelling Pitch tracking Autocorrelation Classification Tone Fundamental frequency Prosody Nguni languages UCTD
url	http://hdl.handle.net/2263/28847 http://upetd.up.ac.za/thesis/available/etd-10192007-145737/

Full Text Available

Intonation modelling for the Nguni languages

Similar Items