Full Text Available

Note: Clicking the button above will open the full text document at the original institutional repository in a new window.

Intonation modelling for the Nguni languages

Dissertation (MSc (Computer Science))--University of Pretoria, 2006.

Saved in:
Bibliographic Details
Other Authors: Barnard, E.
Format: Thesis
Published: University of Pretoria 2013
Subjects:
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1867613582780268544
access_status_str Open Access
author2 Barnard, E.
author_browse Barnard, E.
author_facet Barnard, E.
collection Thesis
dc_rights_str_mv © University of Pretor
description Dissertation (MSc (Computer Science))--University of Pretoria, 2006.
format Thesis
id oai:repository.up.ac.za:2263/28847
institution University of Pretoria (South Africa)
last_indexed 2026-06-10T12:38:26.847Z
license_str Other — see source repository
provenance_str_mv Harvested via OAI-PMH from UPSpace — University of Pretoria Institutional Repository
publishDate 2013
publishDateRange 2013
publishDateSort 2013
publisher University of Pretoria
publisherStr University of Pretoria
record_format dspace
source_str UPSpace — University of Pretoria Institutional Repository
spelling oai:repository.up.ac.za:2263/28847 Intonation modelling for the Nguni languages Barnard, E. ngovender@csir.co.za Govender, Natasha Intonation corpus Intensity Intonation modelling Pitch tracking Autocorrelation Classification Tone Fundamental frequency Prosody Nguni languages UCTD Dissertation (MSc (Computer Science))--University of Pretoria, 2006. Although the complexity of prosody is widely recognised, there is a lack of widely-accepted descriptive standards for prosodic phenomena. This situation has become particularly noticeable with the development of increasingly capable text-to-speech (TTS) systems. Such systems require detailed prosodic models to sound natural. For the languages of Southern Africa, the deficiencies in our modelling capabilities are acute. Little work of a quantitative nature has been published for the languages of the Nguni family (such as isiZulu and isiXhosa), and there are significant contradictions and imprecisions in the literature on this topic. We have therefore embarked on a programme aimed at understanding the relationship between linguistic and physical variables of a prosodic nature in this family of languages. We then use the information/knowledge gathered to build intonation models for isiZulu and isiXhosa as representatives of the Nguni languages. Firstly, we need to extract physical measurements from the voice recordings of the Nguni family of languages. A number of pitch tracking algorithms have been developed; however, to our knowledge, these algorithms have not been evaluated formally on a Nguni language. In order to decide on an appropriate algorithm for further analysis, evaluations have been performed on two stateof- the-art algorithms namely the Praat pitch tracker and Yin (developed by Alain de Cheveingn´e). Praat’s pitch tracker algorithm performs somewhat better than Yin in terms of gross and fine errors and we use this algorithm for the rest of our analysis.<./p> For South African languages the task of building an intonation model is complicated by the lack of intonation resources available. We describe the methodology used for developing a generalpurpose intonation corpus and the various methods implemented to extract relevant features such as fundamental frequency, intensity and duration from the spoken utterances of these languages. In order to understand how the ‘expected’ intonation relates to the actual measured characteristics extracted, we developed two different statistical approaches to build intonation models for isiZulu and isiXhosa. The first is based on straightforward statistical techniques and the second uses a classifier. Both intonation models built produce fairly good accuracy for our isiZulu and isiXhosa sets of data. The neural network classifier used produces slightly better results for both sets of data than the statistical method. The classification model is also more robust and can easily learn from the training data. We show that it is possible to build fairly good intonation models for these languages using different approaches, and that intensity and fundamental frequency are comparable in predictive value for the ascribed tone. Computer Science MSc unrestricted 2013-09-07T14:22:12Z 2007-11-08 2013-09-07T14:22:12Z 2007-04-25 2006 2007-10-19 Dissertation Govender, N 2006, Intonation modelling for the Nguni languages, MSc Dissertation, University of Pretoria, Pretoria, viewed yymmdd <http://hdl.handle.net/2263/28847> Pretoria http://hdl.handle.net/2263/28847 http://upetd.up.ac.za/thesis/available/etd-10192007-145737/ © University of Pretor application/pdf University of Pretoria
spellingShingle Intonation corpus
Intensity
Intonation modelling
Pitch tracking
Autocorrelation
Classification
Tone
Fundamental frequency
Prosody
Nguni languages
UCTD
Intonation modelling for the Nguni languages
title Intonation modelling for the Nguni languages
title_full Intonation modelling for the Nguni languages
title_fullStr Intonation modelling for the Nguni languages
title_full_unstemmed Intonation modelling for the Nguni languages
title_short Intonation modelling for the Nguni languages
title_sort intonation modelling for the nguni languages
topic Intonation corpus
Intensity
Intonation modelling
Pitch tracking
Autocorrelation
Classification
Tone
Fundamental frequency
Prosody
Nguni languages
UCTD
url http://hdl.handle.net/2263/28847
http://upetd.up.ac.za/thesis/available/etd-10192007-145737/