Full Text Available

Access Repository

Note: Clicking the button above will open the full text document at the original institutional repository in a new window.

Automatic phoneme recognition of South African English

Thesis (MEng)--University of Stellenbosch, 2004.

Saved in:

Bibliographic Details
Main Author:	Engelbrecht, Herman Arnold
Other Authors:	Du Preez, J. A.
Format:	Thesis
Language:	en_ZA
Published:	Stellenbosch : Stellenbosch University 2012
Subjects:	Automatic speech recognition Speech processing systems Natural language processing (Computer science) Computational linguistics Dissertations > Electrical and electronic engineering Theses > Electrical and electronic engineering
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1867613838696775680
access_status_str	Open Access
author	Engelbrecht, Herman Arnold
author2	Du Preez, J. A.
author_browse	Du Preez, J. A. Engelbrecht, Herman Arnold
author_facet	Du Preez, J. A. Engelbrecht, Herman Arnold
author_sort	Engelbrecht, Herman Arnold
collection	Thesis
dc_rights_str_mv	Stellenbosch University
description	Thesis (MEng)--University of Stellenbosch, 2004.
format	Thesis
id	oai:scholar.sun.ac.za:10019.1/49867
institution	Stellenbosch University (South Africa)
language	en_ZA
last_indexed	2026-06-10T12:42:30.205Z
license_str	Other — see source repository
provenance_str_mv	Harvested via OAI-PMH from SUNScholar — Stellenbosch University Repository
publishDate	2012
publishDateRange	2012
publishDateSort	2012
publisher	Stellenbosch : Stellenbosch University
publisherStr	Stellenbosch : Stellenbosch University
record_format	dspace
source_str	SUNScholar — Stellenbosch University Repository
spelling	oai:scholar.sun.ac.za:10019.1/49867 Automatic phoneme recognition of South African English Engelbrecht, Herman Arnold Du Preez, J. A. Stellenbosch University. Faculty of Engineering. Dept. of Electrical and Electronic Engineering. Automatic speech recognition Speech processing systems Natural language processing (Computer science) Computational linguistics Dissertations -- Electrical and electronic engineering Theses -- Electrical and electronic engineering Thesis (MEng)--University of Stellenbosch, 2004. ENGLISH ABSTRACT: Automatic speech recognition applications have been developed for many languages in other countries but not much research has been conducted on developing Human Language Technology (HLT) for S.A. languages. Research has been performed on informally gathered speech data but until now a speech corpus that could be used to develop HLT for S.A. languages did not exist. With the development of the African Speech Technology Speech Corpora, it has now become possible to develop commercial applications of HLT. The two main objectives of this work are the accurate modelling of phonemes, suitable for the purposes of LVCSR, and the evaluation of the untried S.A. English speech corpus. Three different aspects of phoneme modelling was investigated by performing isolated phoneme recognition on the NTIMIT speech corpus. The three aspects were signal processing, statistical modelling of HMM state distributions and context-dependent phoneme modelling. Research has shown that the use of phonetic context when modelling phonemes forms an integral part of most modern LVCSR systems. To facilitate the context-dependent phoneme modelling, a method of constructing robust and accurate models using decision tree-based state clustering techniques is described. The strength of this method is the ability to construct accurate models of contexts that did not occur in the training data. The method incorporates linguistic knowledge about the phonetic context, in conjunction with the training data, to decide which phoneme contexts are similar and should share model parameters. As LVCSR typically consists of continuous recognition of spoken words, the contextdependent and context-independent phoneme models that were created for the isolated recognition experiments are evaluated by performing continuous phoneme recognition. The phoneme recognition experiments are performed, without the aid of a grammar or language model, on the S.A. English corpus. As the S.A. English corpus is newly created, no previous research exist to which the continuous recognition results can be compared to. Therefore, it was necessary to create comparable baseline results, by performing continuous phoneme recognition on the NTIMIT corpus. It was found that acceptable recognition accuracy was obtained on both the NTIMIT and S.A. English corpora. Furthermore, the results on S.A. English was 2 - 6% better than the results on NTIMIT, indicating that the S.A. English corpus is of a high enough quality that it can be used for the development of HLT. AFRIKAANSE OPSOMMING: Automatiese spraak-herkenning is al ontwikkel vir ander tale in ander lande maar, daar nog nie baie navorsing gedoen om menslike taal-tegnologie (HLT) te ontwikkel vir Suid- Afrikaanse tale. Daar is al navorsing gedoen op spraak wat informeel versamel is, maar tot nou toe was daar nie 'n spraak databasis wat vir die ontwikkeling van HLT vir S.A. tale. Met die ontwikkeling van die African Speech Technology Speech Corpora, het dit moontlik geword om HLT te ontwikkel vir wat geskik is vir kornmersiele doeleindes. Die twee hoofdoele van hierdie tesis is die akkurate modellering van foneme, geskik vir groot-woordeskat kontinue spraak-herkenning (LVCSR), asook die evaluasie van die S.A. Engels spraak-databasis. Drie aspekte van foneem-modellering word ondersoek deur isoleerde foneem-herkenning te doen op die NTIMIT spraak-databasis. Die drie aspekte wat ondersoek word is sein prosessering, statistiese modellering van die HMM toestands distribusies, en konteksafhanklike foneem-modellering. Navorsing het getoon dat die gebruik van fonetiese konteks 'n integrale deel vorm van meeste moderne LVCSR stelsels. Dit is dus nodig om robuuste en akkurate konteks-afhanklike modelle te kan bou. Hiervoor word 'n besluitnemingsboom- gebaseerde trosvormings tegniek beskryf. Die tegniek is ook in staat is om akkurate modelle te bou van kontekste van nie voorgekom het in die afrigdata nie. Om te besluit watter fonetiese kontekste is soortgelyk en dus model parameters moet deel, maak die tegniek gebruik van die afrigdata en inkorporeer taalkundige kennis oor die fonetiese kontekste. Omdat LVCSR tipies is oor die kontinue herkenning van woorde, word die konteksafhanklike en konteks-onafhanklike modelle, wat gebou is vir die isoleerde foneem-herkenningseksperimente, evalueer d.m.v. kontinue foneem-herkening. Die kontinue foneemherkenningseksperimente word gedoen op die S.A. Engels databasis, sonder die hulp van 'n taalmodel of grammatika. Omdat die S.A. Engels databasis nuut is, is daar nog geen ander navorsing waarteen die result ate vergelyk kan word nie. Dit is dus nodig om kontinue foneem-herkennings result ate op die NTIMIT databasis te genereer, waarteen die S.A. Engels resulte vergelyk kan word. Die resulate dui op aanvaarbare foneem her kenning op beide die NTIMIT en S.A. Engels databassise. Die resultate op S.A. Engels is selfs 2 - 6% beter as die resultate op NTIMIT, wat daarop dui dat die S.A. Engels spraak-databasis geskik is vir die ontwikkeling van HLT. 2012-08-27T11:33:08Z 2012-08-27T11:33:08Z 2004-03 Thesis http://hdl.handle.net/10019.1/49867 en_ZA Stellenbosch University 166 leaves : ill. application/pdf Stellenbosch : Stellenbosch University
spellingShingle	Automatic speech recognition Speech processing systems Natural language processing (Computer science) Computational linguistics Dissertations -- Electrical and electronic engineering Theses -- Electrical and electronic engineering Engelbrecht, Herman Arnold Automatic phoneme recognition of South African English
title	Automatic phoneme recognition of South African English
title_full	Automatic phoneme recognition of South African English
title_fullStr	Automatic phoneme recognition of South African English
title_full_unstemmed	Automatic phoneme recognition of South African English
title_short	Automatic phoneme recognition of South African English
title_sort	automatic phoneme recognition of south african english
topic	Automatic speech recognition Speech processing systems Natural language processing (Computer science) Computational linguistics Dissertations -- Electrical and electronic engineering Theses -- Electrical and electronic engineering
url	http://hdl.handle.net/10019.1/49867
work_keys_str_mv	AT engelbrechthermanarnold automaticphonemerecognitionofsouthafricanenglish

Full Text Available

Automatic phoneme recognition of South African English

Similar Items