Full Text Available

Access Repository

Note: Clicking the button above will open the full text document at the original institutional repository in a new window.

Fast accurate diphone-based phoneme recognition

Thesis (MScEng (Electrical and Electronic Engineering))--University of Stellenbosch, 2009.

Saved in:

Bibliographic Details
Main Author:	Du Preez, Marianne
Other Authors:	Du Preez, J. A.
Format:	Thesis
Language:	English
Published:	Stellenbosch : University of Stellenbosch 2009
Subjects:	Phoneme recognition Diphones Acoustic speech modelling Theses > Electrical and electronic engineering Dissertations > Electrical and electronic engineering Automatic speech recognition Electrical and Electronic Engineering
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1867614026290167808
access_status_str	Open Access
author	Du Preez, Marianne
author2	Du Preez, J. A.
author_browse	Du Preez, J. A. Du Preez, Marianne
author_facet	Du Preez, J. A. Du Preez, Marianne
author_sort	Du Preez, Marianne
collection	Thesis
dc_rights_str_mv	University of Stellenbosch
description	Thesis (MScEng (Electrical and Electronic Engineering))--University of Stellenbosch, 2009.
format	Thesis
id	oai:scholar.sun.ac.za:10019.1/1779
institution	Stellenbosch University (South Africa)
language	English
last_indexed	2026-06-10T12:45:29.584Z
license_str	Other — see source repository
provenance_str_mv	Harvested via OAI-PMH from SUNScholar — Stellenbosch University Repository
publishDate	2009
publishDateRange	2009
publishDateSort	2009
publisher	Stellenbosch : University of Stellenbosch
publisherStr	Stellenbosch : University of Stellenbosch
record_format	dspace
source_str	SUNScholar — Stellenbosch University Repository
spelling	oai:scholar.sun.ac.za:10019.1/1779 Fast accurate diphone-based phoneme recognition Du Preez, Marianne Du Preez, J. A. Engelbrecht, H. A. University of Stellenbosch. Faculty of Engineering. Dept. of Electrical and Electronic Engineering. Phoneme recognition Diphones Acoustic speech modelling Theses -- Electrical and electronic engineering Dissertations -- Electrical and electronic engineering Automatic speech recognition Electrical and Electronic Engineering Thesis (MScEng (Electrical and Electronic Engineering))--University of Stellenbosch, 2009. Statistical speech recognition systems typically utilise a set of statistical models of subword units based on the set of phonemes in a target language. However, in continuous speech it is important to consider co-articulation e ects and the interactions between neighbouring sounds, as over-generalisation of the phonetic models can negatively a ect system accuracy. Traditionally co-articulation in continuous speech is handled by incorporating contextual information into the subword model by means of context-dependent models, which exponentially increase the number of subword models. In contrast, transitional models aim to handle co-articulation by modelling the interphone dynamics found in the transitions between phonemes. This research aimed to perform an objective analysis of diphones as subword units for use in hidden Markov model-based continuous-speech recognition systems, with special emphasis on a direct comparison to a context-dependent biphone-based system in terms of complexity, accuracy and computational e ciency in similar parametric conditions. To simulate practical conditions, the experiments were designed to evaluate these systems in a low resource environment { limited supply of training data, computing power and system memory { while still attempting fast, accurate phoneme recognition. Adaptation techniques designed to exploit characteristics inherent in diphones, as well as techniques used for e ective parameter estimation and state-level tying were used to reduce resource requirements while simultaneously increasing parameter reliability. These techniques include diphthong splitting, utilisation of a basic diphone grammar, diphone set completion, maximum a posteriori estimation and decision-tree based state clustering algorithms. The experiments were designed to evaluate the contribution of each adaptation technique individually and subsequently compare the optimised diphone-based recognition system to a biphone-based recognition system that received similar treatment. Results showed that diphone-based recognition systems perform better than both traditional phoneme-based systems and context-dependent biphone-based systems when evaluated in similar parametric conditions. Therefore, diphones are e ective subword units, which carry suprasegmental knowledge of speech signals and provide an excellent compromise between detailed co-articulation modelling and acceptable system performance 2009-03-02T15:44:42Z 2010-06-01T08:33:03Z 2009-03-02T15:44:42Z 2010-06-01T08:33:03Z 2009-03 Thesis http://hdl.handle.net/10019.1/1779 en University of Stellenbosch application/pdf Stellenbosch : University of Stellenbosch
spellingShingle	Phoneme recognition Diphones Acoustic speech modelling Theses -- Electrical and electronic engineering Dissertations -- Electrical and electronic engineering Automatic speech recognition Electrical and Electronic Engineering Du Preez, Marianne Fast accurate diphone-based phoneme recognition
title	Fast accurate diphone-based phoneme recognition
title_full	Fast accurate diphone-based phoneme recognition
title_fullStr	Fast accurate diphone-based phoneme recognition
title_full_unstemmed	Fast accurate diphone-based phoneme recognition
title_short	Fast accurate diphone-based phoneme recognition
title_sort	fast accurate diphone based phoneme recognition
topic	Phoneme recognition Diphones Acoustic speech modelling Theses -- Electrical and electronic engineering Dissertations -- Electrical and electronic engineering Automatic speech recognition Electrical and Electronic Engineering
url	http://hdl.handle.net/10019.1/1779
work_keys_str_mv	AT dupreezmarianne fastaccuratediphonebasedphonemerecognition

Full Text Available

Fast accurate diphone-based phoneme recognition

Similar Items