Full Text Available

Note: Clicking the button above will open the full text document at the original institutional repository in a new window.

Discriminative and Bayesian techniques for hidden Markov model speech recognition systems

Thesis (PhD (Electronic Engineering))--University of Pretoria, 2006.

Saved in:
Bibliographic Details
Other Authors: Botha, Elizabeth C.
Format: Thesis
Published: University of Pretoria 2013
Subjects:
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1867613693820272640
access_status_str Open Access
author2 Botha, Elizabeth C.
author_browse Botha, Elizabeth C.
author_facet Botha, Elizabeth C.
collection Thesis
dc_rights_str_mv © 2001, University of Pretoria. All rights reserved. The copyright in this work vests in the University of Pretoria. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of the University of Pretoria.
description Thesis (PhD (Electronic Engineering))--University of Pretoria, 2006.
format Thesis
id oai:repository.up.ac.za:2263/29158
institution University of Pretoria (South Africa)
last_indexed 2026-06-10T12:40:12.572Z
license_str Other — see source repository
provenance_str_mv Harvested via OAI-PMH from UPSpace — University of Pretoria Institutional Repository
publishDate 2013
publishDateRange 2013
publishDateSort 2013
publisher University of Pretoria
publisherStr University of Pretoria
record_format dspace
source_str UPSpace — University of Pretoria Institutional Repository
spelling oai:repository.up.ac.za:2263/29158 Discriminative and Bayesian techniques for hidden Markov model speech recognition systems Botha, Elizabeth C. upetd@up.ac.za Purnell, Darryl William Automatic speech recognition Bayesian adaptation Hidden markov model training UCTD Thesis (PhD (Electronic Engineering))--University of Pretoria, 2006. The collection of large speech databases is not a trivial task (if done properly). It is not always possible to collect, segment and annotate large databases for every task or language. It is also often the case that there are imbalances in the databases, as a result of little data being available for a specific subset of individuals. An example of one such imbalance is the fact that there are often more male speakers than female speakers (or vice-versa). If there are, for example, far fewer female speakers than male speakers, then the recognizers will tend to work poorly for female speakers (as compared to performance for male speakers). This thesis focuses on using Bayesian and discriminative training algorithms to improve continuous speech recognition systems in scenarios where there is a limited amount of training data available. The research reported in this thesis can be divided into three categories: • Overspecialization is characterized by good recognition performance for the data used during training, but poor recognition performance for independent testing data. This is a problem when too little data is available for training purposes. Methods of reducing overspecialization in the minimum classification error algo¬rithm are therefore investigated. • Development of new Bayesian and discriminative adaptation/training techniques that can be used in situations where there is a small amount of data available. One example here is the situation where an imbalance in terms of numbers of male and female speakers exists and these techniques can be used to improve recognition performance for female speakers, while not decreasing recognition performance for the male speakers. • Bayesian learning, where Bayesian training is used to improve recognition perfor¬mance in situations where one can only use the limited training data available. These methods are extremely computationally expensive, but are justified by the improved recognition rates for certain tasks. This is, to the author's knowledge, the first time that Bayesian learning using Markov chain Monte Carlo methods have been used in hidden Markov model speech recognition. The algorithms proposed and reviewed are tested using three different datasets (TIMIT, TIDIGITS and SUNSpeech), with the tasks being connected digit recognition and con¬tinuous speech recognition. Results indicate that the proposed algorithms improve recognition performance significantly for situations where little training data is avail¬able. Electrical, Electronic and Computer Engineering unrestricted 2013-09-07T15:00:35Z 2005-11-01 2013-09-07T15:00:35Z 2001-04-01 2006-11-01 2005-10-31 Thesis Purnell, DW 2001, Discriminative and Bayesian techniques for hidden Markov model speech recognition systems, PhD thesis, University of Pretoria, Pretoria, viewed yymmdd < http://hdl.handle.net/2263/29158 > H543/ag http://hdl.handle.net/2263/29158 http://upetd.up.ac.za/thesis/available/etd-10312005-142207/ © 2001, University of Pretoria. All rights reserved. The copyright in this work vests in the University of Pretoria. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of the University of Pretoria. application/pdf application/pdf application/pdf application/pdf application/pdf University of Pretoria
spellingShingle Automatic speech recognition
Bayesian adaptation
Hidden markov model training
UCTD
Discriminative and Bayesian techniques for hidden Markov model speech recognition systems
title Discriminative and Bayesian techniques for hidden Markov model speech recognition systems
title_full Discriminative and Bayesian techniques for hidden Markov model speech recognition systems
title_fullStr Discriminative and Bayesian techniques for hidden Markov model speech recognition systems
title_full_unstemmed Discriminative and Bayesian techniques for hidden Markov model speech recognition systems
title_short Discriminative and Bayesian techniques for hidden Markov model speech recognition systems
title_sort discriminative and bayesian techniques for hidden markov model speech recognition systems
topic Automatic speech recognition
Bayesian adaptation
Hidden markov model training
UCTD
url http://hdl.handle.net/2263/29158
http://upetd.up.ac.za/thesis/available/etd-10312005-142207/