Full Text Available

Note: Clicking the button above will open the full text document at the original institutional repository in a new window.

Investigation of the impact of high frequency transmitted speech on speaker recognition

Thesis (MScEng)--Stellenbosch University, 2002.

Saved in:
Bibliographic Details
Main Author: Pool, Jan
Other Authors: Du Preez, J. A.
Format: Thesis
Language:en_ZA
Published: Stellenbosch : Stellenbosch University 2012
Subjects:
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1867613992874147840
access_status_str Open Access
author Pool, Jan
author2 Du Preez, J. A.
author_browse Du Preez, J. A.
Pool, Jan
author_facet Du Preez, J. A.
Pool, Jan
author_sort Pool, Jan
collection Thesis
dc_rights_str_mv Stellenbosch University
description Thesis (MScEng)--Stellenbosch University, 2002.
format Thesis
id oai:scholar.sun.ac.za:10019.1/52895
institution Stellenbosch University (South Africa)
language en_ZA
last_indexed 2026-06-10T12:44:57.544Z
license_str Other — see source repository
provenance_str_mv Harvested via OAI-PMH from SUNScholar — Stellenbosch University Repository
publishDate 2012
publishDateRange 2012
publishDateSort 2012
publisher Stellenbosch : Stellenbosch University
publisherStr Stellenbosch : Stellenbosch University
record_format dspace
source_str SUNScholar — Stellenbosch University Repository
spelling oai:scholar.sun.ac.za:10019.1/52895 Investigation of the impact of high frequency transmitted speech on speaker recognition Pool, Jan Du Preez, J. A. Lourens, J. G. Stellenbosch University. Faculty of Engineering. Dept. of Electrical and Electronic Engineering. Automatic speech recognition Speech processing systems Markov processes Dissertations -- Electronic engineering Thesis (MScEng)--Stellenbosch University, 2002. Some digitised pages may appear illegible due to the condition of the original hard copy. ENGLISH ABSTRACT: Speaker recognition systems have evolved to a point where near perfect performance can be obtained under ideal conditions, even if the system must distinguish between a large number of speakers. Under adverse conditions, such as when high noise levels are present or when the transmission channel deforms the speech, the performance is often less than satisfying. This project investigated the performance of a popular speaker recognition system, that use Gaussian mixture models, on speech transmitted over a high frequency channel. Initial experiments demonstrated very unsatisfactory results for the base line system. We investigated a number of robust techniques. We implemented and applied some of them in an attempt to improve the performance of the speaker recognition systems. The techniques we tested showed only slight improvements. We also investigates the effects of a high frequency channel and single sideband modulation on the speech features of speech processing systems. The effects that can deform the features, and therefore reduce the performance of speech systems, were identified. One of the effects that can greatly affect the performance of a speech processing system is noise. We investigated some speech enhancement techniques and as a result we developed a new statistical based speech enhancement technique that employs hidden Markov models to represent the clean speech process. AFRIKAANSE OPSOMMING: Sprekerherkenning-stelsels het 'n punt bereik waar nabyaan perfekte resultate verwag kan word onder ideale kondisies, selfs al moet die stelsel tussen 'n groot aantal sprekers onderskei. Wanneer nie-ideale kondisies, soos byvoorbeeld hoë ruisvlakke of 'n transmissie kanaal wat die spraak vervorm, teenwoordig is, is die resultate gewoonlik nie bevredigend nie. Die projek ondersoek die werksverrigting van 'n gewilde sprekerherkenning-stelsel, wat gebruik maak van Gaussiese mengselmodelle, op spraak wat oor 'n hoë frekwensie transmissie kanaal gestuur is. Aanvanklike eksperimente wat gebruik maak van 'n basiese stelsel het nie goeie resultate opgelewer nie. Ons het 'n aantal robuuste tegnieke ondersoek en 'n paar van hulle geïmplementeer en getoets in 'n poging om die resultate van die sprekerherkenning-stelsel te verbeter. Die tegnieke wat ons getoets het, het net geringe verbetering getoon. Die studie het ook die effekte wat die hoë-frekwensie kanaal en enkel-syband modulasie op spraak kenmerkvektore, ondersoek. Die effekte wat die spraak kenmerkvektore kan vervorm en dus die werkverrigting van spraak stelsels kan verlaag, is geïdentifiseer. Een van die effekte wat 'n groot invloed op die werkverrigting van spraakstelsels het, is ruis. Ons het spraak verbeterings metodes ondersoek en dit het gelei tot die ontwikkeling van 'n statisties gebaseerde spraak verbeteringstegniek wat gebruik maak van verskuilde Markov modelle om die skoon spraakproses voor te stel. 2012-08-27T11:35:12Z 2012-08-27T11:35:12Z 2002-04 Thesis http://hdl.handle.net/10019.1/52895 en_ZA Stellenbosch University 165 p. : ill. application/pdf Stellenbosch : Stellenbosch University
spellingShingle Automatic speech recognition
Speech processing systems
Markov processes
Dissertations -- Electronic engineering
Pool, Jan
Investigation of the impact of high frequency transmitted speech on speaker recognition
title Investigation of the impact of high frequency transmitted speech on speaker recognition
title_full Investigation of the impact of high frequency transmitted speech on speaker recognition
title_fullStr Investigation of the impact of high frequency transmitted speech on speaker recognition
title_full_unstemmed Investigation of the impact of high frequency transmitted speech on speaker recognition
title_short Investigation of the impact of high frequency transmitted speech on speaker recognition
title_sort investigation of the impact of high frequency transmitted speech on speaker recognition
topic Automatic speech recognition
Speech processing systems
Markov processes
Dissertations -- Electronic engineering
url http://hdl.handle.net/10019.1/52895
work_keys_str_mv AT pooljan investigationoftheimpactofhighfrequencytransmittedspeechonspeakerrecognition