Full Text Available

Note: Clicking the button above will open the full text document at the original institutional repository in a new window.

Wavelet-based speech enhancement : a statistical approach

Thesis (MScIng)--University of Stellenbosch, 2004.

Saved in:
Bibliographic Details
Main Author: Harmse, Wynand
Other Authors: Schwardt, L.
Format: Thesis
Language:en_ZA
Published: Stellenbosch : University of Stellenbosch 2011
Subjects:
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1867613736925134848
access_status_str Open Access
author Harmse, Wynand
author2 Schwardt, L.
author_browse Harmse, Wynand
Schwardt, L.
author_facet Schwardt, L.
Harmse, Wynand
author_sort Harmse, Wynand
collection Thesis
dc_rights_str_mv University of Stellenbosch
description Thesis (MScIng)--University of Stellenbosch, 2004.
format Thesis
id oai:scholar.sun.ac.za:10019.1/16336
institution Stellenbosch University (South Africa)
language en_ZA
last_indexed 2026-06-10T12:40:53.839Z
license_str Other — see source repository
provenance_str_mv Harvested via OAI-PMH from SUNScholar — Stellenbosch University Repository
publishDate 2011
publishDateRange 2011
publishDateSort 2011
publisher Stellenbosch : University of Stellenbosch
publisherStr Stellenbosch : University of Stellenbosch
record_format dspace
source_str SUNScholar — Stellenbosch University Repository
spelling oai:scholar.sun.ac.za:10019.1/16336 Wavelet-based speech enhancement : a statistical approach Harmse, Wynand Schwardt, L. University of Stellenbosch. Faculty of Engineering. Dept. of Electric and Electronic Engineering. Speech synthesis Speech processing systems Theses -- Electronic engineering Dissertations -- Electronic engineering Thesis (MScIng)--University of Stellenbosch, 2004. ENGLISH ABSTRACT: Speech enhancement is the process of removing background noise from speech signals. The equivalent process for images is known as image denoising. While the Fourier transform is widely used for speech enhancement, image denoising typically uses the wavelet transform. Research on wavelet-based speech enhancement has only recently emerged, yet it shows promising results compared to Fourier-based methods. This research is enhanced by the availability of new wavelet denoising algorithms based on the statistical modelling of wavelet coefficients, such as the hidden Markov tree. The aim of this research project is to investigate wavelet-based speech enhancement from a statistical perspective. Current Fourier-based speech enhancement and its evaluation process are described, and a framework is created for wavelet-based speech enhancement. Several wavelet denoising algorithms are investigated, and it is found that the algorithms based on the statistical properties of speech in the wavelet domain outperform the classical and more heuristic denoising techniques. The choice of wavelet influences the quality of the enhanced speech and the effect of this choice is therefore examined. The introduction of a noise floor parameter also improves the perceptual quality of the wavelet-based enhanced speech, by masking annoying residual artifacts. The performance of wavelet-based speech enhancement is similar to that of the more widely used Fourier methods at low noise levels, with a slight difference in the residual artifact. At high noise levels, however, the Fourier methods are superior. AFRIKAANSE OPSOMMING: Spraaksuiwering is die proses waardeur agtergrondgeraas uit spraakseine verwyder word. Die ekwivalente proses vir beelde word beeldsuiwering genoem. Terwyl spraaksuiwering in die algemeen in die Fourier-domein gedoen word, gebruik beeldsuiwering tipies die golfietransform. Navorsing oor golfie-gebaseerde spraaksuiwering het eers onlangs verskyn, en dit toon reeds belowende resultate in vergelyking met Fourier-gebaseerde metodes. Hierdie navorsingsveld word aangehelp deur die beskikbaarheid van nuwe golfie-gebaseerde suiweringstegnieke wat die golfie-ko¨effisi¨ente statisties modelleer, soos die verskuilde Markovboom. Die doel van hierdie navorsingsprojek is om golfie-gebaseerde spraaksuiwering vanuit ‘n statistiese oogpunt te bestudeer. Huidige Fourier-gebaseerde spraaksuiweringsmetodes asook die evalueringsproses vir sulke algoritmes word bespreek, en ‘n raamwerk word geskep vir golfie-gebaseerde spraaksuiwering. Verskeie golfie-gebaseerde algoritmes word ondersoek, en daar word gevind dat die metodes wat die statistiese eienskappe van spraak in die golfie-gebied gebruik, beter vaar as die klassieke en meer heuristiese metodes. Die keuse van golfie be¨ınvloed die kwaliteit van die gesuiwerde spraak, en die effek van hierdie keuse word dus ondersoek. Die gebruik van ‘n ruisvloer parameter verhoog ook die kwaliteit van die golfie-gesuiwerde spraak, deur steurende residuele artifakte te verberg. Die golfie-metodes vaar omtrent dieselfde as die klassieke Fourier-metodes by lae ruisvlakke, met ’n klein verskil in residuele artifakte. By ho¨e ruisvlakke vaar die Fouriermetodes egter steeds beter. 2011-09-01T12:31:33Z 2011-09-01T12:31:33Z 2004-12 Thesis http://hdl.handle.net/10019.1/16336 en_ZA University of Stellenbosch xxii, 162 leaves : ill. application/pdf Stellenbosch : University of Stellenbosch
spellingShingle Speech synthesis
Speech processing systems
Theses -- Electronic engineering
Dissertations -- Electronic engineering
Harmse, Wynand
Wavelet-based speech enhancement : a statistical approach
title Wavelet-based speech enhancement : a statistical approach
title_full Wavelet-based speech enhancement : a statistical approach
title_fullStr Wavelet-based speech enhancement : a statistical approach
title_full_unstemmed Wavelet-based speech enhancement : a statistical approach
title_short Wavelet-based speech enhancement : a statistical approach
title_sort wavelet based speech enhancement a statistical approach
topic Speech synthesis
Speech processing systems
Theses -- Electronic engineering
Dissertations -- Electronic engineering
url http://hdl.handle.net/10019.1/16336
work_keys_str_mv AT harmsewynand waveletbasedspeechenhancementastatisticalapproach