Full Text Available

Access Repository

Note: Clicking the button above will open the full text document at the original institutional repository in a new window.

Automatic syllabification of untranscribed speech

Thesis (MScEng)--Stellenbosch University, 2005.

Saved in:

Bibliographic Details
Main Author:	Nel, Pieter Willem
Other Authors:	Du Preez, J. A.
Format:	Thesis
Language:	en_ZA
Published:	Stellenbosch : Stellenbosch University 2012
Subjects:	Automatic speech recognition Speech processing systems Dissertations > Electronic engineering
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1867614053943214080
access_status_str	Open Access
author	Nel, Pieter Willem
author2	Du Preez, J. A.
author_browse	Du Preez, J. A. Nel, Pieter Willem
author_facet	Du Preez, J. A. Nel, Pieter Willem
author_sort	Nel, Pieter Willem
collection	Thesis
dc_rights_str_mv	Stellenbosch University
description	Thesis (MScEng)--Stellenbosch University, 2005.
format	Thesis
id	oai:scholar.sun.ac.za:10019.1/50285
institution	Stellenbosch University (South Africa)
language	en_ZA
last_indexed	2026-06-10T12:45:56.159Z
license_str	Other — see source repository
provenance_str_mv	Harvested via OAI-PMH from SUNScholar — Stellenbosch University Repository
publishDate	2012
publishDateRange	2012
publishDateSort	2012
publisher	Stellenbosch : Stellenbosch University
publisherStr	Stellenbosch : Stellenbosch University
record_format	dspace
source_str	SUNScholar — Stellenbosch University Repository
spelling	oai:scholar.sun.ac.za:10019.1/50285 Automatic syllabification of untranscribed speech Nel, Pieter Willem Du Preez, J. A. Stellenbosch University. Faculty of Engineering. Dept. of Electrical and Electronic Engineering. Automatic speech recognition Speech processing systems Dissertations -- Electronic engineering Thesis (MScEng)--Stellenbosch University, 2005. ENGLISH ABSTRACT: The syllable has been proposed as a unit of automatic speech recognition due to its strong links with human speech production and perception. Recently, it has been proved that incorporating information from syllable-length time-scales into automatic speech recognition improves results in large vocabulary recognition tasks. It was also shown to aid in various language recognition tasks and in foreign accent identification. Therefore, the ability to automatically segment speech into syllables is an important research tool. Where most previous studies employed knowledge-based methods, this study presents a purely statistical method for the automatic syllabification of speech. We introduce the concept of hierarchical hidden Markov model structures and show how these can be used to implement a purely acoustical syllable segmenter based, on general sonority theory, combined with some of the phonotactic constraints found in the English language. The accurate reporting of syllabification results is a problem in the existing literature. We present a well-defined dynamic time warping (DTW) distance measure used for reporting syllabification results. We achieve a token error rate of 20.3% with a 42ms average boundary error on a relatively large set of data. This compares well with previous knowledge-based and statistically- based methods. AFRIKAANSE OPSOMMING: Die syllabe is voorheen voorgestel as 'n basiese eenheid vir automatiese spraakherkenning weens die sterk verwantwskap wat dit het met spraak produksie en persepsie. Onlangs is dit bewys dat die gebruik van informasie van syllabe-lengte tydskale die resultate verbeter in groot woordeskat herkennings take. Dit is ook bewys dat die gebruik van syllabes automatiese taalherkenning en vreemdetaal aksent herkenning vergemaklik. Dit is daarom belangrik om vir navorsingsdoeleindes syllabes automaties te kan segmenteer. Vorige studies het kennisgebaseerde metodes gebruik om hierdie segmentasie te bewerkstellig. Hierdie studie gebruik 'n suiwer statistiese metode vir die automatiese syllabifikasie van spraak. Ons gebruik die konsep van hierargiese verskuilde Markov model strukture en wys hoe dit gebruik kan word om 'n suiwer akoestiese syllabe segmenteerder te implementeer. Die model word gebou deur dit te baseer op die teorie van sonoriteit asook die fonotaktiese beperkinge teenwoordig in die Engelse taal. Die akkurate voorstelling van syllabifikasie resultate is problematies in die bestaande literatuur. Ons definieer volledig 'n DTW (Dynamic Time Warping) afstands funksie waarmee ons ons syllabifikasie resultate weergee. Ons behaal 'n TER (Token Error Rate) van 20.3% met 'n 42ms gemiddelde grens fout op 'n relatiewe groot stel data. Dit vergelyk goed met vorige kennis-gebaseerde en statisties-gebaseerde metodes. 2012-08-27T11:33:20Z 2012-08-27T11:33:20Z 2005-03 Thesis http://hdl.handle.net/10019.1/50285 en_ZA Stellenbosch University 76 p. : ill. application/pdf Stellenbosch : Stellenbosch University
spellingShingle	Automatic speech recognition Speech processing systems Dissertations -- Electronic engineering Nel, Pieter Willem Automatic syllabification of untranscribed speech
title	Automatic syllabification of untranscribed speech
title_full	Automatic syllabification of untranscribed speech
title_fullStr	Automatic syllabification of untranscribed speech
title_full_unstemmed	Automatic syllabification of untranscribed speech
title_short	Automatic syllabification of untranscribed speech
title_sort	automatic syllabification of untranscribed speech
topic	Automatic speech recognition Speech processing systems Dissertations -- Electronic engineering
url	http://hdl.handle.net/10019.1/50285
work_keys_str_mv	AT nelpieterwillem automaticsyllabificationofuntranscribedspeech

Full Text Available

Automatic syllabification of untranscribed speech

Similar Items