Full Text Available

Note: Clicking the button above will open the full text document at the original institutional repository in a new window.

Building Speech-Driven Assessment Tools for Afrikaans and isiXhosa Children

Thesis (MEng)--Stellenbosch University, 2026.

Saved in:
Bibliographic Details
Main Author: Louw, Retief Gideon Arno
Other Authors: Kamper, H.
Format: Thesis
Language:English
Published: Stellenbosch : Stellenbosch University 2026
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1867613935137456128
access_status_str Open Access
author Louw, Retief Gideon Arno
author2 Kamper, H.
author_browse Kamper, H.
Louw, Retief Gideon Arno
author_facet Kamper, H.
Louw, Retief Gideon Arno
author_sort Louw, Retief Gideon Arno
collection Thesis
dc_rights_str_mv Stellenbosch University
description Thesis (MEng)--Stellenbosch University, 2026.
format Thesis
id oai:scholar.sun.ac.za:10019.1/136280
institution Stellenbosch University (South Africa)
language English
last_indexed 2026-06-10T12:44:02.133Z
license_str Other — see source repository
provenance_str_mv Harvested via OAI-PMH from SUNScholar — Stellenbosch University Repository
publishDate 2026
publishDateRange 2026
publishDateSort 2026
publisher Stellenbosch : Stellenbosch University
publisherStr Stellenbosch : Stellenbosch University
record_format dspace
source_str SUNScholar — Stellenbosch University Repository
spelling oai:scholar.sun.ac.za:10019.1/136280 Building Speech-Driven Assessment Tools for Afrikaans and isiXhosa Children Louw, Retief Gideon Arno Kamper, H. Strauss, Johannes Stellenbosch University. Faculty of Engineering. Dept. of Electrical and Electronic Engineering. Thesis (MEng)--Stellenbosch University, 2026. Louw, R. G. A. 2026. Building Speech-Driven Assessment Tools for Afrikaans and isiXhosa Children. Unpublished masters thesis. Stellenbosch: Stellenbosch University [online]. Available: https://scholar.sun.ac.za/items/7f7f81f1-c0dd-41f7-8440-777e61cc57d5 Illiteracy is a root cause of many socio-economic problems in South Africa. However, problems with reading often start before a child enters school. It is therefore essential to identify children with development problems early to allow for effective interventions. However, current language assessment tools are not designed for local languages, require trained administrators and lack the scalability needed for widespread screening in resource-constrained settings. We address this challenge by developing automated, speech-driven assessment tools for Afrikaans and isiXhosa. This thesis makes four contributions. First, we develop Mamela, an automatic speech recognition (ASR) system that we optimise for child speech for these lowresource languages. Second, we present an automated pipeline for language sample analysis (LSA) from oral child-adult interactions, which calculates key clinical metrics of language development. Third, we examine the multilingual assessment instrument for narratives (MAIN), a tool used by speech therapists to assess children’s development through oral narratives. We integrate Mamela into a pipeline with speaker diarisation, ASR, machine translation and a large language model (LLM) to predict the MAIN scores automatically. Fourth, we integrate the developed MAIN pipeline into a user-friendly mobile application for speech therapists and educators. For each contribution, we perform a corresponding evaluation. First, our ASR fine-tuning strategy improves the word error rates for Afrikaans by 72% and for isiXhosa by 61% relative to the baseline models. The resulting error rates, however, remain high compared to other high-resource languages. Second, the outputs of our automated LSA pipeline achieve an R2 value of 0.91 for the mean length of utterance and 0.95 for the total number of words. Third, our LLM-based classifier identifies children requiring intervention with an accuracy of over 80% for Afrikaans and 64% for isiXhosa. This performance is on par with human assessors despite using the imperfect ASR transcripts, suggesting that such end-to-end systems are viable in challenging acoustic conditions. Fourth, an expert panel of speech therapists and students evaluated the MAIN mobile application, showing that the application can potentially reduce the assessment time from 30 to 45 minutes to under one minute. Ninety-five percent of the panel agreed that the application can assist professionals in identifying children at risk of language development delay. This thesis demonstrates that integrating modern ASR and LLM technology can automate complex language assessments for low-resource languages. Our work translates this research into a practical tool, offering educators and speech therapists a scalable, accurate and low-effort method to identify at-risk children. Masters 2026-04-30T12:33:18Z 2026-04-30T12:33:18Z 2026-03 Thesis https://scholar.sun.ac.za/handle/10019.1/136280 en Stellenbosch University 98 pages application/pdf Stellenbosch : Stellenbosch University
spellingShingle Louw, Retief Gideon Arno
Building Speech-Driven Assessment Tools for Afrikaans and isiXhosa Children
title Building Speech-Driven Assessment Tools for Afrikaans and isiXhosa Children
title_full Building Speech-Driven Assessment Tools for Afrikaans and isiXhosa Children
title_fullStr Building Speech-Driven Assessment Tools for Afrikaans and isiXhosa Children
title_full_unstemmed Building Speech-Driven Assessment Tools for Afrikaans and isiXhosa Children
title_short Building Speech-Driven Assessment Tools for Afrikaans and isiXhosa Children
title_sort building speech driven assessment tools for afrikaans and isixhosa children
url https://scholar.sun.ac.za/handle/10019.1/136280
work_keys_str_mv AT louwretiefgideonarno buildingspeechdrivenassessmenttoolsforafrikaansandisixhosachildren