Full Text Available

Note: Clicking the button above will open the full text document at the original institutional repository in a new window.

Automatic assessment and feature-based characterisation of oral narratives for Afrikaans and isiXhosa children

Thesis (MEng)--Stellenbosch University, 2026.

Saved in:
Bibliographic Details
Main Author: Sharratt, Emma Lori
Other Authors: Kamper, Herman
Format: Thesis
Language:English
Published: Stellenbosch : Stellenbosch University 2026
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1867613776385146880
access_status_str Open Access
author Sharratt, Emma Lori
author2 Kamper, Herman
author_browse Kamper, Herman
Sharratt, Emma Lori
author_facet Kamper, Herman
Sharratt, Emma Lori
author_sort Sharratt, Emma Lori
collection Thesis
dc_rights_str_mv Stellenbosch University
description Thesis (MEng)--Stellenbosch University, 2026.
format Thesis
id oai:scholar.sun.ac.za:10019.1/135854
institution Stellenbosch University (South Africa)
language English
last_indexed 2026-06-10T12:41:31.332Z
license_str Other — see source repository
provenance_str_mv Harvested via OAI-PMH from SUNScholar — Stellenbosch University Repository
publishDate 2026
publishDateRange 2026
publishDateSort 2026
publisher Stellenbosch : Stellenbosch University
publisherStr Stellenbosch : Stellenbosch University
record_format dspace
source_str SUNScholar — Stellenbosch University Repository
spelling oai:scholar.sun.ac.za:10019.1/135854 Automatic assessment and feature-based characterisation of oral narratives for Afrikaans and isiXhosa children Sharratt, Emma Lori Kamper, Herman Stellenbosch University. Faculty of Engineering. Dept. of Electrical and Electronic Engineering. Thesis (MEng)--Stellenbosch University, 2026. Sharratt, E. L. 2026. Automatic assessment and feature-based characterisation of oral narratives for Afrikaans and isiXhosa children. Unpublished masters thesis. Stellenbosch: Stellenbosch University [online]. Available: https://scholar.sun.ac.za/items/1d249289-3e1a-4d3f-a928-a181faaaf004 Developing narrative and comprehension skills in early childhood is critical for later literacy and academic success. Yet, less than 20% of South African 10-year-olds can read for meaning, highlighting an urgent need for early intervention. Accurate assessment is essential for identifying children who need support, but current methods rely on human judgement, which is resource-intensive and susceptible to inconsistencies. To address this, we design an automatic scoring system for low-income preschools, where large class sizes make it challenging for teachers to identify learners requiring assistance. Our research focuses on three core goals. First, we develop an automatic scoring model prioritising predictive accuracy. Our system employs automatic speech recognition (ASR), followed by linear and logistic regression models to predict language proficiency. To reduce the feature space, we apply an L1-regularised model to the vectorised text. Linear models are chosen for their simplicity and interpretability. We experiment with different linguistic units and scaling methods, finding that word-level tokens and raw feature values yield the best performance. We then compare our final linear model with a large language model (LLM) developed in parallel work. Although the LLM achieves higher accuracy in most cases, the linear model performs competitively given its simplicity. Second, beyond predictive accuracy, we examine which features the logistic model relies on most. Assuming perfect ASR transcriptions,we conduct quantitative analyses using permutation feature importance and exploratory data analysis, focusing on proficiency features, part-of speech counts and specific keywords. We also consult speech therapists to qualitatively interpret the relevance of these features relative to established measures of narrative development. Key indicators include lexical diversity and productivity, while speech production features such as articulation rate show unexpectedly limited predictive value. Although relevant keywords and phrases vary by language, certain verbs, auxiliaries and nouns linked to core story elements consistently indicate stronger narrative skills in Afrikaans and isiXhosa. Third, in collaboration with Retief Louw, we deploy and evaluate our system through a mobile application, Auto MAIN. We collect feedback from 20 adult participants, including qualified speech-language and hearing therapists and speech therapy students. While further refinements are required, 95% of participants agree that the app has the potential to support professionals in identifying children at risk of language delays. Together, these findings demonstrate the potential of integrating ASR and predictive machine learning models to assist teachers in the early detection of developmental delays. We hope this work provides a foundation for automatic oral narrative assessment tools in low-resource languages, informing targeted interventions in foundational phase classrooms. Masters 2026-04-13T12:27:26Z 2026-04-13T12:27:26Z 2026-03 Thesis https://scholar.sun.ac.za/handle/10019.1/135854 en Stellenbosch University 142 pages application/pdf Stellenbosch : Stellenbosch University
spellingShingle Sharratt, Emma Lori
Automatic assessment and feature-based characterisation of oral narratives for Afrikaans and isiXhosa children
title Automatic assessment and feature-based characterisation of oral narratives for Afrikaans and isiXhosa children
title_full Automatic assessment and feature-based characterisation of oral narratives for Afrikaans and isiXhosa children
title_fullStr Automatic assessment and feature-based characterisation of oral narratives for Afrikaans and isiXhosa children
title_full_unstemmed Automatic assessment and feature-based characterisation of oral narratives for Afrikaans and isiXhosa children
title_short Automatic assessment and feature-based characterisation of oral narratives for Afrikaans and isiXhosa children
title_sort automatic assessment and feature based characterisation of oral narratives for afrikaans and isixhosa children
url https://scholar.sun.ac.za/handle/10019.1/135854
work_keys_str_mv AT sharrattemmalori automaticassessmentandfeaturebasedcharacterisationoforalnarrativesforafrikaansandisixhosachildren