Full Text Available

Note: Clicking the button above will open the full text document at the original institutional repository in a new window.

Corpus design for Setswana lexicography

Thesis (PhD (African Languages))--University of Pretoria, 2008.

Saved in:
Bibliographic Details
Other Authors: Kilgarriff, Adam
Format: Thesis
Published: University of Pretoria 2013
Subjects:
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1867613458409717760
access_status_str Open Access
author2 Kilgarriff, Adam
author_browse Kilgarriff, Adam
author_facet Kilgarriff, Adam
collection Thesis
dc_rights_str_mv © University of Pretoria 20
description Thesis (PhD (African Languages))--University of Pretoria, 2008.
format Thesis
id oai:repository.up.ac.za:2263/25972
institution University of Pretoria (South Africa)
last_indexed 2026-06-10T12:36:28.181Z
license_str Other — see source repository
provenance_str_mv Harvested via OAI-PMH from UPSpace — University of Pretoria Institutional Repository
publishDate 2013
publishDateRange 2013
publishDateSort 2013
publisher University of Pretoria
publisherStr University of Pretoria
record_format dspace
source_str UPSpace — University of Pretoria Institutional Repository
spelling oai:repository.up.ac.za:2263/25972 Corpus design for Setswana lexicography Kilgarriff, Adam Prinsloo, Danie J. (Daniel Jacobus), 1953- otlogets@mopipi.ub.bw Otlogetswe, Thapelo Joseph Lexicography Setswana Corpus design UCTD Thesis (PhD (African Languages))--University of Pretoria, 2008. This PhD thesis is about the design of a Setswana corpus for lexicography. While various corpora have been compiled and a variety of corpora-based researches attempted in African languages, no effort has been made towards corpus design. Additionally, although extensive analysis of the Setswana language has been done by missionaries, grammarians and linguists since the 1800s, none of such research is in corpus design. Most research has been largely on the grammatical study of the language. The recent corpora research in African languages in general has been on the use of corpora for the compilation of dictionaries and little of it is in corpus design. Pioneers of this kind of corpora research in African languages are Prinsloo and De Schryver (1999), De Schryver and Prisloo (2000 and 2001) and Gouws and Prisloo (2005). Because of a lack of research in corpora design particularly in African languages, this thesis is an attempt at filling that gap, especially for Setswana. It is hoped that the finding of this study will inspire similar designs in other languages comparable to Setswana. We explore corpus design by focusing on measuring a variety of text types for lexical richness at comparable token points. The study explores the question of whether a corpus compiled for lexicography must comprise a variety of texts drawn from different text types or whether the quality of retrieved information for lexicographic purposes from a corpus comprising diverse text varieties could be equally extracted from a corpus with a single text type. This study therefore determines whether linguistic variability is crucial in corpus design for lexicography. African Languages unrestricted 2013-09-07T01:45:09Z 2008-07-03 2013-09-07T01:45:09Z 2008-04-17 2008-07-03 2008-07-01 Thesis a 2007 http://hdl.handle.net/2263/25972 http://upetd.up.ac.za/thesis/available/etd-07012008-133107/ © University of Pretoria 20 application/pdf application/pdf application/pdf application/pdf application/pdf application/pdf application/pdf application/pdf application/pdf application/pdf University of Pretoria
spellingShingle Lexicography
Setswana
Corpus design
UCTD
Corpus design for Setswana lexicography
title Corpus design for Setswana lexicography
title_full Corpus design for Setswana lexicography
title_fullStr Corpus design for Setswana lexicography
title_full_unstemmed Corpus design for Setswana lexicography
title_short Corpus design for Setswana lexicography
title_sort corpus design for setswana lexicography
topic Lexicography
Setswana
Corpus design
UCTD
url http://hdl.handle.net/2263/25972
http://upetd.up.ac.za/thesis/available/etd-07012008-133107/