Full Text Available

Note: Clicking the button above will open the full text document at the original institutional repository in a new window.

Binary classification trees : a comparison with popular classification methods in statistics using different software

Thesis (MComm) -- Stellenbosch University, 2002.

Saved in:
Bibliographic Details
Main Author: Lamont, Morné Michael Connell
Other Authors: Louw, N.
Format: Thesis
Language:en_ZA
Published: Stellenbosch : Stellenbosch University 2012
Subjects:
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1867613859801464832
access_status_str Open Access
author Lamont, Morné Michael Connell
author2 Louw, N.
author_browse Lamont, Morné Michael Connell
Louw, N.
author_facet Louw, N.
Lamont, Morné Michael Connell
author_sort Lamont, Morné Michael Connell
collection Thesis
dc_rights_str_mv Stellenbosch University
description Thesis (MComm) -- Stellenbosch University, 2002.
format Thesis
id oai:scholar.sun.ac.za:10019.1/52718
institution Stellenbosch University (South Africa)
language en_ZA
last_indexed 2026-06-10T12:42:50.594Z
license_str Other — see source repository
provenance_str_mv Harvested via OAI-PMH from SUNScholar — Stellenbosch University Repository
publishDate 2012
publishDateRange 2012
publishDateSort 2012
publisher Stellenbosch : Stellenbosch University
publisherStr Stellenbosch : Stellenbosch University
record_format dspace
source_str SUNScholar — Stellenbosch University Repository
spelling oai:scholar.sun.ac.za:10019.1/52718 Binary classification trees : a comparison with popular classification methods in statistics using different software Lamont, Morné Michael Connell Louw, N. Stellenbosch University. Faculty of Economic and Management Sciences. Department of Statistics and Actuarial Science. Trees (Graph theory) Binary system (Mathematics) Discriminant analysis Mathematics -- Software Statistical methods Thesis (MComm) -- Stellenbosch University, 2002. ENGLISH ABSTRACT: Consider a data set with a categorical response variable and a set of explanatory variables. The response variable can have two or more categories and the explanatory variables can be numerical or categorical. This is a typical setup for a classification analysis, where we want to model the response based on the explanatory variables. Traditional statistical methods have been developed under certain assumptions such as: the explanatory variables are numeric only and! or the data follow a multivariate normal distribution. hl practice such assumptions are not always met. Different research fields generate data that have a mixed structure (categorical and numeric) and researchers are often interested using all these data in the analysis. hl recent years robust methods such as classification trees have become the substitute for traditional statistical methods when the above assumptions are violated. Classification trees are not only an effective classification method, but offer many other advantages. The aim of this thesis is to highlight the advantages of classification trees. hl the chapters that follow, the theory of and further developments on classification trees are discussed. This forms the foundation for the CART software which is discussed in Chapter 5, as well as other software in which classification tree modeling is possible. We will compare classification trees to parametric-, kernel- and k-nearest-neighbour discriminant analyses. A neural network is also compared to classification trees and finally we draw some conclusions on classification trees and its comparisons with other methods. AFRIKAANSE OPSOMMING: Beskou 'n datastel met 'n kategoriese respons veranderlike en 'n stel verklarende veranderlikes. Die respons veranderlike kan twee of meer kategorieë hê en die verklarende veranderlikes kan numeries of kategories wees. Hierdie is 'n tipiese opset vir 'n klassifikasie analise, waar ons die respons wil modelleer deur gebruik te maak van die verklarende veranderlikes. Tradisionele statistiese metodes is ontwikkelonder sekere aannames soos: die verklarende veranderlikes is slegs numeries en! of dat die data 'n meerveranderlike normaal verdeling het. In die praktyk word daar nie altyd voldoen aan hierdie aannames nie. Verskillende navorsingsvelde genereer data wat 'n gemengde struktuur het (kategories en numeries) en navorsers wil soms al hierdie data gebruik in die analise. In die afgelope jare het robuuste metodes soos klassifikasie bome die alternatief geword vir tradisionele statistiese metodes as daar nie aan bogenoemde aannames voldoen word nie. Klassifikasie bome is nie net 'n effektiewe klassifikasie metode nie, maar bied baie meer voordele. Die doel van hierdie werkstuk is om die voordele van klassifikasie bome uit te wys. In die hoofstukke wat volg word die teorie en verdere ontwikkelinge van klassifikasie bome bespreek. Hierdie vorm die fondament vir die CART sagteware wat bespreek word in Hoofstuk 5, asook ander sagteware waarin klassifikasie boom modelering moontlik is. Ons sal klassifikasie bome vergelyk met parametriese-, "kernel"- en "k-nearest-neighbour" diskriminant analise. 'n Neurale netwerk word ook vergelyk met klassifikasie bome en ten slote word daar gevolgtrekkings gemaak oor klassifikasie bome en hoe dit vergelyk met ander metodes. Masters 2012-08-27T11:35:07Z 2012-08-27T11:35:07Z 2002-12 Thesis http://hdl.handle.net/10019.1/52718 en_ZA Stellenbosch University ix, 92 pages : illustrations application/pdf Stellenbosch : Stellenbosch University
spellingShingle Trees (Graph theory)
Binary system (Mathematics)
Discriminant analysis
Mathematics -- Software
Statistical methods
Lamont, Morné Michael Connell
Binary classification trees : a comparison with popular classification methods in statistics using different software
title Binary classification trees : a comparison with popular classification methods in statistics using different software
title_full Binary classification trees : a comparison with popular classification methods in statistics using different software
title_fullStr Binary classification trees : a comparison with popular classification methods in statistics using different software
title_full_unstemmed Binary classification trees : a comparison with popular classification methods in statistics using different software
title_short Binary classification trees : a comparison with popular classification methods in statistics using different software
title_sort binary classification trees a comparison with popular classification methods in statistics using different software
topic Trees (Graph theory)
Binary system (Mathematics)
Discriminant analysis
Mathematics -- Software
Statistical methods
url http://hdl.handle.net/10019.1/52718
work_keys_str_mv AT lamontmornemichaelconnell binaryclassificationtreesacomparisonwithpopularclassificationmethodsinstatisticsusingdifferentsoftware