Discriminant analysis using sparse graphical models

Thesis (MCom)--Stellenbosch University, 2020.

Bibliographic Details
Main Author: Botha, Dylon
Other Authors: Kamper, Francois
Format: Thesis
Language: en_ZA
Published: Stellenbosch : Stellenbosch University 2020
_version_ 1867613906788155392
access_status_str Open Access
author Botha, Dylon
author2 Kamper, Francois
author_browse Botha, Dylon
Kamper, Francois
author_facet Kamper, Francois
Botha, Dylon
author_sort Botha, Dylon
collection Thesis
dc_rights_str_mv Stellenbosch University
description Thesis (MCom)--Stellenbosch University, 2020.
format Thesis
id oai:scholar.sun.ac.za:10019.1/107978
institution Stellenbosch University (South Africa)
language en_ZA
last_indexed 2026-06-10T12:43:35.721Z
license_str Other — see source repository
provenance_str_mv Harvested via OAI-PMH from SUNScholar — Stellenbosch University Repository
publishDate 2020
publishDateRange 2020
publishDateSort 2020
publisher Stellenbosch : Stellenbosch University
publisherStr Stellenbosch : Stellenbosch University
record_format dspace
source_str SUNScholar — Stellenbosch University Repository
spelling oai:scholar.sun.ac.za:10019.1/107978
Discriminant analysis using sparse graphical models
Botha, Dylon
Kamper, Francois
Bierman, Surette
Stellenbosch University. Faculty of Economic and Management Sciences. Dept. of Statistics and Actuarial Science.
Gaussian distribution -- Graphic methods
Graphical modeling (Statistics)
Sparse grids
Inverse Gaussian distribution
Multivariate analysis -- Graphic methods
Discriminant analysis -- Graphic methods
UCTD
Thesis (MCom)--Stellenbosch University, 2020.
ENGLISH SUMMARY : The objective of this thesis is to propose a new classification method. The method is an extension of classical quadratic discriminant analysis (QDA), where the focus is placed on relaxing the assumption of normality, and on overcoming the adverse effect of the large number of parameters that need to be estimated when applying QDA.
To relax the assumption of normality, we consider assigning to each class density a different nonparanormal distribution. Based on these nonparanormal distributions, new discriminant functions can be derived. When one considers the use of a nonparanormal distribution, the underlying assumption is that the associated random vector can, through the use of an appropriate transformation, be made to follow a Gaussian distribution. Such a transformation is based on the marginals of the distribution, which are estimated in a nonparametric way.
The large number of parameters in QDA is a result of the estimation of class precision matrices. To overcome this problem, penalised maximum likelihood estimation is performed by placing an L1 penalty on the size of the elements in the class precision matrices. This leads to sparse precision matrix estimates, and therefore also to a reduction in the number of estimated parameters.
Combining the above approaches to overcome the problems induced by nonnormality and a large number of parameters to estimate leads to the following novel classification method. To each class density, a separate transformation is applied. Thereafter, L1 penalised maximum likelihood estimation is performed in the transformed space. The resulting parameter estimates are then plugged into the nonparanormal discriminant functions, thereby facilitating classification. An empirical evaluation of the novel proposal shows it to be competitive with a wide array of existing classifiers. We also establish a connection to probabilistic graphical models, which could aid in the interpretation of this new technique.
AFRIKAANSE OPSOMMING : Die doelwit van hierdie tesis is die voorstel van ’n nuwe klassifikasie-metode. Hierdie klassifikasie-metode is ’n uitbreiding van klassieke kwadratiese diskriminant-analise (KDA), waarin die normaliteits-aanname van KDA verslap word, en waarin die negatiewe effek van die groot aantal parameters wat beraam moet word in KDA-toepassings, aangespreek word.
Ter verslapping van die normaliteits-aanname beskou ons die toekenning van verskillende nie-paranormale verdelings aan elke klas. Op grond van hierdie nie-paranormale digtheidsfunksies kan nuwe diskriminantfunksies afgelei word. Wanneer ’n nie-paranormale verdeling veronderstel word, is die onderliggende aanname dat die geassosieerde vektor van stogastiese veranderlikes na ’n normaalverdeling transformeer kan word. Hierdie transformasie is gebaseer op die marginale verdelings, wat weer op ’n nie-parametriese wyse beraam word.
Die groot aantal parameters in KDA is die gevolg van die beraming van presisiematrikse vir elke klas. Om hierdie probleem te oorkom, word gepenaliseerde maksimum aanneemlikheidsberaming toegepas, spesifiek deur L1-penalisering op die grootte van die elemente in die presisiematrikse. Dit lei tot ’n patroon van skaarsheid in die inverse kovariansiematrikse, en derhalwe ook tot ’n vermindering in die aantal beraamde parameters.
Die samevoeging van die bogaande twee benaderings ten einde die probleme veroorsaak deur nie-normaliteit en die groot aantal parameters om te beraam, te oorkom, lei tot die volgende nuwe klassifikasie-metode. Vir elke klasdigtheid word ’n aparte transformasie toegepas. Daarna word L1-gepenaliseerde maksimum aanneemlikheidsberaming in die getransformeerde ruimte toegepas. Die beramings wat sodoende gevind word, word dan by die nie-paranormale diskriminantfunksies ingestel ten einde klassifikasie te doen. Empiriese evaluering van die nuwe tegniek wys dat dit goed vergelyk met bestaande klassifikasie-metodes. Ons bevestig ook ’n verwantskap met grafiese modelle, wat moontlik kan bydra tot interpretasie van die nuwe tegniek.
Masters
2020-02-18T17:58:52Z
2020-04-28T12:12:32Z
2020-03
Thesis
http://hdl.handle.net/10019.1/107978
en_ZA
Stellenbosch University
xii, 113 pages ; illustrations, includes annexure
application/pdf
Stellenbosch : Stellenbosch University
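The three steps named in the summary (a separate transformation per class, L1 penalised precision estimation in the transformed space, and plug-in nonparanormal discriminant functions) can be illustrated with a minimal sketch. It is not the thesis's implementation. The following are assumptions made only for illustration: the kernel-smoothed marginal CDF with a Silverman-type bandwidth, the ADMM solver with the penalty on off-diagonal elements only, the value of lam, the class name SparseNonparanormalDA, and the synthetic data.

```python
import math
from statistics import NormalDist

import numpy as np

_STD = NormalDist()
_cdf = np.vectorize(_STD.cdf)      # standard normal CDF, elementwise
_ppf = np.vectorize(_STD.inv_cdf)  # standard normal quantile, elementwise


def fit_marginal(x):
    """Return (f, log_df) for one feature: f = Phi^{-1}(F), log_df = log f'.

    F is a Gaussian-kernel-smoothed CDF (an assumption), so f' = p / phi(f),
    where p is the matching kernel density estimate.
    """
    h = 1.06 * x.std(ddof=1) * len(x) ** (-0.2)  # Silverman-type bandwidth

    def f(t):
        u = (np.asarray(t)[:, None] - x[None, :]) / h
        F = _cdf(u).mean(axis=1)
        return _ppf(np.clip(F, 1e-10, 1 - 1e-10))  # keep quantiles finite

    def log_df(t):
        u = (np.asarray(t)[:, None] - x[None, :]) / h
        p = np.exp(-0.5 * u**2).mean(axis=1) / (h * math.sqrt(2 * math.pi))
        z = f(t)
        log_phi = -0.5 * z**2 - 0.5 * math.log(2 * math.pi)
        return np.log(np.maximum(p, 1e-300)) - log_phi

    return f, log_df


def graphical_lasso(S, lam, rho=1.0, n_iter=200):
    """ADMM for min -log det(O) + tr(S O) + lam * sum_{i != j} |O_ij|."""
    p = S.shape[0]
    Z, U = np.eye(p), np.zeros((p, p))
    off = ~np.eye(p, dtype=bool)
    for _ in range(n_iter):
        w, Q = np.linalg.eigh(rho * (Z - U) - S)
        theta = (Q * ((w + np.sqrt(w**2 + 4 * rho)) / (2 * rho))) @ Q.T
        A = theta + U
        Z = A.copy()
        # Soft-threshold the off-diagonal entries only.
        Z[off] = np.sign(A[off]) * np.maximum(np.abs(A[off]) - lam / rho, 0.0)
        U = A - Z
    return Z


class SparseNonparanormalDA:
    """Hypothetical name: QDA-style classifier in per-class Gaussianised space."""

    def __init__(self, lam=0.1):
        self.lam = lam

    def fit(self, X, y):
        self.classes_ = np.unique(y)
        self.models_ = []
        for c in self.classes_:
            Xc = X[y == c]
            maps = [fit_marginal(Xc[:, j]) for j in range(X.shape[1])]
            Zc = np.column_stack([f(Xc[:, j]) for j, (f, _) in enumerate(maps)])
            omega = graphical_lasso(np.cov(Zc, rowvar=False), self.lam)
            log_prior = math.log(len(Xc) / len(X))
            self.models_.append((maps, Zc.mean(axis=0), omega, log_prior))
        return self

    def decision_function(self, X):
        scores = []
        for maps, mu, omega, log_prior in self.models_:
            Z = np.column_stack([f(X[:, j]) for j, (f, _) in enumerate(maps)])
            # Jacobian term: class-specific transformations do not cancel.
            log_jac = sum(ldf(X[:, j]) for j, (_, ldf) in enumerate(maps))
            d = Z - mu
            quad = np.einsum("ij,jk,ik->i", d, omega, d)
            logdet = np.linalg.slogdet(omega)[1]
            scores.append(log_prior + 0.5 * logdet - 0.5 * quad + log_jac)
        return np.column_stack(scores)

    def predict(self, X):
        return self.classes_[np.argmax(self.decision_function(X), axis=1)]


# Synthetic illustration: one Gaussian class and one skewed (lognormal) class.
rng = np.random.default_rng(0)
X_train = np.vstack([rng.normal(0.0, 1.0, (60, 3)),
                     np.exp(rng.normal(1.5, 0.3, (60, 3)))])
y_train = np.repeat([0, 1], 60)
X_test = np.vstack([rng.normal(0.0, 1.0, (40, 3)),
                    np.exp(rng.normal(1.5, 0.3, (40, 3)))])
y_test = np.repeat([0, 1], 40)

model = SparseNonparanormalDA(lam=0.1).fit(X_train, y_train)
acc = float(np.mean(model.predict(X_test) == y_test))
print("held-out accuracy:", acc)
```

The Jacobian term is kept in the score because each class has its own transformation, so it differs between classes and cannot be dropped as a constant. In practice the penalty lam would be tuned per class, for example by cross-validation. The nonzero off-diagonal entries of each estimated precision matrix can be read as the edges of that class's graph.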
spellingShingle Gaussian distribution -- Graphic methods
Graphical modeling (Statistics)
Sparse grids
Inverse Gaussian distribution
Multivariate analysis -- Graphic methods
Discriminant analysis -- Graphic methods
UCTD
Botha, Dylon
Discriminant analysis using sparse graphical models
title Discriminant analysis using sparse graphical models
title_full Discriminant analysis using sparse graphical models
title_fullStr Discriminant analysis using sparse graphical models
title_full_unstemmed Discriminant analysis using sparse graphical models
title_short Discriminant analysis using sparse graphical models
title_sort discriminant analysis using sparse graphical models
topic Gaussian distribution -- Graphic methods
Graphical modeling (Statistics)
Sparse grids
Inverse Gaussian distribution
Multivariate analysis -- Graphic methods
Discriminant analysis -- Graphic methods
UCTD
url http://hdl.handle.net/10019.1/107978
work_keys_str_mv AT bothadylon discriminantanalysisusingsparsegraphicalmodels