Full Text Available

Note: Clicking the button above will open the full text document at the original institutional repository in a new window.

Variable selection for kernel methods with application to binary classification

Thesis (PhD (Statistics and Actuarial Science))—University of Stellenbosch, 2008.

Saved in:
Bibliographic Details
Main Author: Oosthuizen, Surette
Other Authors: Steel, S. J.
Format: Thesis
Language:English
Published: Stellenbosch : University of Stellenbosch 2008
Subjects:
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1867614097430806528
access_status_str Open Access
author Oosthuizen, Surette
author2 Steel, S. J.
author_browse Oosthuizen, Surette
Steel, S. J.
author_facet Steel, S. J.
Oosthuizen, Surette
author_sort Oosthuizen, Surette
collection Thesis
dc_rights_str_mv University of Stellenbosch
Theses -- Statistics and actuarial science
description Thesis (PhD (Statistics and Actuarial Science))—University of Stellenbosch, 2008.
format Thesis
id oai:scholar.sun.ac.za:10019.1/1301
institution Stellenbosch University (South Africa)
language English
last_indexed 2026-06-10T12:46:37.536Z
license_str Other — see source repository
provenance_str_mv Harvested via OAI-PMH from SUNScholar — Stellenbosch University Repository
publishDate 2008
publishDateRange 2008
publishDateSort 2008
publisher Stellenbosch : University of Stellenbosch
publisherStr Stellenbosch : University of Stellenbosch
record_format dspace
source_str SUNScholar — Stellenbosch University Repository
spelling oai:scholar.sun.ac.za:10019.1/1301 Variable selection for kernel methods with application to binary classification Oosthuizen, Surette Steel, S. J. University of Stellenbosch. Faculty of Economic and Management Sciences. Dept. of Statistics and Actuarial Science. Variable selection Support vector machines Kernel Fisher discriminant analysis Dissertations -- Statistics and actuarial science Thesis (PhD (Statistics and Actuarial Science))—University of Stellenbosch, 2008. The problem of variable selection in binary kernel classification is addressed in this thesis. Kernel methods are fairly recent additions to the statistical toolbox, having originated approximately two decades ago in machine learning and artificial intelligence. These methods are growing in popularity and are already frequently applied in regression and classification problems. Variable selection is an important step in many statistical applications. Thereby a better understanding of the problem being investigated is achieved, and subsequent analyses of the data frequently yield more accurate results if irrelevant variables have been eliminated. It is therefore obviously important to investigate aspects of variable selection for kernel methods. Chapter 2 of the thesis is an introduction to the main part presented in Chapters 3 to 6. In Chapter 2 some general background material on kernel methods is firstly provided, along with an introduction to variable selection. Empirical evidence is presented substantiating the claim that variable selection is a worthwhile enterprise in kernel classification problems. Several aspects which complicate variable selection in kernel methods are discussed. An important property of kernel methods is that the original data are effectively transformed before a classification algorithm is applied to it. The space in which the original data reside is called input space, while the transformed data occupy part of a feature space. In Chapter 3 we investigate whether variable selection should be performed in input space or rather in feature space. A new approach to selection, so-called feature-toinput space selection, is also proposed. This approach has the attractive property of combining information generated in feature space with easy interpretation in input space. An empirical study reveals that effective variable selection requires utilisation of at least some information from feature space. Having confirmed in Chapter 3 that variable selection should preferably be done in feature space, the focus in Chapter 4 is on two classes of selecion criteria operating in feature space: criteria which are independent of the specific kernel classification algorithm and criteria which depend on this algorithm. In this regard we concentrate on two kernel classifiers, viz. support vector machines and kernel Fisher discriminant analysis, both of which are described in some detail in Chapter 4. The chapter closes with a simulation study showing that two of the algorithm-independent criteria are very competitive with the more sophisticated algorithm-dependent ones. In Chapter 5 we incorporate a specific strategy for searching through the space of variable subsets into our investigation. Evidence in the literature strongly suggests that backward elimination is preferable to forward selection in this regard, and we therefore focus on recursive feature elimination. Zero- and first-order forms of the new selection criteria proposed earlier in the thesis are presented for use in recursive feature elimination and their properties are investigated in a numerical study. It is found that some of the simpler zeroorder criteria perform better than the more complicated first-order ones. Up to the end of Chapter 5 it is assumed that the number of variables to select is known. We do away with this restriction in Chapter 6 and propose a simple criterion which uses the data to identify this number when a support vector machine is used. The proposed criterion is investigated in a simulation study and compared to cross-validation, which can also be used for this purpose. We find that the proposed criterion performs well. The thesis concludes in Chapter 7 with a summary and several discussions for further research. Doctoral 2008-06-19T13:45:02Z 2010-06-01T08:18:00Z 2008-06-19T13:45:02Z 2010-06-01T08:18:00Z 2008-03 Thesis http://hdl.handle.net/10019.1/1301 en University of Stellenbosch Theses -- Statistics and actuarial science application/pdf Stellenbosch : University of Stellenbosch
spellingShingle Variable selection
Support vector machines
Kernel Fisher discriminant analysis
Dissertations -- Statistics and actuarial science
Oosthuizen, Surette
Variable selection for kernel methods with application to binary classification
title Variable selection for kernel methods with application to binary classification
title_full Variable selection for kernel methods with application to binary classification
title_fullStr Variable selection for kernel methods with application to binary classification
title_full_unstemmed Variable selection for kernel methods with application to binary classification
title_short Variable selection for kernel methods with application to binary classification
title_sort variable selection for kernel methods with application to binary classification
topic Variable selection
Support vector machines
Kernel Fisher discriminant analysis
Dissertations -- Statistics and actuarial science
url http://hdl.handle.net/10019.1/1301
work_keys_str_mv AT oosthuizensurette variableselectionforkernelmethodswithapplicationtobinaryclassification