Full Text Available

Note: Clicking the button above will open the full text document at the original institutional repository in a new window.

Modern gradient boosting

Thesis (MCom)--Stellenbosch University, 2024.

Saved in:
Bibliographic Details
Main Author: Zackey, Matthew David
Other Authors: Uys, Daniel Wilhelm
Format: Thesis
Language:en_ZA
Published: Stellenbosch : Stellenbosch University 2024
Subjects:
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1867613936532062208
access_status_str Open Access
author Zackey, Matthew David
author2 Uys, Daniel Wilhelm
author_browse Uys, Daniel Wilhelm
Zackey, Matthew David
author_facet Uys, Daniel Wilhelm
Zackey, Matthew David
author_sort Zackey, Matthew David
collection Thesis
dc_rights_str_mv Stellenbosch University
description Thesis (MCom)--Stellenbosch University, 2024.
format Thesis
id oai:scholar.sun.ac.za:10019.1/130204
institution Stellenbosch University (South Africa)
language en_ZA
last_indexed 2026-06-10T12:44:04.029Z
license_str Other — see source repository
provenance_str_mv Harvested via OAI-PMH from SUNScholar — Stellenbosch University Repository
publishDate 2024
publishDateRange 2024
publishDateSort 2024
publisher Stellenbosch : Stellenbosch University
publisherStr Stellenbosch : Stellenbosch University
record_format dspace
source_str SUNScholar — Stellenbosch University Repository
spelling oai:scholar.sun.ac.za:10019.1/130204 Modern gradient boosting Zackey, Matthew David Uys, Daniel Wilhelm Steel, Sarel Johannes Stellenbosch University. Faculty of Economic and Management Sciences. Dept. of Statistics and Actuarial Science. Machine learning Supervised learning (Machine learning) Machine learning -- Statistical methods Statistics -- Data processing UCTD Thesis (MCom)--Stellenbosch University, 2024. ENGLISH SUMMARY: Boosting is a supervised learning procedure that has gained considerable interest in statistical and machine learning owing to its powerful predictive performance. The idea of boosting is to obtain a model ensemble by sequentially fitting base learners to modified versions of the training data. The first complete boosting procedure was Adaptive boosting (AdaBoost), designed for binary classification. Gradient boosting followed AdaBoost, which allowed boosting to be applied to any differentiable and continuous loss function. The most frequently used version of gradient boosting is Multiple Additive Regression Trees (MART), where trees are specified as the base learners. In the last several years, there have been numerous extensions to MART, aiming to improve its predictive performance and scalability. Extreme Gradient Boosting (XGBoost), Light Gradient Boosting Machine (LightGBM) and Categorical Boosting (CatBoost) are three of these extensions, which in this thesis are termed the modern gradient boosting methods. The thesis introduces boosting by reviewing the details of AdaBoost, forward stagewise additive modelling (FSAM) and gradient boosting. Notably, the equivalence of AdaBoost and FSAM with the exponential loss is proven, FSAM for regression with trees is considered and the need for an efficient procedure like gradient boosting is emphasised. Additionally, two derivations of gradient boosting are provided. The first considers gradient boosting as an approximation to steepest descent of the empirical risk, while the second views gradient boosting as taking a quadratic approximation of FSAM. Since trees are a popular choice of base learner in gradient boosting, details will be given on MART. The remainder of the thesis studies the modern methods, focusing on the mathematical details of their novelties. Examples, illustrations, and simulations are given for some of these novelties to provide further clarity. Additionally, empirical studies investigating the generalisation performance of certain novelties are presented. More specifically, these empirical studies consider the performance of XGBoost’s regularisation parameters in tree-building, GOSS from LightGBM, the Plain and Ordered modes in CatBoost, and the cosine similarity to construct the trees in CatBoost. In these experiments, several binary classification datasets are considered with varying characteristics: size, class imbalance, sparsity and the inclusion of categorical features. AFRIKAANSE OPSOMMING: Versterking is ’n leer-onder-toesig prosedure met kragtige voorspellingsvermoens wat baie in statistieseen masjienleer gebruik word. Die idee van versterking is om basisleerders opeenvolgend op aangepaste weergawes van die leer-data te pas, om sodoende saamgevoegde model te ontwikkel. AdaBoost was die eerste volledige versterkingsprosedure vir binere klassifikasie. Gradientversterking, wat dit moontlik maak om versterking op enige differensieerbare en kontinue verliesfunksie toe te pas, het op AdaBoost gevolg. Die weergawe van gradientversterking wat die meeste gebruik word, is MART; hier word regressie bome as die basisleerders gebruik. In die afgelope paar jaar is verskeie uitbreidings van MART ontwikkel; die doel van hierdie uitbreidings was om die voorspellingsvermoe van modelle te verbeter. Drie van hierdie uitbreidings is XGBoost, LightGBM en CatBoost, en staan in hierdie tesis as moderne gradientversterkingsmetodes bekend. Die tesis lei versterking in deur die besonderhede van AdaBoost, FSAM en gradientversterking te hersien. Daar word bewys dat AdaBoost en FSAM ekwivalent is in die geval van eksponensiele verlies; FSAM vir regressie met bome word beskou, en die behoefte om doeltreffende prosedure, soos gradientversterking, te ontwikkel, word beklemtoon. Twee afleidings van gradientversterking word ook gegee. Die eerste beskou gradientversterking as ’n benadering tot die steilste afname van die empiriese risiko, terwyl die tweede gradientversterking as ’n kwadratiese benadering van FSAM beskou. Aangesien bome ’n gewilde keuse vir basisleerders in gradientversterking is, word besonderhede binne die konteks van MART gegee. Die res van die tesis bestudeer die moderne gradientversterkingsmetodes met die fokus op nuwe wiskundige besonderhede. Om verdere duidelikheid te gee word voorbeelde, illustrasies en simulasies van sommige van hierdie nuwe wiskundige besonderhede gegee. Empiriese studies wat die veralgemeende prestasie van die metodes ondersoek, word ook gegee. In besonder beskou die empiriese studies die prestasie van XGBoost se regulariseringsparameters wanneer bome gebou word, GOSS in LightGBM, die gewone en geordende modusse in CatBoost, en die kosinus-ooreenkoms om bome in CatBoost te bou. In hierdie eksperimente word verskeie binere klassifikasiedatastelle gebruik met verskillende eienskappe soos grootte, klas-wanbalans, ylheid en die insluiting van kategoriese veranderlikes. Masters 2024-02-26T08:34:03Z 2024-04-26T09:05:46Z 2024-02-26T08:34:03Z 2024-04-26T09:05:46Z 2024-03 Thesis https://scholar.sun.ac.za/handle/10019.1/130204 en_ZA Stellenbosch University xx, 141 pages : illustrations, includes annexures application/pdf Stellenbosch : Stellenbosch University
spellingShingle Machine learning
Supervised learning (Machine learning)
Machine learning -- Statistical methods
Statistics -- Data processing
UCTD
Zackey, Matthew David
Modern gradient boosting
title Modern gradient boosting
title_full Modern gradient boosting
title_fullStr Modern gradient boosting
title_full_unstemmed Modern gradient boosting
title_short Modern gradient boosting
title_sort modern gradient boosting
topic Machine learning
Supervised learning (Machine learning)
Machine learning -- Statistical methods
Statistics -- Data processing
UCTD
url https://scholar.sun.ac.za/handle/10019.1/130204
work_keys_str_mv AT zackeymatthewdavid moderngradientboosting