Full Text Available

Access Repository

Note: Clicking the button above will open the full text document at the original institutional repository in a new window.

Modern gradient boosting

Thesis (MCom)--Stellenbosch University, 2024.

Saved in:

Bibliographic Details
Main Author:	Zackey, Matthew David
Other Authors:	Uys, Daniel Wilhelm
Format:	Thesis
Language:	en_ZA
Published:	Stellenbosch : Stellenbosch University 2024
Subjects:	Machine learning Supervised learning (Machine learning) Machine learning > Statistical methods Statistics > Data processing UCTD
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1867613936532062208
access_status_str	Open Access
author	Zackey, Matthew David
author2	Uys, Daniel Wilhelm
author_browse	Uys, Daniel Wilhelm Zackey, Matthew David
author_facet	Uys, Daniel Wilhelm Zackey, Matthew David
author_sort	Zackey, Matthew David
collection	Thesis
dc_rights_str_mv	Stellenbosch University
description	Thesis (MCom)--Stellenbosch University, 2024.
format	Thesis
id	oai:scholar.sun.ac.za:10019.1/130204
institution	Stellenbosch University (South Africa)
language	en_ZA
last_indexed	2026-06-10T12:44:04.029Z
license_str	Other — see source repository
provenance_str_mv	Harvested via OAI-PMH from SUNScholar — Stellenbosch University Repository
publishDate	2024
publishDateRange	2024
publishDateSort	2024
publisher	Stellenbosch : Stellenbosch University
publisherStr	Stellenbosch : Stellenbosch University
record_format	dspace
source_str	SUNScholar — Stellenbosch University Repository
spelling	oai:scholar.sun.ac.za:10019.1/130204 Modern gradient boosting Zackey, Matthew David Uys, Daniel Wilhelm Steel, Sarel Johannes Stellenbosch University. Faculty of Economic and Management Sciences. Dept. of Statistics and Actuarial Science. Machine learning Supervised learning (Machine learning) Machine learning -- Statistical methods Statistics -- Data processing UCTD Thesis (MCom)--Stellenbosch University, 2024. ENGLISH SUMMARY: Boosting is a supervised learning procedure that has gained considerable interest in statistical and machine learning owing to its powerful predictive performance. The idea of boosting is to obtain a model ensemble by sequentially fitting base learners to modified versions of the training data. The first complete boosting procedure was Adaptive boosting (AdaBoost), designed for binary classification. Gradient boosting followed AdaBoost, which allowed boosting to be applied to any differentiable and continuous loss function. The most frequently used version of gradient boosting is Multiple Additive Regression Trees (MART), where trees are specified as the base learners. In the last several years, there have been numerous extensions to MART, aiming to improve its predictive performance and scalability. Extreme Gradient Boosting (XGBoost), Light Gradient Boosting Machine (LightGBM) and Categorical Boosting (CatBoost) are three of these extensions, which in this thesis are termed the modern gradient boosting methods. The thesis introduces boosting by reviewing the details of AdaBoost, forward stagewise additive modelling (FSAM) and gradient boosting. Notably, the equivalence of AdaBoost and FSAM with the exponential loss is proven, FSAM for regression with trees is considered and the need for an efficient procedure like gradient boosting is emphasised. Additionally, two derivations of gradient boosting are provided. The first considers gradient boosting as an approximation to steepest descent of the empirical risk, while the second views gradient boosting as taking a quadratic approximation of FSAM. Since trees are a popular choice of base learner in gradient boosting, details will be given on MART. The remainder of the thesis studies the modern methods, focusing on the mathematical details of their novelties. Examples, illustrations, and simulations are given for some of these novelties to provide further clarity. Additionally, empirical studies investigating the generalisation performance of certain novelties are presented. More specifically, these empirical studies consider the performance of XGBoost’s regularisation parameters in tree-building, GOSS from LightGBM, the Plain and Ordered modes in CatBoost, and the cosine similarity to construct the trees in CatBoost. In these experiments, several binary classification datasets are considered with varying characteristics: size, class imbalance, sparsity and the inclusion of categorical features. AFRIKAANSE OPSOMMING: Versterking is ’n leer-onder-toesig prosedure met kragtige voorspellingsvermoens wat baie in statistieseen masjienleer gebruik word. Die idee van versterking is om basisleerders opeenvolgend op aangepaste weergawes van die leer-data te pas, om sodoende saamgevoegde model te ontwikkel. AdaBoost was die eerste volledige versterkingsprosedure vir binere klassifikasie. Gradientversterking, wat dit moontlik maak om versterking op enige differensieerbare en kontinue verliesfunksie toe te pas, het op AdaBoost gevolg. Die weergawe van gradientversterking wat die meeste gebruik word, is MART; hier word regressie bome as die basisleerders gebruik. In die afgelope paar jaar is verskeie uitbreidings van MART ontwikkel; die doel van hierdie uitbreidings was om die voorspellingsvermoe van modelle te verbeter. Drie van hierdie uitbreidings is XGBoost, LightGBM en CatBoost, en staan in hierdie tesis as moderne gradientversterkingsmetodes bekend. Die tesis lei versterking in deur die besonderhede van AdaBoost, FSAM en gradientversterking te hersien. Daar word bewys dat AdaBoost en FSAM ekwivalent is in die geval van eksponensiele verlies; FSAM vir regressie met bome word beskou, en die behoefte om doeltreffende prosedure, soos gradientversterking, te ontwikkel, word beklemtoon. Twee afleidings van gradientversterking word ook gegee. Die eerste beskou gradientversterking as ’n benadering tot die steilste afname van die empiriese risiko, terwyl die tweede gradientversterking as ’n kwadratiese benadering van FSAM beskou. Aangesien bome ’n gewilde keuse vir basisleerders in gradientversterking is, word besonderhede binne die konteks van MART gegee. Die res van die tesis bestudeer die moderne gradientversterkingsmetodes met die fokus op nuwe wiskundige besonderhede. Om verdere duidelikheid te gee word voorbeelde, illustrasies en simulasies van sommige van hierdie nuwe wiskundige besonderhede gegee. Empiriese studies wat die veralgemeende prestasie van die metodes ondersoek, word ook gegee. In besonder beskou die empiriese studies die prestasie van XGBoost se regulariseringsparameters wanneer bome gebou word, GOSS in LightGBM, die gewone en geordende modusse in CatBoost, en die kosinus-ooreenkoms om bome in CatBoost te bou. In hierdie eksperimente word verskeie binere klassifikasiedatastelle gebruik met verskillende eienskappe soos grootte, klas-wanbalans, ylheid en die insluiting van kategoriese veranderlikes. Masters 2024-02-26T08:34:03Z 2024-04-26T09:05:46Z 2024-02-26T08:34:03Z 2024-04-26T09:05:46Z 2024-03 Thesis https://scholar.sun.ac.za/handle/10019.1/130204 en_ZA Stellenbosch University xx, 141 pages : illustrations, includes annexures application/pdf Stellenbosch : Stellenbosch University
spellingShingle	Machine learning Supervised learning (Machine learning) Machine learning -- Statistical methods Statistics -- Data processing UCTD Zackey, Matthew David Modern gradient boosting
title	Modern gradient boosting
title_full	Modern gradient boosting
title_fullStr	Modern gradient boosting
title_full_unstemmed	Modern gradient boosting
title_short	Modern gradient boosting
title_sort	modern gradient boosting
topic	Machine learning Supervised learning (Machine learning) Machine learning -- Statistical methods Statistics -- Data processing UCTD
url	https://scholar.sun.ac.za/handle/10019.1/130204
work_keys_str_mv	AT zackeymatthewdavid moderngradientboosting

Full Text Available

Modern gradient boosting

Similar Items