Full Text Available

Note: Clicking the button above will open the full text document at the original institutional repository in a new window.

An analysis of the performance and interpretability of machine learning classification algorithms to predict long-term share returns on the JSE

The prediction of long-term share returns is an essential yet complex task in financial analysis and formulating investment strategy. Machine learning is a promising approach for improving the accuracy of these predictions. However, the outputs of machine learning models are not transparent or inter...

Full description

Saved in:
Bibliographic Details
Main Author: Boakes, Jamie
Other Authors: Moodley, Deshendran
Format: Thesis
Language:English
Published: Department of Computer Science 2024
Subjects:
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1867613228776816640
access_status_str Open Access
author Boakes, Jamie
author2 Moodley, Deshendran
author_browse Boakes, Jamie
Moodley, Deshendran
author_facet Moodley, Deshendran
Boakes, Jamie
author_sort Boakes, Jamie
collection Thesis
description The prediction of long-term share returns is an essential yet complex task in financial analysis and formulating investment strategy. Machine learning is a promising approach for improving the accuracy of these predictions. However, the outputs of machine learning models are not transparent or interpretable, which limits their usability for real-world decision making. There is a lack of research on the use of machine learning algorithms to predict long-term share returns on the Johannesburg Stock Exchange (JSE), with no studies that specifically examine the interpretability of machine learning algorithms. This study investigates the use of machine learning algorithms to predict long-term share returns on the JSE based on fundamental data and analyses the interpretability of the top performing algorithms. Based on a review of the literature, eight machine learning classification algorithms were selected and compared to predict tercile class 12-month share returns using fundamental data, spanning a period of two decades. The XGBoost, Random Forest, and GradBoost algorithms were found to outperform the Support Vector Classifier, Logistic Regression, Decision Tree, Artificial Neural Network, and AdaBoost algorithms. XGBoost and Random Forest were further investigated using SHAP (SHapley Additive exPlanations) global summary plots to identify the most influential input features and to analyse the interpretability of these algorithms. The study found that ensemble-based classification algorithms, i.e. XGBoost, Random Forest and GradBoost, outperformed the other algorithms. Further analysis of the results varied, with some sectors outperforming the overall market. An analysis of the input features identified the most important valuation and profitability ratios that contributed to prediction performance, and thus improves the transparency and interpretability of the models. This research is the first to comprehensively compare and analyse the interpretability of machine learning algorithms to predict long-term share returns on the JSE.
format Thesis
id oai:open.uct.ac.za:11427/40761
institution University of Cape Town (South Africa)
language eng
last_indexed 2026-06-10T12:32:47.627Z
license_str Not specified — see source repository
provenance_str_mv Harvested via OAI-PMH from UCTD — University of Cape Town Open Access Repository
publishDate 2024
publishDateRange 2024
publishDateSort 2024
publisher Department of Computer Science
publisherStr Department of Computer Science
record_format dspace
source_str UCTD — University of Cape Town Open Access Repository
spelling oai:open.uct.ac.za:11427/40761 An analysis of the performance and interpretability of machine learning classification algorithms to predict long-term share returns on the JSE Boakes, Jamie Moodley, Deshendran Information Technology The prediction of long-term share returns is an essential yet complex task in financial analysis and formulating investment strategy. Machine learning is a promising approach for improving the accuracy of these predictions. However, the outputs of machine learning models are not transparent or interpretable, which limits their usability for real-world decision making. There is a lack of research on the use of machine learning algorithms to predict long-term share returns on the Johannesburg Stock Exchange (JSE), with no studies that specifically examine the interpretability of machine learning algorithms. This study investigates the use of machine learning algorithms to predict long-term share returns on the JSE based on fundamental data and analyses the interpretability of the top performing algorithms. Based on a review of the literature, eight machine learning classification algorithms were selected and compared to predict tercile class 12-month share returns using fundamental data, spanning a period of two decades. The XGBoost, Random Forest, and GradBoost algorithms were found to outperform the Support Vector Classifier, Logistic Regression, Decision Tree, Artificial Neural Network, and AdaBoost algorithms. XGBoost and Random Forest were further investigated using SHAP (SHapley Additive exPlanations) global summary plots to identify the most influential input features and to analyse the interpretability of these algorithms. The study found that ensemble-based classification algorithms, i.e. XGBoost, Random Forest and GradBoost, outperformed the other algorithms. Further analysis of the results varied, with some sectors outperforming the overall market. An analysis of the input features identified the most important valuation and profitability ratios that contributed to prediction performance, and thus improves the transparency and interpretability of the models. This research is the first to comprehensively compare and analyse the interpretability of machine learning algorithms to predict long-term share returns on the JSE. 2024-12-02T10:46:34Z 2024-12-02T10:46:34Z 2024 2024-11-28T10:53:10Z Thesis / Dissertation Masters MSc http://hdl.handle.net/11427/40761 eng application/pdf Department of Computer Science Faculty of Science University of Cape Town
spellingShingle Information Technology
Boakes, Jamie
An analysis of the performance and interpretability of machine learning classification algorithms to predict long-term share returns on the JSE
thesis_degree_str Master's
title An analysis of the performance and interpretability of machine learning classification algorithms to predict long-term share returns on the JSE
title_full An analysis of the performance and interpretability of machine learning classification algorithms to predict long-term share returns on the JSE
title_fullStr An analysis of the performance and interpretability of machine learning classification algorithms to predict long-term share returns on the JSE
title_full_unstemmed An analysis of the performance and interpretability of machine learning classification algorithms to predict long-term share returns on the JSE
title_short An analysis of the performance and interpretability of machine learning classification algorithms to predict long-term share returns on the JSE
title_sort analysis of the performance and interpretability of machine learning classification algorithms to predict long term share returns on the jse
topic Information Technology
url http://hdl.handle.net/11427/40761
work_keys_str_mv AT boakesjamie ananalysisoftheperformanceandinterpretabilityofmachinelearningclassificationalgorithmstopredictlongtermsharereturnsonthejse
AT boakesjamie analysisoftheperformanceandinterpretabilityofmachinelearningclassificationalgorithmstopredictlongtermsharereturnsonthejse