Full Text Available

Note: Clicking the button above will open the full text document at the original institutional repository in a new window.

Unravelling black box machine learning methods using biplots

Following the development of new mathematical techniques, the improvement of computer processing power and the increased availability of possible explanatory variables, the financial services industry is moving toward the use of new machine learning methods, such as neural networks, and away from ol...

Full description

Saved in:
Bibliographic Details
Main Author: Rowan, Adriaan
Other Authors: Little, Francesca
Format: Thesis
Language:English
Published: Department of Statistical Sciences 2020
Subjects:
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1867613231154987008
access_status_str Open Access
author Rowan, Adriaan
author2 Little, Francesca
author_browse Little, Francesca
Rowan, Adriaan
author_facet Little, Francesca
Rowan, Adriaan
author_sort Rowan, Adriaan
collection Thesis
description Following the development of new mathematical techniques, the improvement of computer processing power and the increased availability of possible explanatory variables, the financial services industry is moving toward the use of new machine learning methods, such as neural networks, and away from older methods such as generalised linear models. However, their use is currently limited because they are seen as “black box” models, which gives predictions without justifications and which are therefore not understood and cannot be trusted. The goal of this dissertation is to expand on the theory and use of biplots to visualise the impact of the various input factors on the output of the machine learning black box. Biplots are used because they give an optimal two-dimensional representation of the data set on which the machine learning model is based.The biplot allows every point on the biplot plane to be converted back to the original ��-dimensions – in the same format as is used by the machine learning model. This allows the output of the model to be represented by colour coding each point on the biplot plane according to the output of an independently calibrated machine learning model. The interaction of the changing prediction probabilities – represented by the coloured output – in relation to the data points and the variable axes and category level points represented on the biplot, allows the machine learning model to be globally and locally interpreted. By visualing the models and their predictions, this dissertation aims to remove the stigma of calling non-linear models “black box” models and encourage their wider application in the financial services industry.
format Thesis
id oai:open.uct.ac.za:11427/31124
institution University of Cape Town (South Africa)
language eng
last_indexed 2026-06-10T12:32:51.499Z
license_str Not specified — see source repository
provenance_str_mv Harvested via OAI-PMH from UCTD — University of Cape Town Open Access Repository
publishDate 2020
publishDateRange 2020
publishDateSort 2020
publisher Department of Statistical Sciences
publisherStr Department of Statistical Sciences
record_format dspace
source_str UCTD — University of Cape Town Open Access Repository
spelling oai:open.uct.ac.za:11427/31124 Unravelling black box machine learning methods using biplots Rowan, Adriaan Little, Francesca Lubbe, Sugnet Statistics Following the development of new mathematical techniques, the improvement of computer processing power and the increased availability of possible explanatory variables, the financial services industry is moving toward the use of new machine learning methods, such as neural networks, and away from older methods such as generalised linear models. However, their use is currently limited because they are seen as “black box” models, which gives predictions without justifications and which are therefore not understood and cannot be trusted. The goal of this dissertation is to expand on the theory and use of biplots to visualise the impact of the various input factors on the output of the machine learning black box. Biplots are used because they give an optimal two-dimensional representation of the data set on which the machine learning model is based.The biplot allows every point on the biplot plane to be converted back to the original ��-dimensions – in the same format as is used by the machine learning model. This allows the output of the model to be represented by colour coding each point on the biplot plane according to the output of an independently calibrated machine learning model. The interaction of the changing prediction probabilities – represented by the coloured output – in relation to the data points and the variable axes and category level points represented on the biplot, allows the machine learning model to be globally and locally interpreted. By visualing the models and their predictions, this dissertation aims to remove the stigma of calling non-linear models “black box” models and encourage their wider application in the financial services industry. 2020-02-14T12:04:32Z 2020-02-14T12:04:32Z 2019 2020-02-14T12:04:05Z Master Thesis Masters MSc http://hdl.handle.net/11427/31124 eng application/pdf Department of Statistical Sciences Faculty of Science
spellingShingle Statistics
Rowan, Adriaan
Unravelling black box machine learning methods using biplots
thesis_degree_str Master's
title Unravelling black box machine learning methods using biplots
title_full Unravelling black box machine learning methods using biplots
title_fullStr Unravelling black box machine learning methods using biplots
title_full_unstemmed Unravelling black box machine learning methods using biplots
title_short Unravelling black box machine learning methods using biplots
title_sort unravelling black box machine learning methods using biplots
topic Statistics
url http://hdl.handle.net/11427/31124
work_keys_str_mv AT rowanadriaan unravellingblackboxmachinelearningmethodsusingbiplots