Full Text Available

Note: Clicking the button above will open the full text document at the original institutional repository in a new window.

Deep learning for tabular data : an exploratory study

Thesis (MCom)--Stellenbosch University, 2019.

Saved in:
Bibliographic Details
Main Author: Marais, Jan Andre
Other Authors: Bierman, Surette
Format: Thesis
Language:en_ZA
Published: Stellenbosch : Stellenbosch University 2019
Subjects:
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1867614025252077568
access_status_str Open Access
author Marais, Jan Andre
author2 Bierman, Surette
author_browse Bierman, Surette
Marais, Jan Andre
author_facet Bierman, Surette
Marais, Jan Andre
author_sort Marais, Jan Andre
collection Thesis
dc_rights_str_mv Stellenbosch University
description Thesis (MCom)--Stellenbosch University, 2019.
format Thesis
id oai:scholar.sun.ac.za:10019.1/106113
institution Stellenbosch University (South Africa)
language en_ZA
last_indexed 2026-06-10T12:45:28.762Z
license_str Other — see source repository
provenance_str_mv Harvested via OAI-PMH from SUNScholar — Stellenbosch University Repository
publishDate 2019
publishDateRange 2019
publishDateSort 2019
publisher Stellenbosch : Stellenbosch University
publisherStr Stellenbosch : Stellenbosch University
record_format dspace
source_str SUNScholar — Stellenbosch University Repository
spelling oai:scholar.sun.ac.za:10019.1/106113 Deep learning for tabular data : an exploratory study Marais, Jan Andre Bierman, Surette Stellenbosch University. Faculty of Economic and Management Sciences. Dept. of Statistics and Actuarial Science. Machine learning Deep learning Neural networks (Computer science) Tabular Data UCTD Thesis (MCom)--Stellenbosch University, 2019. ENGLISH SUMMARY : From about 2006, deep learning has proven to be very successul in application areas such as computer vision, natural language processing, speech and audio recognition, machine translation, bioinformatics, and social network filtering. These successes were undoubtedly facilitated by many advances in neural network architectures. In contrast, deep learning has not yet been found to excel in the context of tabular datasets. Many key machine learning tasks make use of tabular data, where currently the best machine learning models for tabular data use classification or regression trees as base learners. Therefore, the objective of this study is to identify, discuss and explore recent developments in deep learning which may be used to enhance the accuracy of deep neural networks in the tabular data domain. All major developments in the deep learning field are discussed and critically considered, with a view to improving deep learning in the context of tabular data. The challenges of applying deep learning to tabular data are identified, and on each of these fronts, potential improvements are proposed. The most promising modern deep learning architectures are further explored by means of empirical work. We also evaluate the validity of findings reported in the literature, and comment on the effectiveness of recent proposals. A useful byproduct of the study is the development of a code base that may be used to implement the latest deep learning techniques, as well as for comparative model selection experiments. AFRIKAANSE OPSOMMING : Vanaf ongeveer 2006 is die sukses van diepleer-tegnieke in toespassings-areas soos rekenaarvisie, taalprosessering, spraak- en klankherkenning, masjienvertaling, bio-informatika, en om sosiale netwerk te filtreer, alombekend. Die sukses van diepleer-metodes is ongetwyfeld aangehelp deur baie ontwikkelings rondom die argitektuur van neurale netwerke. Nogtans is bevind dat diep neural netwerke tot dusver nie goed vaar in die konteks van die gebruik van gewone matriksvorm data nie. Verskeie belangrike masjienleer take maak gebruik van matriksvorm data, waar die beste masjienleer modelle in hierdie konteks klassifikasie- of regressiebome gebruik as basis. Derhalwe is die doelwit van hierdie studie om onlangse ontwikkelings in diepleer (wat gebruik kan word om die akkuraatheid van diep neural netwerke te verbeter in die konteks van matriksvorm-data), te identifiseer, te bespreek, en empiries te ondersoek. Alle belangrike ontwikkelings in die diepleer veld word bespreek, en krities beskou, ten einde diepleer te verbeter in die konteks van matriksvorm data. Die uitdagings wat die toepassing van diepleer op matriksvorm data bied, word geidentifiseer, en op elkeen van hierdie fronte word potensiële verbeterings voorgestel. Die belowendste moderne diepleer argitekture word deur middel van empiriese werk verder verken. Ons evalueer ook die geldigheid van bevindings wat in die literatuur rapporteer word, en lewer kommentaar op die effektiwiteit van onlangse voorstelle. ’n Nuttige byproduk van die studie is die ontwikkeling van ’n kodebasis wat gebruik kan word vir die implementering van die nuutste diepleer-tegnieke, asook vir vergelykende eksperimente rondom modelseleksie. Masters 2019-02-26T11:01:04Z 2019-04-17T08:29:14Z 2019-02-26T11:01:04Z 2019-04-17T08:29:14Z 2019-04 Thesis http://hdl.handle.net/10019.1/106113 en_ZA Stellenbosch University xvi, 127 pages ; illustrations, includes annexures application/pdf Stellenbosch : Stellenbosch University
spellingShingle Machine learning
Deep learning
Neural networks (Computer science)
Tabular Data
UCTD
Marais, Jan Andre
Deep learning for tabular data : an exploratory study
title Deep learning for tabular data : an exploratory study
title_full Deep learning for tabular data : an exploratory study
title_fullStr Deep learning for tabular data : an exploratory study
title_full_unstemmed Deep learning for tabular data : an exploratory study
title_short Deep learning for tabular data : an exploratory study
title_sort deep learning for tabular data an exploratory study
topic Machine learning
Deep learning
Neural networks (Computer science)
Tabular Data
UCTD
url http://hdl.handle.net/10019.1/106113
work_keys_str_mv AT maraisjanandre deeplearningfortabulardataanexploratorystudy