Full Text Available

Access Repository

Note: Clicking the button above will open the full text document at the original institutional repository in a new window.

A generic optimisation framework for reinforcement learning in the foreign exchange market

Thesis (PhD)--Stellenbosch University, 2026.

Saved in:

Bibliographic Details
Main Author:	De Wit, John Spencer
Other Authors:	Van Vuuren, J. H.
Format:	Thesis
Language:	English
Published:	Stellenbosch : Stellenbosch University 2026
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1867613840201482240
access_status_str	Open Access
author	De Wit, John Spencer
author2	Van Vuuren, J. H.
author_browse	De Wit, John Spencer Van Vuuren, J. H.
author_facet	Van Vuuren, J. H. De Wit, John Spencer
author_sort	De Wit, John Spencer
collection	Thesis
dc_rights_str_mv	Stellenbosch University
description	Thesis (PhD)--Stellenbosch University, 2026.
format	Thesis
id	oai:scholar.sun.ac.za:10019.1/135739
institution	Stellenbosch University (South Africa)
language	English
last_indexed	2026-06-10T12:42:31.964Z
license_str	Other — see source repository
provenance_str_mv	Harvested via OAI-PMH from SUNScholar — Stellenbosch University Repository
publishDate	2026
publishDateRange	2026
publishDateSort	2026
publisher	Stellenbosch : Stellenbosch University
publisherStr	Stellenbosch : Stellenbosch University
record_format	dspace
source_str	SUNScholar — Stellenbosch University Repository
spelling	oai:scholar.sun.ac.za:10019.1/135739 A generic optimisation framework for reinforcement learning in the foreign exchange market De Wit, John Spencer Van Vuuren, J. H. Stellenbosch University. Faculty of Engineering. Dept. of Industrial Engineering. Thesis (PhD)--Stellenbosch University, 2026. De Wit, J. S. 2026. A generic optimisation framework for reinforcement learning in the foreign exchange market. Unpublished doctoral dissertation. Stellenbosch: Stellenbosch University [online]. Available: https://scholar.sun.ac.za/items/3ff15aa9-fe84-4dbc-a9cb-3ddc75eab4cf The application of algorithmic trading in financial markets has grown considerably in recent years, with deep reinforcement learning emerging as a prominent technique for developing autonomous trading agents. The success of such agents is, however, often hindered by the non-stationary nature of financial markets and the inherently opaque decision-making processes of deep reinforcement learning models. An abundance of research has been dedicated to applying deep reinforcement learning in finance, resulting in various frameworks for algorithmic trading. Most of these existing frameworks are, however, aimed at single facets of the trading problem, such as the application of a specific deep reinforcement learning algorithm or a narrow feature engineering approach. Generic, integrated frameworks that accommodate market non-stationarity and embed explainability techniques into the model development lifecycle are largely absent from the literature. A generic framework is proposed in this dissertation that facilitates a systematic approach towards the optimisation, comparative evaluation, and selection of deep reinforcement learning trading agents. The design of the framework accommodates market non-stationarity by following a non-parametric data processing pipeline, which utilises time series partitioning techniques and distribution-based clustering approaches to identify and classify distinct market regimes. These regime-labelled data are then utilised in an iterative process during which explainable artificial intelligence techniques are employed to facilitate data-driven state-space optimisation and agent transparency enhancement. The framework culminates in a robust evaluation methodology—comprising multi-seed training, rolling-window back-testing, and the execution of non-parametric statistical tests—in order to quantify and comparatively analyse the performance of deep reinforcement learning candidate agents, thereby ultimately guiding the user's selection of a suitable configuration for a given market context. A computerised instantiation of the framework is implemented as a proof of concept. The efficacy of its constituent components is first verified in respect of synthetic and historical market data, after which the practical applicability of the framework is demonstrated by applying the framework instantiation to two case studies comprising historical market data. Doctoral 2026-04-09T06:55:32Z 2026-04-09T06:55:32Z 2026-03 Thesis https://scholar.sun.ac.za/handle/10019.1/135739 en Stellenbosch University 403 pages application/pdf Stellenbosch : Stellenbosch University
spellingShingle	De Wit, John Spencer A generic optimisation framework for reinforcement learning in the foreign exchange market
title	A generic optimisation framework for reinforcement learning in the foreign exchange market
title_full	A generic optimisation framework for reinforcement learning in the foreign exchange market
title_fullStr	A generic optimisation framework for reinforcement learning in the foreign exchange market
title_full_unstemmed	A generic optimisation framework for reinforcement learning in the foreign exchange market
title_short	A generic optimisation framework for reinforcement learning in the foreign exchange market
title_sort	generic optimisation framework for reinforcement learning in the foreign exchange market
url	https://scholar.sun.ac.za/handle/10019.1/135739
work_keys_str_mv	AT dewitjohnspencer agenericoptimisationframeworkforreinforcementlearningintheforeignexchangemarket AT dewitjohnspencer genericoptimisationframeworkforreinforcementlearningintheforeignexchangemarket

Full Text Available

A generic optimisation framework for reinforcement learning in the foreign exchange market

Similar Items