Full Text Available

Note: Clicking the button above will open the full text document at the original institutional repository in a new window.

A generic optimisation framework for reinforcement learning in the foreign exchange market

Thesis (PhD)--Stellenbosch University, 2026.

Saved in:
Bibliographic Details
Main Author: De Wit, John Spencer
Other Authors: Van Vuuren, J. H.
Format: Thesis
Language:English
Published: Stellenbosch : Stellenbosch University 2026
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1867613840201482240
access_status_str Open Access
author De Wit, John Spencer
author2 Van Vuuren, J. H.
author_browse De Wit, John Spencer
Van Vuuren, J. H.
author_facet Van Vuuren, J. H.
De Wit, John Spencer
author_sort De Wit, John Spencer
collection Thesis
dc_rights_str_mv Stellenbosch University
description Thesis (PhD)--Stellenbosch University, 2026.
format Thesis
id oai:scholar.sun.ac.za:10019.1/135739
institution Stellenbosch University (South Africa)
language English
last_indexed 2026-06-10T12:42:31.964Z
license_str Other — see source repository
provenance_str_mv Harvested via OAI-PMH from SUNScholar — Stellenbosch University Repository
publishDate 2026
publishDateRange 2026
publishDateSort 2026
publisher Stellenbosch : Stellenbosch University
publisherStr Stellenbosch : Stellenbosch University
record_format dspace
source_str SUNScholar — Stellenbosch University Repository
spelling oai:scholar.sun.ac.za:10019.1/135739 A generic optimisation framework for reinforcement learning in the foreign exchange market De Wit, John Spencer Van Vuuren, J. H. Stellenbosch University. Faculty of Engineering. Dept. of Industrial Engineering. Thesis (PhD)--Stellenbosch University, 2026. De Wit, J. S. 2026. A generic optimisation framework for reinforcement learning in the foreign exchange market. Unpublished doctoral dissertation. Stellenbosch: Stellenbosch University [online]. Available: https://scholar.sun.ac.za/items/3ff15aa9-fe84-4dbc-a9cb-3ddc75eab4cf The application of algorithmic trading in financial markets has grown considerably in recent years, with deep reinforcement learning emerging as a prominent technique for developing autonomous trading agents. The success of such agents is, however, often hindered by the non-stationary nature of financial markets and the inherently opaque decision-making processes of deep reinforcement learning models. An abundance of research has been dedicated to applying deep reinforcement learning in finance, resulting in various frameworks for algorithmic trading. Most of these existing frameworks are, however, aimed at single facets of the trading problem, such as the application of a specific deep reinforcement learning algorithm or a narrow feature engineering approach. Generic, integrated frameworks that accommodate market non-stationarity and embed explainability techniques into the model development lifecycle are largely absent from the literature. A generic framework is proposed in this dissertation that facilitates a systematic approach towards the optimisation, comparative evaluation, and selection of deep reinforcement learning trading agents. The design of the framework accommodates market non-stationarity by following a non-parametric data processing pipeline, which utilises time series partitioning techniques and distribution-based clustering approaches to identify and classify distinct market regimes. These regime-labelled data are then utilised in an iterative process during which explainable artificial intelligence techniques are employed to facilitate data-driven state-space optimisation and agent transparency enhancement. The framework culminates in a robust evaluation methodology—comprising multi-seed training, rolling-window back-testing, and the execution of non-parametric statistical tests—in order to quantify and comparatively analyse the performance of deep reinforcement learning candidate agents, thereby ultimately guiding the user's selection of a suitable configuration for a given market context. A computerised instantiation of the framework is implemented as a proof of concept. The efficacy of its constituent components is first verified in respect of synthetic and historical market data, after which the practical applicability of the framework is demonstrated by applying the framework instantiation to two case studies comprising historical market data. Doctoral 2026-04-09T06:55:32Z 2026-04-09T06:55:32Z 2026-03 Thesis https://scholar.sun.ac.za/handle/10019.1/135739 en Stellenbosch University 403 pages application/pdf Stellenbosch : Stellenbosch University
spellingShingle De Wit, John Spencer
A generic optimisation framework for reinforcement learning in the foreign exchange market
title A generic optimisation framework for reinforcement learning in the foreign exchange market
title_full A generic optimisation framework for reinforcement learning in the foreign exchange market
title_fullStr A generic optimisation framework for reinforcement learning in the foreign exchange market
title_full_unstemmed A generic optimisation framework for reinforcement learning in the foreign exchange market
title_short A generic optimisation framework for reinforcement learning in the foreign exchange market
title_sort generic optimisation framework for reinforcement learning in the foreign exchange market
url https://scholar.sun.ac.za/handle/10019.1/135739
work_keys_str_mv AT dewitjohnspencer agenericoptimisationframeworkforreinforcementlearningintheforeignexchangemarket
AT dewitjohnspencer genericoptimisationframeworkforreinforcementlearningintheforeignexchangemarket