Full Text Available

Note: Clicking the button above will open the full text document at the original institutional repository in a new window.

Accelerating deep reinforcement learning for autonomous racing

Thesis (PhD)--Stellenbosch University, 2023.

Saved in:
Bibliographic Details
Main Author: Evans, Benjamin
Other Authors: Jordaan, Willem
Format: Thesis
Language:en_ZA
en_ZA
Published: Stellenbosch : Stellenbosch University 2023
Subjects:
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1867613764057038848
access_status_str Open Access
author Evans, Benjamin
author2 Jordaan, Willem
author_browse Evans, Benjamin
Jordaan, Willem
author_facet Jordaan, Willem
Evans, Benjamin
author_sort Evans, Benjamin
collection Thesis
dc_rights_str_mv Stellenbosch University
description Thesis (PhD)--Stellenbosch University, 2023.
format Thesis
id oai:scholar.sun.ac.za:10019.1/127272
institution Stellenbosch University (South Africa)
language en_ZA
en_ZA
last_indexed 2026-06-10T12:41:19.685Z
license_str Other — see source repository
provenance_str_mv Harvested via OAI-PMH from SUNScholar — Stellenbosch University Repository
publishDate 2023
publishDateRange 2023
publishDateSort 2023
publisher Stellenbosch : Stellenbosch University
publisherStr Stellenbosch : Stellenbosch University
record_format dspace
source_str SUNScholar — Stellenbosch University Repository
spelling oai:scholar.sun.ac.za:10019.1/127272 Accelerating deep reinforcement learning for autonomous racing Evans, Benjamin Jordaan, Willem Engelbrecht, Herman Stellenbosch University. Faculty of Engineering. Dept. of Electrical and Electronic Engineering. Formula One automobiles Automated vehicles Machinery, Kinematics of Reinforcement learning Thesis (PhD)--Stellenbosch University, 2023. ENGLISH ABSTRACT: The F1/10th racing problem is to use the onboard LiDAR scan to calculate speed and steering references to move a 1/10th scale car around the track as quickly as possible. While planning has typically used perception, planning and control pipelines, recently, deep reinforcement learning (DRL) has grown in popularity due to its advantages of not requiring explicit state representation and environmental flexibility. Current approaches have suffered from poor performance at low speeds, safety concerns exacerbated by sim-toreal transfer, and few approaches have considered obstacle avoidance. The first contribution of this work is the development of high-speed learning formulations for autonomous racing. A comprehensive evaluation of previous approaches concludes that current learning formulations train agents to select infeasible speed profiles, resulting in the agents being unable to race using the vehicle’s full speed profile. This problem is overcome by using analytical vehicle models to develop learning formulations for improved speed selection. The performance evaluation shows that the novel formulations enable the vehicle to learn a feasible speed profile using the vehicle’s full speed range and achieve lower lap times than previous methods in the literature. This result indicates that using vehicle models improves high-performance racing behaviour. The second contribution of this work is to enable online learning by using a supervisory safety system (SSS). A safety system is designed that uses viability theory to ensure vehicle safety, irrespective of the planner used. The SSS is incorporated into the learning formulation and used to train DRL agents to race without them ever crashing. The novel learning formulation is extensively evaluated in simulation, demonstrating that online training can train agents to race without ever crashing, achieve a 10× improvement in sample efficiency and that the trained agents select conservative speed profiles. The proposed method is validated at constant speed on a physical vehicle, demonstrating that an agent can be trained from random to drive around a track without ever crashing. The final contribution of this work is to explore how DRL agents can be used to expand the ability of current classical planners to avoid unmapped obstacles. Three hybrid architectures that combine classical and learning components are presented and evaluated. The modification planner, which combines a path follower and DRL agent in parallel, demonstrates the ability to track a reference path while avoiding unmapped obstacles. The results indicate that combining classical and DRL components can improve the performance of DRL agents while enabling classical solutions to avoid obstacles. AFRIKAANSE OPSOMMING: Geen opsomming beskikbaar. Doctorate 2023-03-03T07:50:21Z 2023-05-18T07:13:21Z 2023-03-03T07:50:21Z 2023-05-18T07:13:21Z 2023-03 Thesis http://hdl.handle.net/10019.1/127272 en_ZA en_ZA Stellenbosch University ix, 175 pages : illustrations application/pdf Stellenbosch : Stellenbosch University
spellingShingle Formula One automobiles
Automated vehicles
Machinery, Kinematics of
Reinforcement learning
Evans, Benjamin
Accelerating deep reinforcement learning for autonomous racing
title Accelerating deep reinforcement learning for autonomous racing
title_full Accelerating deep reinforcement learning for autonomous racing
title_fullStr Accelerating deep reinforcement learning for autonomous racing
title_full_unstemmed Accelerating deep reinforcement learning for autonomous racing
title_short Accelerating deep reinforcement learning for autonomous racing
title_sort accelerating deep reinforcement learning for autonomous racing
topic Formula One automobiles
Automated vehicles
Machinery, Kinematics of
Reinforcement learning
url http://hdl.handle.net/10019.1/127272
work_keys_str_mv AT evansbenjamin acceleratingdeepreinforcementlearningforautonomousracing