Full Text Available
Note: Clicking the button above will open the full text document at the original institutional repository in a new window.
Thesis (PhD)--Stellenbosch University, 2023.
| Main Author: | |
|---|---|
| Other Authors: | |
| Format: | Thesis |
| Language: | en_ZA en_ZA |
| Published: |
Stellenbosch : Stellenbosch University
2023
|
| Subjects: | |
| Tags: |
No Tags, Be the first to tag this record!
|
| _version_ | 1867613764057038848 |
|---|---|
| access_status_str | Open Access |
| author | Evans, Benjamin |
| author2 | Jordaan, Willem |
| author_browse | Evans, Benjamin Jordaan, Willem |
| author_facet | Jordaan, Willem Evans, Benjamin |
| author_sort | Evans, Benjamin |
| collection | Thesis |
| dc_rights_str_mv | Stellenbosch University |
| description | Thesis (PhD)--Stellenbosch University, 2023. |
| format | Thesis |
| id | oai:scholar.sun.ac.za:10019.1/127272 |
| institution | Stellenbosch University (South Africa) |
| language | en_ZA en_ZA |
| last_indexed | 2026-06-10T12:41:19.685Z |
| license_str | Other — see source repository |
| provenance_str_mv | Harvested via OAI-PMH from SUNScholar — Stellenbosch University Repository |
| publishDate | 2023 |
| publishDateRange | 2023 |
| publishDateSort | 2023 |
| publisher | Stellenbosch : Stellenbosch University |
| publisherStr | Stellenbosch : Stellenbosch University |
| record_format | dspace |
| source_str | SUNScholar — Stellenbosch University Repository |
| spelling | oai:scholar.sun.ac.za:10019.1/127272 Accelerating deep reinforcement learning for autonomous racing Evans, Benjamin Jordaan, Willem Engelbrecht, Herman Stellenbosch University. Faculty of Engineering. Dept. of Electrical and Electronic Engineering. Formula One automobiles Automated vehicles Machinery, Kinematics of Reinforcement learning Thesis (PhD)--Stellenbosch University, 2023. ENGLISH ABSTRACT: The F1/10th racing problem is to use the onboard LiDAR scan to calculate speed and steering references to move a 1/10th scale car around the track as quickly as possible. While planning has typically used perception, planning and control pipelines, recently, deep reinforcement learning (DRL) has grown in popularity due to its advantages of not requiring explicit state representation and environmental flexibility. Current approaches have suffered from poor performance at low speeds, safety concerns exacerbated by sim-toreal transfer, and few approaches have considered obstacle avoidance. The first contribution of this work is the development of high-speed learning formulations for autonomous racing. A comprehensive evaluation of previous approaches concludes that current learning formulations train agents to select infeasible speed profiles, resulting in the agents being unable to race using the vehicle’s full speed profile. This problem is overcome by using analytical vehicle models to develop learning formulations for improved speed selection. The performance evaluation shows that the novel formulations enable the vehicle to learn a feasible speed profile using the vehicle’s full speed range and achieve lower lap times than previous methods in the literature. This result indicates that using vehicle models improves high-performance racing behaviour. The second contribution of this work is to enable online learning by using a supervisory safety system (SSS). A safety system is designed that uses viability theory to ensure vehicle safety, irrespective of the planner used. The SSS is incorporated into the learning formulation and used to train DRL agents to race without them ever crashing. The novel learning formulation is extensively evaluated in simulation, demonstrating that online training can train agents to race without ever crashing, achieve a 10× improvement in sample efficiency and that the trained agents select conservative speed profiles. The proposed method is validated at constant speed on a physical vehicle, demonstrating that an agent can be trained from random to drive around a track without ever crashing. The final contribution of this work is to explore how DRL agents can be used to expand the ability of current classical planners to avoid unmapped obstacles. Three hybrid architectures that combine classical and learning components are presented and evaluated. The modification planner, which combines a path follower and DRL agent in parallel, demonstrates the ability to track a reference path while avoiding unmapped obstacles. The results indicate that combining classical and DRL components can improve the performance of DRL agents while enabling classical solutions to avoid obstacles. AFRIKAANSE OPSOMMING: Geen opsomming beskikbaar. Doctorate 2023-03-03T07:50:21Z 2023-05-18T07:13:21Z 2023-03-03T07:50:21Z 2023-05-18T07:13:21Z 2023-03 Thesis http://hdl.handle.net/10019.1/127272 en_ZA en_ZA Stellenbosch University ix, 175 pages : illustrations application/pdf Stellenbosch : Stellenbosch University |
| spellingShingle | Formula One automobiles Automated vehicles Machinery, Kinematics of Reinforcement learning Evans, Benjamin Accelerating deep reinforcement learning for autonomous racing |
| title | Accelerating deep reinforcement learning for autonomous racing |
| title_full | Accelerating deep reinforcement learning for autonomous racing |
| title_fullStr | Accelerating deep reinforcement learning for autonomous racing |
| title_full_unstemmed | Accelerating deep reinforcement learning for autonomous racing |
| title_short | Accelerating deep reinforcement learning for autonomous racing |
| title_sort | accelerating deep reinforcement learning for autonomous racing |
| topic | Formula One automobiles Automated vehicles Machinery, Kinematics of Reinforcement learning |
| url | http://hdl.handle.net/10019.1/127272 |
| work_keys_str_mv | AT evansbenjamin acceleratingdeepreinforcementlearningforautonomousracing |