Self-attention policy architectures for reinforcement learning under partial observability

Intermittent unavailability of sensory signals due to sensor failure and/or latency is a problem encountered in production environments such as large manufacturing plants. Deep reinforcement learning offers a natural solution for process control and optimisation in such environments....

Bibliographic Details
Main Author: Du Plessis, Jeremy
Other Authors: Shock, Jonathan
Format: Thesis
Language: English
Published: Department of Mathematics and Applied Mathematics 2025

Similar Items: Self-attention policy architectures for reinforcement learning under partial observability