Adaptive Model-Free Control Applied to Truck Front Wheel Drive: Real time control with reinforcement learning utilising recurrent deterministic policy gradient

Johansson, Oskar; Lundgren, Benjamin

Adaptive Model-Free Control Applied to Truck Front Wheel Drive: Real time control with reinforcement learning utilising recurrent deterministic policy gradient

Ladda ner

2021-39 Oskar Johansson & Benjamin Lundgren.pdf (2.79 MB)

Publicerad

2021

Författare

Johansson, Oskar

Lundgren, Benjamin

Typ

Examensarbete för masterexamen

Program

Systems, control and mechatronics (MPSYS), MSc

Sammanfattning

This thesis investigates how reinforcement learning methods can be used to achieve adaptive and model-free control of a hydraulic front-wheel drive system. First, an existing sub-optimal controller is emulated by an artificial neural network using supervised learning. The continuous action- and state space reinforcement learning method “Recurrent Deterministic Policy Gradients” (RDPG) is then modified to work continuously in real time and implemented to improve the performance of the network and make it adaptive. The emulating network performed similarly, albeit somewhat worse, compared to the original controller. Using RDPG with the emulating network against a simple model of the hydraulic system showed that the network adapted and further improved the performance from the sub-optimal starting point. However, applying the RDPG algorithm against a real system was infeasible with the selected hyper-parameters and would require further investigation for the algorithm to converge. The conclusion is that using RDPG for adaptive model-free control can be feasible for non-linear dynamic system that exhibit slow, gradual changes and that first emulating a sub-optimal controller can enable learning on a system where learning from the beginning is not desired or possible.

Ämne/nyckelord

Reinforcement Learning, Machine Learning, Transfer Learning, Recurrent Deterministic Policy Gradients (RDPG), Adaptive control, Model-Free Control

URI

https://hdl.handle.net/20.500.12380/302725

Samlingar

Examensarbeten för masterexamen

Visa fullständig post

Adaptive Model-Free Control Applied to Truck Front Wheel Drive: Real time control with reinforcement learning utilising recurrent deterministic policy gradient

Ladda ner

Publicerad

Författare

Typ

Program

Modellbyggare

Tidskriftstitel

ISSN

Volymtitel

Utgivare

Sammanfattning

Beskrivning

Ämne/nyckelord

Citation

Arkitekt (konstruktör)

Geografisk plats

Byggnad (typ)

Byggår

Modelltyp

Skala

Teknik / material

Index

URI

Samlingar

item.page.endorsement

item.page.review

item.page.supplemented

item.page.referenced