Adaptive Model-Free Control Applied to Truck Front Wheel Drive: Real time control with reinforcement learning utilising recurrent deterministic policy gradient

Johansson, Oskar; Lundgren, Benjamin

Adaptive Model-Free Control Applied to Truck Front Wheel Drive: Real time control with reinforcement learning utilising recurrent deterministic policy gradient

dc.contributor.author	Johansson, Oskar
dc.contributor.author	Lundgren, Benjamin
dc.contributor.department	Chalmers tekniska högskola / Institutionen för mekanik och maritima vetenskaper	sv
dc.contributor.examiner	Forsberg, Peter
dc.contributor.supervisor	Karlsson, Martin
dc.contributor.supervisor	Broberg, Marcus
dc.contributor.supervisor	Göök, Ola
dc.date.accessioned	2021-06-27T11:44:15Z
dc.date.available	2021-06-27T11:44:15Z
dc.date.issued	2021	sv
dc.date.submitted	2020
dc.description.abstract	This thesis investigates how reinforcement learning methods can be used to achieve adaptive and model-free control of a hydraulic front-wheel drive system. First, an existing sub-optimal controller is emulated by an artificial neural network using supervised learning. The continuous action- and state space reinforcement learning method “Recurrent Deterministic Policy Gradients” (RDPG) is then modified to work continuously in real time and implemented to improve the performance of the network and make it adaptive. The emulating network performed similarly, albeit somewhat worse, compared to the original controller. Using RDPG with the emulating network against a simple model of the hydraulic system showed that the network adapted and further improved the performance from the sub-optimal starting point. However, applying the RDPG algorithm against a real system was infeasible with the selected hyper-parameters and would require further investigation for the algorithm to converge. The conclusion is that using RDPG for adaptive model-free control can be feasible for non-linear dynamic system that exhibit slow, gradual changes and that first emulating a sub-optimal controller can enable learning on a system where learning from the beginning is not desired or possible.	sv
dc.identifier.coursecode	MMSX30	sv
dc.identifier.uri	https://hdl.handle.net/20.500.12380/302725
dc.language.iso	eng	sv
dc.relation.ispartofseries	2021:39	sv
dc.setspec.uppsok	Technology
dc.subject	Reinforcement Learning	sv
dc.subject	Machine Learning	sv
dc.subject	Transfer Learning	sv
dc.subject	Recurrent Deterministic Policy Gradients (RDPG)	sv
dc.subject	Adaptive control	sv
dc.subject	Model-Free Control	sv
dc.title	Adaptive Model-Free Control Applied to Truck Front Wheel Drive: Real time control with reinforcement learning utilising recurrent deterministic policy gradient	sv
dc.type.degree	Examensarbete för masterexamen	sv
dc.type.uppsok	H
local.programme	Systems, control and mechatronics (MPSYS), MSc

Ladda ner

Original bundle

Visar 1 - 1 av 1

Namn:: 2021-39 Oskar Johansson & Benjamin Lundgren.pdf
Storlek:: 2.79 MB
Format:: Adobe Portable Document Format
Beskrivning:: Master Thesis

Ladda ner

License bundle

Visar 1 - 1 av 1

Namn:: license.txt
Storlek:: 1.51 KB
Format:: Item-specific license agreed upon to submission
Beskrivning:

Ladda ner

Samlingar

Examensarbeten för masterexamen