Adaptive Model-Free Control Applied to Truck Front Wheel Drive: Real time control with reinforcement learning utilising recurrent deterministic policy gradient
dc.contributor.author | Johansson, Oskar | |
dc.contributor.author | Lundgren, Benjamin | |
dc.contributor.department | Chalmers tekniska högskola / Institutionen för mekanik och maritima vetenskaper | sv |
dc.contributor.examiner | Forsberg, Peter | |
dc.contributor.supervisor | Karlsson, Martin | |
dc.contributor.supervisor | Broberg, Marcus | |
dc.contributor.supervisor | Göök, Ola | |
dc.date.accessioned | 2021-06-27T11:44:15Z | |
dc.date.available | 2021-06-27T11:44:15Z | |
dc.date.issued | 2021 | sv |
dc.date.submitted | 2020 | |
dc.description.abstract | This thesis investigates how reinforcement learning methods can be used to achieve adaptive and model-free control of a hydraulic front-wheel drive system. First, an existing sub-optimal controller is emulated by an artificial neural network using supervised learning. The continuous action- and state space reinforcement learning method “Recurrent Deterministic Policy Gradients” (RDPG) is then modified to work continuously in real time and implemented to improve the performance of the network and make it adaptive. The emulating network performed similarly, albeit somewhat worse, compared to the original controller. Using RDPG with the emulating network against a simple model of the hydraulic system showed that the network adapted and further improved the performance from the sub-optimal starting point. However, applying the RDPG algorithm against a real system was infeasible with the selected hyper-parameters and would require further investigation for the algorithm to converge. The conclusion is that using RDPG for adaptive model-free control can be feasible for non-linear dynamic system that exhibit slow, gradual changes and that first emulating a sub-optimal controller can enable learning on a system where learning from the beginning is not desired or possible. | sv |
dc.identifier.coursecode | MMSX30 | sv |
dc.identifier.uri | https://hdl.handle.net/20.500.12380/302725 | |
dc.language.iso | eng | sv |
dc.relation.ispartofseries | 2021:39 | sv |
dc.setspec.uppsok | Technology | |
dc.subject | Reinforcement Learning | sv |
dc.subject | Machine Learning | sv |
dc.subject | Transfer Learning | sv |
dc.subject | Recurrent Deterministic Policy Gradients (RDPG) | sv |
dc.subject | Adaptive control | sv |
dc.subject | Model-Free Control | sv |
dc.title | Adaptive Model-Free Control Applied to Truck Front Wheel Drive: Real time control with reinforcement learning utilising recurrent deterministic policy gradient | sv |
dc.type.degree | Examensarbete för masterexamen | sv |
dc.type.uppsok | H | |
local.programme | Systems, control and mechatronics (MPSYS), MSc |
Ladda ner
Original bundle
1 - 1 av 1
Hämtar...
- Namn:
- 2021-39 Oskar Johansson & Benjamin Lundgren.pdf
- Storlek:
- 2.79 MB
- Format:
- Adobe Portable Document Format
- Beskrivning:
- Master Thesis
License bundle
1 - 1 av 1
Hämtar...
- Namn:
- license.txt
- Storlek:
- 1.51 KB
- Format:
- Item-specific license agreed upon to submission
- Beskrivning: