Autonomous Excavation Using Reinforcement Learning with Proximal Policy Optimization
dc.contributor.author | Sanderöd, Mårten | |
dc.contributor.author | Tryggvason, Oskar | |
dc.contributor.department | Chalmers tekniska högskola / Institutionen för mekanik och maritima vetenskaper | sv |
dc.contributor.department | Chalmers University of Technology / Department of Mechanics and Maritime Sciences | en |
dc.contributor.examiner | Forsberg, Peter | |
dc.contributor.supervisor | Carlson, Marcus | |
dc.contributor.supervisor | Landgren, Malte | |
dc.date.accessioned | 2025-07-01T07:36:14Z | |
dc.date.issued | 2025 | |
dc.date.submitted | ||
dc.description.abstract | This thesis presents a reinforcement learning based approach for grading, applied in the Volvo excavator EC550E. The work was based around a simulation model of the excavator which facilitated easy training of the algorithm. A hydraulic controller was trained using proximal policy optimization. Together with the hydraulic controller, a PID was implemented as a positional controller for a complete system capable of performing grading tasks. Training was conducted by testing different reward functions and parameter choices to improve policy performance. The results showcases hyperparameter evaluation, velocity tracking accuracy for the hydraulic controller as well as grading accuracy of the complete system. The implemented solution had an accuracy of ± 4 cm during grading. However, the hydraulic controller was not able to consistently follow the target velocities in the cylinders, particularly for the bucket. In future works the hydraulic controller needs to be retrained for better precision before being deployed in a real machine. This thesis shows the potential and possibility of replacing traditional control policies with an machine-learning driven approach. | |
dc.identifier.coursecode | MMSX30 | |
dc.identifier.uri | http://hdl.handle.net/20.500.12380/309790 | |
dc.language.iso | eng | |
dc.setspec.uppsok | Technology | |
dc.subject | autonomous excavation | |
dc.subject | excavator | |
dc.subject | hydraulics | |
dc.subject | IMVT | |
dc.subject | machine learning | |
dc.subject | proximal policy | |
dc.subject | optimization | |
dc.subject | reinforcement learning | |
dc.title | Autonomous Excavation Using Reinforcement Learning with Proximal Policy Optimization | |
dc.type.degree | Examensarbete för masterexamen | sv |
dc.type.degree | Master's Thesis | en |
dc.type.uppsok | H | |
local.programme | Complex adaptive systems (MPCAS), MSc |