Autonomous Excavation Using Reinforcement Learning with Proximal Policy Optimization

Sanderöd, Mårten; Tryggvason, Oskar

Autonomous Excavation Using Reinforcement Learning with Proximal Policy Optimization

dc.contributor.author	Sanderöd, Mårten
dc.contributor.author	Tryggvason, Oskar
dc.contributor.department	Chalmers tekniska högskola / Institutionen för mekanik och maritima vetenskaper	sv
dc.contributor.department	Chalmers University of Technology / Department of Mechanics and Maritime Sciences	en
dc.contributor.examiner	Forsberg, Peter
dc.contributor.supervisor	Carlson, Marcus
dc.contributor.supervisor	Landgren, Malte
dc.date.accessioned	2025-07-01T07:36:14Z
dc.date.issued	2025
dc.date.submitted
dc.description.abstract	This thesis presents a reinforcement learning based approach for grading, applied in the Volvo excavator EC550E. The work was based around a simulation model of the excavator which facilitated easy training of the algorithm. A hydraulic controller was trained using proximal policy optimization. Together with the hydraulic controller, a PID was implemented as a positional controller for a complete system capable of performing grading tasks. Training was conducted by testing different reward functions and parameter choices to improve policy performance. The results showcases hyperparameter evaluation, velocity tracking accuracy for the hydraulic controller as well as grading accuracy of the complete system. The implemented solution had an accuracy of ± 4 cm during grading. However, the hydraulic controller was not able to consistently follow the target velocities in the cylinders, particularly for the bucket. In future works the hydraulic controller needs to be retrained for better precision before being deployed in a real machine. This thesis shows the potential and possibility of replacing traditional control policies with an machine-learning driven approach.
dc.identifier.coursecode	MMSX30
dc.identifier.uri	http://hdl.handle.net/20.500.12380/309790
dc.language.iso	eng
dc.setspec.uppsok	Technology
dc.subject	autonomous excavation
dc.subject	excavator
dc.subject	hydraulics
dc.subject	IMVT
dc.subject	machine learning
dc.subject	proximal policy
dc.subject	optimization
dc.subject	reinforcement learning
dc.title	Autonomous Excavation Using Reinforcement Learning with Proximal Policy Optimization
dc.type.degree	Examensarbete för masterexamen	sv
dc.type.degree	Master's Thesis	en
dc.type.uppsok	H
local.programme	Complex adaptive systems (MPCAS), MSc

Ladda ner

Original bundle

Visar 1 - 1 av 1

Namn:: 2025 Mårten Sanderöd & Oskar Tryggvason.pdf
Storlek:: 4.22 MB
Format:: Adobe Portable Document Format

Ladda ner

License bundle

Visar 1 - 1 av 1

Namn:: license.txt
Storlek:: 2.35 KB
Format:: Item-specific license agreed upon to submission
Beskrivning:

Ladda ner

Samlingar

Examensarbeten för masterexamen