Autonomous Excavation Using Reinforcement Learning with Proximal Policy Optimization

dc.contributor.authorSanderöd, Mårten
dc.contributor.authorTryggvason, Oskar
dc.contributor.departmentChalmers tekniska högskola / Institutionen för mekanik och maritima vetenskapersv
dc.contributor.departmentChalmers University of Technology / Department of Mechanics and Maritime Sciencesen
dc.contributor.examinerForsberg, Peter
dc.contributor.supervisorCarlson, Marcus
dc.contributor.supervisorLandgren, Malte
dc.date.accessioned2025-07-01T07:36:14Z
dc.date.issued2025
dc.date.submitted
dc.description.abstractThis thesis presents a reinforcement learning based approach for grading, applied in the Volvo excavator EC550E. The work was based around a simulation model of the excavator which facilitated easy training of the algorithm. A hydraulic controller was trained using proximal policy optimization. Together with the hydraulic controller, a PID was implemented as a positional controller for a complete system capable of performing grading tasks. Training was conducted by testing different reward functions and parameter choices to improve policy performance. The results showcases hyperparameter evaluation, velocity tracking accuracy for the hydraulic controller as well as grading accuracy of the complete system. The implemented solution had an accuracy of ± 4 cm during grading. However, the hydraulic controller was not able to consistently follow the target velocities in the cylinders, particularly for the bucket. In future works the hydraulic controller needs to be retrained for better precision before being deployed in a real machine. This thesis shows the potential and possibility of replacing traditional control policies with an machine-learning driven approach.
dc.identifier.coursecodeMMSX30
dc.identifier.urihttp://hdl.handle.net/20.500.12380/309790
dc.language.isoeng
dc.setspec.uppsokTechnology
dc.subjectautonomous excavation
dc.subjectexcavator
dc.subjecthydraulics
dc.subjectIMVT
dc.subjectmachine learning
dc.subjectproximal policy
dc.subjectoptimization
dc.subjectreinforcement learning
dc.titleAutonomous Excavation Using Reinforcement Learning with Proximal Policy Optimization
dc.type.degreeExamensarbete för masterexamensv
dc.type.degreeMaster's Thesisen
dc.type.uppsokH
local.programmeComplex adaptive systems (MPCAS), MSc

Ladda ner

Original bundle

Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
2025 Mårten Sanderöd & Oskar Tryggvason.pdf
Storlek:
4.22 MB
Format:
Adobe Portable Document Format

License bundle

Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
license.txt
Storlek:
2.35 KB
Format:
Item-specific license agreed upon to submission
Beskrivning: