Training end-to-end planners in sensor-level simulation

dc.contributor.authorÁlvarez Guinarte, Miguel
dc.contributor.departmentChalmers tekniska högskola / Institutionen för elektrotekniksv
dc.contributor.examinerKahl, Fredrik
dc.contributor.supervisorJohnander Faxén, Joakim
dc.contributor.supervisorTaveira, Bernardo
dc.date.accessioned2026-06-23T17:04:35Z
dc.date.issued2026
dc.date.submitted
dc.description.abstractEnd-to-end autonomous driving planners are commonly trained with imitation learning on offline expert demonstrations, an approach that can achieve strong openloop performance. However, planners following this paradigm usually suffer during closed-loop deployment, since they have not been exposed to the consequences of their own actions during training. Addressing this requires closed-loop training, which has typically involved high-level representations, forgoing the benefits of sensor-level end-to-end planning. This work investigates whether recent advances in efficient 3D Gaussian Splatting can make closed-loop training of sensor-level end-to-end planners feasible. A closedloop simulator is developed by reconstructing driving sequences and integrating them with a trajectory tracker and a kinematic vehicle model capable of executing the planner’s predicted trajectories. This makes it possible to render new views that deviate from the original logged trajectory. The resulting framework is used to fine-tune a pretrained Latent TransFuser planner through reinforcement learning and closed-loop imitation learning strategies. The simulator shows that closed-loop execution reveals failure modes that are not visible from open-loop evaluation, highlighting the importance of closed-loop training and evaluation. The explored training strategies produced mixed but some modest improvements over the baseline, supporting the feasibility of Gaussian Splattingbased sensor-level simulation as a training platform and laying the groundwork for future work on scalable closed-loop learning for end-to-end autonomous driving.
dc.identifier.coursecodeEENX30
dc.identifier.urihttps://hdl.handle.net/20.500.12380/311477
dc.language.isoeng
dc.setspec.uppsokTechnology
dc.subjectautonomous driving
dc.subjectend-to-end planning
dc.subjectclosed-loop training
dc.subjectsensorlevel simulation
dc.subject3D Gaussian Splatting
dc.subjectimitation learning
dc.subjectreinforcement learning
dc.titleTraining end-to-end planners in sensor-level simulation
dc.type.degreeExamensarbete för masterexamensv
dc.type.degreeMaster's Thesisen
dc.type.uppsokH
local.programmeComplex adaptive systems (MPCAS), MSc

Ladda ner

Original bundle

Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
MasterThesis_MiguelAlvarezGuinarte.pdf
Size:
54.01 MB
Format:
Adobe Portable Document Format

License bundle

Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
license.txt
Size:
2.35 KB
Format:
Item-specific license agreed upon to submission
Description: