Training end-to-end planners in sensor-level simulation

Álvarez Guinarte, Miguel

Training end-to-end planners in sensor-level simulation

dc.contributor.author	Álvarez Guinarte, Miguel
dc.contributor.department	Chalmers tekniska högskola / Institutionen för elektroteknik	sv
dc.contributor.examiner	Kahl, Fredrik
dc.contributor.supervisor	Johnander Faxén, Joakim
dc.contributor.supervisor	Taveira, Bernardo
dc.date.accessioned	2026-06-23T17:04:35Z
dc.date.issued	2026
dc.date.submitted
dc.description.abstract	End-to-end autonomous driving planners are commonly trained with imitation learning on offline expert demonstrations, an approach that can achieve strong openloop performance. However, planners following this paradigm usually suffer during closed-loop deployment, since they have not been exposed to the consequences of their own actions during training. Addressing this requires closed-loop training, which has typically involved high-level representations, forgoing the benefits of sensor-level end-to-end planning. This work investigates whether recent advances in efficient 3D Gaussian Splatting can make closed-loop training of sensor-level end-to-end planners feasible. A closedloop simulator is developed by reconstructing driving sequences and integrating them with a trajectory tracker and a kinematic vehicle model capable of executing the planner’s predicted trajectories. This makes it possible to render new views that deviate from the original logged trajectory. The resulting framework is used to fine-tune a pretrained Latent TransFuser planner through reinforcement learning and closed-loop imitation learning strategies. The simulator shows that closed-loop execution reveals failure modes that are not visible from open-loop evaluation, highlighting the importance of closed-loop training and evaluation. The explored training strategies produced mixed but some modest improvements over the baseline, supporting the feasibility of Gaussian Splattingbased sensor-level simulation as a training platform and laying the groundwork for future work on scalable closed-loop learning for end-to-end autonomous driving.
dc.identifier.coursecode	EENX30
dc.identifier.uri	https://hdl.handle.net/20.500.12380/311477
dc.language.iso	eng
dc.setspec.uppsok	Technology
dc.subject	autonomous driving
dc.subject	end-to-end planning
dc.subject	closed-loop training
dc.subject	sensorlevel simulation
dc.subject	3D Gaussian Splatting
dc.subject	imitation learning
dc.subject	reinforcement learning
dc.title	Training end-to-end planners in sensor-level simulation
dc.type.degree	Examensarbete för masterexamen	sv
dc.type.degree	Master's Thesis	en
dc.type.uppsok	H
local.programme	Complex adaptive systems (MPCAS), MSc

Ladda ner

Original bundle

Visar 1 - 1 av 1

Namn:: MasterThesis_MiguelAlvarezGuinarte.pdf
Size:: 54.01 MB
Format:: Adobe Portable Document Format

Ladda ner

License bundle

Visar 1 - 1 av 1

Namn:: license.txt
Size:: 2.35 KB
Format:: Item-specific license agreed upon to submission
Description:

Ladda ner

Samlingar

Examensarbeten för masterexamen