Annotation-free Learning for Sensor Fusion in ADAS

dc.contributor.author: Björkman, Maria
dc.contributor.author: Tvingby, Ludvig
dc.contributor.department: Chalmers tekniska högskola / Institutionen för elektroteknik (sv)
dc.contributor.examiner: Hammarstrand, Lars
dc.contributor.supervisor: Wang, Tzu-Jui
dc.contributor.supervisor: Priisalu, Maria
dc.date.accessioned: 2025-12-01T13:59:47Z
dc.date.issued: 2025
dc.date.submitted:
dc.description.abstract: Vehicle automation has the potential to significantly improve road safety. Achieving comprehensive vehicle perception requires systems that optimally combine information from multiple sensor modalities. Such systems leverage the strengths of each modality while compensating for its weaknesses. By continuously encoding and fusing information from cameras, LiDARs, RADARs and the motion of the ego-vehicle, a dynamic representation of the surrounding environment can be created and maintained. A major challenge for these systems is the large amount of annotated data required for training, as manual labelling creates a significant bottleneck for scalability. In this study, a pre-training task for a multi-modal machine learning model was implemented and evaluated. To circumvent labour-intensive labelling, self-supervision was employed, with both the model input and the supervision signal involving annotation-free data. The pre-training aimed to learn general features related to sensor pose changes by predicting ego-vehicle pose changes using odometry data. To assess pre-training performance, the features were then used as initial weights for fine-tuning a perception model. The performance of the perception model using baseline weights trained on annotated data was similar to that using weights trained on annotation-free data, indicating that the proposed method is viable. However, further testing is required to establish statistical significance. Future work could explore implementing attention-based methods for feature matching between scene representations to improve model performance.
dc.identifier.coursecode: EENX30
dc.identifier.uri: http://hdl.handle.net/20.500.12380/310781
dc.language.iso: eng
dc.setspec.uppsok: Technology
dc.subject: ADAS
dc.subject: Annotation-free
dc.subject: Ego-vehicle
dc.subject: Multi-modal
dc.subject: Perception
dc.subject: Pretraining
dc.subject: Sensor Fusion
dc.subject: Transformer
dc.title: Annotation-free Learning for Sensor Fusion in ADAS
dc.type.degree: Examensarbete för masterexamen (sv)
dc.type.degree: Master's Thesis (en)
dc.type.uppsok: H
local.programme: Information and communication technology (MPICT), MSc

Download

Original bundle

Name: EENX30_Thesis_Annotation_free.pdf
Size: 2.7 MB
Format: Adobe Portable Document Format
