3D Object Detection with Perceiver Adapting the Perceiver Architecture for 3D Object Detection with Automotive Image Data

dc.contributor.authorSedin, Linnéa
dc.contributor.authorAndersson, Lydia
dc.contributor.departmentChalmers tekniska högskola / Institutionen för elektrotekniksv
dc.contributor.examinerKahl, Fredrik
dc.contributor.supervisorVerbeke, Willem
dc.contributor.supervisorFatemi, Maryam
dc.contributor.supervisorJohnander, Joakim
dc.date.accessioned2024-06-19T11:59:53Z
dc.date.available2024-06-19T11:59:53Z
dc.date.issued2024
dc.date.submitted
dc.description.abstractAbstract In the field of autonomous vehicles, accurately understanding and interpreting the environment and surroundings is vital for safe navigation. A robust perception system is fundamental in achieving this, and central to this is the use of computer vision techniques. This project investigates the adaptation of Perceiver [13] for 3D object detection using automotive image data. While the Perceiver architecture, which is based on Transformers, is designed for perception tasks, it has neither been evaluated on large input sizes nor tested on 3D object detection tasks. By using the ZOD dataset [2], which provides a comprehensive collection of automotive image data, this study aims to assess Perceiver’s capabilities in this context. Specifically, 3D object detection is exclusively done on camera images, targeting only vehicle objects. The proposed model in this project implements end-to-end object detection where the loss computation is inspired by the DETR loss computation [4]. The results indicate that the Perceiver architecture does not work without additional configuration for 3D object detection. Challenges in adapting the model highlight the need for further modifications and optimizations to improve its performance in this specific application. Despite these challenges, the study provides valuable insights into the potential and limitations of using the Perceiver architecture for 3D object detection with large input data. This study also provides insight into the impact of different additional modifications to the basic architecture.
dc.identifier.coursecodeEENX30
dc.identifier.urihttp://hdl.handle.net/20.500.12380/307937
dc.language.isoeng
dc.setspec.uppsokTechnology
dc.subjectKeywords: 3D Object Detection, Artificial Intelligence, Automotive Sensing, Deep Machine Learning, DETR, Perceiver, Transformer v
dc.title3D Object Detection with Perceiver Adapting the Perceiver Architecture for 3D Object Detection with Automotive Image Data
dc.type.degreeExamensarbete för masterexamensv
dc.type.degreeMaster's Thesisen
dc.type.uppsokH
local.programmeSystems, control and mechatronics (MPSYS), MSc

Ladda ner

Original bundle

Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
Msc_linnea.pdf
Storlek:
7.75 MB
Format:
Adobe Portable Document Format

License bundle

Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
license.txt
Storlek:
2.35 KB
Format:
Item-specific license agreed upon to submission
Beskrivning: