Improving perception systems for autonomous driving
dc.contributor.author | Kalander, Gustav | |
dc.contributor.author | Hoti, Faton | |
dc.contributor.department | Chalmers tekniska högskola / Institutionen för elektroteknik | sv |
dc.contributor.examiner | Wymeersch, Henk | |
dc.contributor.supervisor | Johnander, Joakim | |
dc.contributor.supervisor | Gustafsson, Niklas | |
dc.date.accessioned | 2025-06-17T12:59:11Z | |
dc.date.issued | 2025 | |
dc.date.submitted | ||
dc.description.abstract | Transformers have become a cornerstone of modern deep learning. Typically, a transformer layer comprises attention, normalization, dropout, and a feed-forward network (FFN). This work investigates the role of the FFN in transformer-based 3D object detection by exploring two modifications: (1) replacing the FFN with a mixture of experts layer to enhance model capacity, and (2) progressively reducing—and ultimately removing—the FFN to assess its necessity. Surprisingly, neither approach led to measurable changes in detection performance, suggesting that the FFN may be functionally redundant in this context. Further experiments revealed that the model retained full performance even when the FFN was entirely eliminated, challenging the conventional assumption that FFNs are indispensable in transformer architectures. These findings raise questions about the necessity of FFNs in perception tasks, contrasting with their established empirically demonstrated importance in NLP. The results also suggest potential avenues for designing leaner, more efficient transformer variants by omitting the FFN. | |
dc.identifier.coursecode | EENX30 | |
dc.identifier.uri | http://hdl.handle.net/20.500.12380/309500 | |
dc.language.iso | eng | |
dc.relation.ispartofseries | 00000 | |
dc.setspec.uppsok | Technology | |
dc.subject | Transformer | |
dc.subject | 3D Object Detection | |
dc.subject | Feed-Forward Network | |
dc.subject | Mixture of Experts | |
dc.subject | Model Efficiency | |
dc.subject | Architectural Redundancy | |
dc.title | Improving perception systems for autonomous driving | |
dc.type.degree | Examensarbete för masterexamen | sv |
dc.type.degree | Master's Thesis | en |
dc.type.uppsok | H | |
local.programme | Computer systems and networks (MPCSN), MSc |