Modelling temporal context for traffic light recognition using RNNs

Examensarbete för masterexamen

Please use this identifier to cite or link to this item: https://hdl.handle.net/20.500.12380/302470
Download file(s):
File Description SizeFormat 
Master_thesis_David Freyr Bjornsson_och_Mattias Westerberg_210611.pdfModelling temporal context for traffic light recognition using RNNs13.35 MBAdobe PDFThumbnail
View/Open
Bibliographical item details
FieldValue
Type: Examensarbete för masterexamen
Title: Modelling temporal context for traffic light recognition using RNNs
Authors: Björnsson, David Freyr
Westerberg, Mattias
Abstract: Abstract The purpose of this thesis is to investigate whether or not including temporal context using recurrent neural networks in real-time object detection systems can improve detection performance in traffic light recognition. This was investigated using the DriveU traffic light dataset. Two variations of the YOLOv4 object detection system were created. The first variation is a LSTM which takes as input the bounding boxes predicted by YOLOv4 and outputs updated predictions. The second variation is a modification of the YOLOv4 network in which convolutional layers are replaced with convolutional LSTMs. With a limited number of experiments, it was found that the baseline model outperforms the more complicated sequential models. However, there is evidence that this is due to the sequential training strategy since the YOLOv4 baseline was outperformed by some sequential models when it adopted the sequential training strategy. The baseline YOLOv4 model achieved best performance on a held-out test set. The best sequential model achieved lower detection performance. When the baseline YOLOv4 was trained with the sequential training strategy, it achieved worse performance than the sequential models. Modelling temporal context using recurrent neural networks may improve detection performance, but answering the question requires an exhaustive search for a training strategy and model architecture. The analysis conducted in this thesis provides no evidence that modelling temporal context with YOLOv4 improves traffic light recognition performance on the DriveU dataset.
Keywords: object detection; traffic light recognition; recurrent neural networks; temporal context; YOLO
Issue Date: 2021
Publisher: Chalmers tekniska högskola / Institutionen för matematiska vetenskaper
URI: https://hdl.handle.net/20.500.12380/302470
Collection:Examensarbeten för masterexamen // Master Theses



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.