Evaluation of models for forecasting traffic speed and classifying traffic delay in European cities

dc.contributor.authorGHAFFARPOUR, ARMAND
dc.contributor.authorHANSSON, OSCAR
dc.contributor.departmentChalmers tekniska högskola / Institutionen för data och informationstekniksv
dc.contributor.examinerJohansson, Daniel
dc.contributor.supervisorVerendel, Vilhelm
dc.date.accessioned2021-08-26T05:59:52Z
dc.date.available2021-08-26T05:59:52Z
dc.date.issued2021sv
dc.date.submitted2020
dc.description.abstractCongestion and traffic delays are big challenges that cities face today. In this the sis, we use traffic speed data from 2018 for forecasting traffic speeds and classifying traffic delays. The data consists of 15-minute time-intervals and covers 15 European cities. As a starting point we investigate if machine learning can be used to forecast traffic speeds, specifically if more advanced forecasting models improve upon simple statistical models. It is also investigated if weather variables such as precipitation and temperature improve the forecasting models. Forecasting models require con tinuous traffic speed input data in order to make predictions about the future. To create more generic models capable of making predictions regardless of the current traffic speed, binary classification models are used with the goal of classifying if there is a delay or not at some point in time. The traffic speed data is transformed into a binary value, 1 for when there is a delay and 0 for when there is no delay. This is used as the target variable of the models. For the classification models, we use temporal features, weather features and graph-related centrality features that describe the road location. We also take into account the infrastructure along a road as well as the area around a road that might have an impact on traffic con gestion. A larger portion of this work is focused on the binary classification with regards to traffic delay. The classification is done to answer three questions. What are the most important features for creating classification models? How well do the models perform on untrained city data? Do the models improve when increasing the number of training cities? Regardless if weather data was used in conjunction with traffic speed data as input to the model, more advanced forecasting models didn’t improve the performance significantly. With regards to the classification results, the most important features were found to be related to public transportation, where bus stops were the most prominent feature followed by schools. Moreover, it was shown that generalizing the trained models to new city data was indeed possible and that it performed better than a random classifier (a classifier that guesses the class for an input). Finally the results showed an overall increase in performance of the classifiers when increasing the number of training cities but more work is needed for optimizing the models in the future.sv
dc.identifier.coursecodeMPALGsv
dc.identifier.urihttps://hdl.handle.net/20.500.12380/303998
dc.language.isoengsv
dc.setspec.uppsokTechnology
dc.subjectcomputer sciencesv
dc.subjectthesissv
dc.subjecttraffic datasv
dc.subjectmachine learningsv
dc.subjectclassificationsv
dc.subjectforecastingsv
dc.subjectcongestionsv
dc.subjecttime seriessv
dc.titleEvaluation of models for forecasting traffic speed and classifying traffic delay in European citiessv
dc.type.degreeExamensarbete för masterexamensv
dc.type.uppsokH
Ladda ner
Original bundle
Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
CSE 21-126 Ghaffarpour Hansson.pdf
Storlek:
5.7 MB
Format:
Adobe Portable Document Format
Beskrivning:
License bundle
Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
license.txt
Storlek:
1.51 KB
Format:
Item-specific license agreed upon to submission
Beskrivning: