Implementing a machine learning microservice for scoring and predicting vehicle driving attributes and their impact on costs

dc.contributor.author: Blom, Johan
dc.contributor.author: Sohrabpour, Sam
dc.contributor.department: Chalmers tekniska högskola / Institutionen för data och informationsteknik (sv)
dc.contributor.department: Chalmers University of Technology / Department of Computer Science and Engineering (en)
dc.contributor.examiner: Ranta, Aarne
dc.contributor.supervisor: Seger, Carl-Johan
dc.date.accessioned: 2022-12-05T08:48:53Z
dc.date.available: 2022-12-05T08:48:53Z
dc.date.issued: 2022
dc.date.submitted: 2022
dc.description.abstract: Minimizing automotive insurance costs and other operating costs has become a new priority within the vehicle industry, as these costs are included in the expanding subscription-based business model for vehicles. Research has shown that automotive insurance costs can be decreased if the automotive company manages to prove to the insurance company that the driving behavior of its vehicles is better than expected. It has also been shown that additional operating costs, such as sending a replacement car during service, would be reduced by better driving behavior. This study aims to provide an analysis that informs automobile companies how driving behavior attributes (trip data) affect operating costs. The question is: if an automobile brand has trip data available, is it possible to use machine learning to accurately predict service costs and other operating costs for the vehicle? The analysis was made using a machine learning model based on a supervised algorithm called extreme gradient boosting. The model was trained on trip data and operating cost data, and it processes trip data to predict the likelihood of additional operating costs exceeding 10,000 kr. In the absence of real data, the data used in the thesis was generated based on car statistics, not collected from real cars. The work was done with a microservice structure, meaning that multiple small services communicated with each other through APIs. The analysis of the final model demonstrated that it is possible to predict operating costs with fairly good accuracy according to the several evaluation metrics used to evaluate the model. The final model achieved 57% accuracy in finding vehicles with additional operating costs and 92% accuracy in finding vehicles without additional operating costs.
The results indicate that the dataset is too imbalanced because vehicles requiring additional operating costs are rare. This was handled by using the average trip data per car instead of processing all trip data individually. The machine learning model's accuracy increased significantly once the imbalance ratio went above one car that requires additional operating costs for every 30 cars that do not.
dc.identifier.coursecode: DATX05
dc.identifier.uri: https://odr.chalmers.se/handle/20.500.12380/305871
dc.language.iso: eng
dc.setspec.uppsok: Technology
dc.subject: Python
dc.subject: Micro-services
dc.subject: Machine Learning
dc.subject: Kubernetes
dc.subject: Azure ML
dc.subject: XGBoost
dc.title: Implementing a machine learning microservice for scoring and predicting vehicle driving attributes and their impact on costs
dc.type.degree: Examensarbete för masterexamen (sv)
dc.type.degree: Master's Thesis (en)
dc.type.uppsok: H
local.programme: Computer science – algorithms, languages and logic (MPALG), MSc
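
The data handling described in the abstract (averaging trip data per car, labelling cars against the 10,000 kr threshold, and reporting accuracy separately for cars with and without additional operating costs) can be illustrated with a minimal Python sketch. This is not the thesis implementation: the function names, the feature names, and the reading of the two reported accuracies as per-class recall are assumptions made here for illustration, and the extreme gradient boosting model itself is left out.

```python
from collections import defaultdict
from statistics import mean

COST_THRESHOLD_KR = 10_000  # threshold named in the abstract


def average_trips_per_car(trips):
    """Collapse per-trip rows into one averaged feature row per car.

    `trips` is an iterable of (car_id, feature_dict) pairs.
    The feature names are hypothetical, not taken from the thesis.
    """
    grouped = defaultdict(list)
    for car_id, features in trips:
        grouped[car_id].append(features)
    return {
        car_id: {name: mean(row[name] for row in rows) for name in rows[0]}
        for car_id, rows in grouped.items()
    }


def label_cars(extra_cost_by_car):
    """Label a car 1 if its additional operating cost exceeds the threshold."""
    return {
        car_id: int(cost > COST_THRESHOLD_KR)
        for car_id, cost in extra_cost_by_car.items()
    }


def per_class_accuracy(y_true, y_pred):
    """Fraction of each true class that was predicted correctly.

    Assumption: this per-class recall is one plausible reading of the
    two accuracies reported in the abstract.
    """
    totals, hits = defaultdict(int), defaultdict(int)
    for truth, pred in zip(y_true, y_pred):
        totals[truth] += 1
        hits[truth] += int(truth == pred)
    return {cls: hits[cls] / totals[cls] for cls in totals}
```

In a full pipeline, the averaged rows and labels would be passed to a gradient boosting classifier, and `per_class_accuracy` would be computed on held-out cars. Reporting the two classes separately matters because, with roughly one positive car per 30 negatives, a model that always predicts "no additional cost" would still score high overall accuracy.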

Download

Original bundle

Name: CSE 22-116 Blom Sohrabpour.pdf
Size: 2.33 MB
Format: Adobe Portable Document Format

License bundle

Name: license.txt
Size: 1.64 KB
Format: Item-specific license agreed upon to submission