Optimising Large Language Models for Vehicle Classification

dc.contributor.authorHjärtström, Filip
dc.contributor.authorJonängen, Simon
dc.contributor.departmentChalmers tekniska högskola / Institutionen för matematiska vetenskapersv
dc.contributor.examinerGebäck, Tobias
dc.contributor.supervisorMeddeb, Jonas
dc.contributor.supervisorMolin, Oskar
dc.date.accessioned2025-06-30T09:59:34Z
dc.date.issued2025
dc.date.submitted
dc.description.abstractThis thesis examines the use of Large Language Models optimised for vehicle classification within the insurance industry. Traditional methods at If P&C Insurance suffer from inaccuracies and scalability issues due to unstructured text data from manual inputs and relatively basic techniques. To address this, our study utilises Parameter-Efficient Fine-Tuning and Low-Rank Adaptation on standardised, manually labelled vehicle data. Our approach combined careful prompt engineering, dataset preprocessing, hyperparameter optimisation via the Nondominated Sorting Genetic Algorithm II, and thorough evaluation across different base models, including various Llama model sizes, DeepSeek, Mistral, and Phi. Through this integrated methodology, we achieved a final model accuracy of 96.8% on hold-out data, using a fine-tuned Llama 3.1 model with 8 billion parameters. Despite the model’s relatively modest size, targeted adaptation enabled it to outperform larger proprietary models such as GPT-4o on the specific classification task. Implementation aspects, including computational needs, cost efficiency, and human-in-the-loop strategies, are also discussed. Our deployment framework emphasises selective human review based on model confidence, enabling a sustainable balance between automation and accuracy. Financial and infrastructure considerations showed that fine-tuned opensource models could offer significant cost savings compared to API-based solutions. In conclusion, this research presents a scalable, cost-effective, and high-performing solution that enhances vehicle classification, leading to improved risk segmentation and pricing precision in the insurance sector. The results demonstrate that finetuned open-source LLMs, when carefully adapted, can rival and even surpass much larger commercial models in domain-specific applications, offering a viable path for modernising traditional insurance workflows.
dc.identifier.coursecodeMVEX03
dc.identifier.urihttp://hdl.handle.net/20.500.12380/309762
dc.language.isoeng
dc.setspec.uppsokPhysicsChemistryMaths
dc.subjectAI, LLM, PEFT, LoRA, machine learning, supervised learning, unsupervised learning, big data, classification, risk-based pricing.
dc.titleOptimising Large Language Models for Vehicle Classification
dc.type.degreeExamensarbete för masterexamensv
dc.type.degreeMaster's Thesisen
dc.type.uppsokH
local.programmeEngineering mathematics and computational science (MPENM), MSc

Ladda ner

Original bundle

Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
Master_Thesis_Simon Jonängen Filip Hjärtström_2025.pdf
Storlek:
1.19 MB
Format:
Adobe Portable Document Format

License bundle

Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
license.txt
Storlek:
2.35 KB
Format:
Item-specific license agreed upon to submission
Beskrivning: