Machine learning based warning system for failed procurement classification documents

dc.contributor.authorTzinieris, Anastasios
dc.contributor.departmentChalmers tekniska högskola / Institutionen för matematiska vetenskapersv
dc.contributor.examinerPicchini, Umberto
dc.contributor.supervisorSärkkä, Aila
dc.date.accessioned2022-06-20T07:40:39Z
dc.date.available2022-06-20T07:40:39Z
dc.date.issued2022sv
dc.date.submitted2020
dc.description.abstractWarning systems in the Machine Learning field of study, is a tool that generates a warning based on a model’s prediction results. This thesis’s study topic is to create such system to identify possible problematic procurement classification documents. Given a database of a company, a dataset was created for which a feature analysis was made to investigate which properties of a document can cause an either classification or formatting error. The challenging part of the research was the feature engineering since each feature had to be preprocessed differently based on the importance of the information contained. Moreover, different supervised machine learning methods were implemented and hyperparameter tuned, using an algorithm called Grid Search. After the evaluation and comparison of the models, XGBoost Classifier was found to be the most successful both in terms of performance and computational time achieving 90,5% accuracy. However, by gathering more data, especially containing formatting errors, it is anticipated that the performance of the warning system using the XGBoost will be improved.sv
dc.identifier.coursecodeMVEX03sv
dc.identifier.urihttps://hdl.handle.net/20.500.12380/304792
dc.language.isoengsv
dc.setspec.uppsokPhysicsChemistryMaths
dc.subjectWarning system, supervised learning, machine learning, feature engineering, XGBoost Classifiersv
dc.titleMachine learning based warning system for failed procurement classification documentssv
dc.type.degreeExamensarbete för masterexamensv
dc.type.uppsokH
Ladda ner
Original bundle
Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
Master_Thesis_Anastasios_Tzineris_2022.pdf
Storlek:
2.6 MB
Format:
Adobe Portable Document Format
Beskrivning:
License bundle
Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
license.txt
Storlek:
1.51 KB
Format:
Item-specific license agreed upon to submission
Beskrivning: