Identification of cardiovascular risk factors of COVID-19 patients using SHAP values for tree-based machine learning models

dc.contributor.authorBacklund, Johannes
dc.contributor.departmentChalmers tekniska högskola / Institutionen för data och informationstekniksv
dc.contributor.examinerJohansson, Moa
dc.contributor.supervisorSeger, Carl-Johan
dc.contributor.supervisorYu, Yinan
dc.date.accessioned2021-04-28T06:45:13Z
dc.date.available2021-04-28T06:45:13Z
dc.date.issued2021sv
dc.date.submitted2020
dc.description.abstractThe exact cardiovascular risk factors involving COVID-19 patients are so far not fully known. This thesis uses two data sets (MIMIC, VGR) and tree-based machine learning models (Random forest, XGboost, LightGBM, CatBoost) to predict the outcome in mortality for pneumonia and COVID-19 patients. Using an algorithm known as Tree SHAP, the final trained tree model is interpreted together with distributions of mortality to identify the most important predictors (risk factors).The method used in this thesis produces intuitive graphs for analyzing risk factors by using supervised machine learning methods that focuses on creating models with good distinction ability. The same method could potentially be applied to identify mortality risk factors (or other types of risk factors) in the case of a new pandemic. The challenges, which needs to be carefully considered in applying this method, are mostly related to either having skewed data, unbalanced data or missing data points. The COVID-19 results show prevalence of risk factors such as; age, hypertension, chronic ischemic heart disease and diabetes.sv
dc.identifier.urihttps://hdl.handle.net/20.500.12380/302325
dc.language.isoengsv
dc.setspec.uppsokTechnology
dc.subjectSHAP valuessv
dc.subjectinterpretable machine learningsv
dc.subjectCOVID-19sv
dc.subjectPneumoniasv
dc.subjectMIMICsv
dc.subjectVGRsv
dc.subjecttree-based machine learning modelssv
dc.titleIdentification of cardiovascular risk factors of COVID-19 patients using SHAP values for tree-based machine learning modelssv
dc.type.degreeExamensarbete för masterexamensv
dc.type.uppsokH
Ladda ner
License bundle
Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
license.txt
Storlek:
1.14 KB
Format:
Item-specific license agreed upon to submission
Beskrivning: