Predicting patient outcome from clinical journals and biomedical articles: Using the MIMIC-IV database, multiple in-hospital mortality prediction models are created, to which improvements are attempted through the use of word embeddings trained on scientific biomedical literature

Examensarbete för masterexamen

Please use this identifier to cite or link to this item: https://hdl.handle.net/20.500.12380/304531
Download file(s):
File Description SizeFormat 
CSE 22-07 Henriksson Svensson.pdfPredicting patient outcome from clinical journals and biomedical articles3.26 MBAdobe PDFView/Open
Bibliographical item details
FieldValue
Type: Examensarbete för masterexamen
Title: Predicting patient outcome from clinical journals and biomedical articles: Using the MIMIC-IV database, multiple in-hospital mortality prediction models are created, to which improvements are attempted through the use of word embeddings trained on scientific biomedical literature
Authors: Henriksson, Fannie
Svensson, Philip
Abstract: There exists an immense amount of data within the medical and healthcare field. Two examples of these are the patient’s clinical records and biomedical research literature. The clinical records often contain numerical information suitable for developing AI-driven systems to support medical workers, and with some work, we believe that data from the biomedical literature can also be used to improve these systems. In this work, clinical records from ICU patients in the MIMIC-IV database are used to create multiple prediction models to predict the risk of a subject passing away during their hospital stay (in-hospital mortality prediction). From the clinical records, abnormal observations are categorised into medical terms which are then used to find relevant biomedical research work from the PubMed database. This data is used to train word embeddings using Word2Vec. The word vectors for these categorised observations are then added as a concatenation to the clinical records data, in hopes of improving the predictions. The results show very little difference with the additional information from the word embeddings. These are small improvements but unfortunately, we can not conclude that this can help improve such prediction models. We do however believe it suggest that there is valuable information that can be extracted from biomedical articles and that further work could show this.
Keywords: AI;ML;NLP;healthcare;data engineering;Word2Vec;MIMIC-IV
Issue Date: 2022
Publisher: Chalmers tekniska högskola / Institutionen för data och informationsteknik
URI: https://hdl.handle.net/20.500.12380/304531
Collection:Examensarbeten för masterexamen // Master Theses



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.