Holistic Diagnosis via Multimodal Foundation Models
dc.contributor.author | Pauli, Oskar | |
dc.contributor.department | Chalmers tekniska högskola / Institutionen för elektroteknik | sv |
dc.contributor.examiner | Graell i Amat, Alexandre | |
dc.contributor.supervisor | Ceccobello, Chiara | |
dc.contributor.supervisor | Östman, Johan | |
dc.date.accessioned | 2024-07-17T10:03:29Z | |
dc.date.available | 2024-07-17T10:03:29Z | |
dc.date.issued | 2024 | |
dc.date.submitted | ||
dc.description.abstract | The healthcare domain has data in many different forms, or modalities. They can be in the form of x-ray images, time-series of certain events like heart rate or blood pressure, textual data from notes etc. Medical practitioners uses many different modalities every day to make informed and sound decisions. With the recent success of small and large language models, it is natural to try and incorporate them with multimodal capabilities in the healtcare domain. This thesis seeks to investigate how well small language models can perform on predictive tasks in healthcare using multimodal data. To explore this, projectors that project data from different sources to the embedding space of a language model was developed. While the results show that a multimodal language model is better than a single-sourced version, it is still being outperformed by the XGBoost model. Even though it is being outperformed, the model proposed shows promise in regards to generalizability, potentially streamlining predictive tasks in healthcare. The thesis argue that even if improvements needs to be made and the challenges it poses can be difficult to handle, further advancements can lead to facilitating medical practitioners in a very efficient way. | |
dc.identifier.coursecode | EENX30 | |
dc.identifier.uri | http://hdl.handle.net/20.500.12380/308311 | |
dc.language.iso | eng | |
dc.setspec.uppsok | Technology | |
dc.subject | ML | |
dc.subject | Language Models | |
dc.subject | Healthcare | |
dc.subject | Multi-label Classification | |
dc.subject | SHAP | |
dc.title | Holistic Diagnosis via Multimodal Foundation Models | |
dc.type.degree | Examensarbete för masterexamen | sv |
dc.type.degree | Master's Thesis | en |
dc.type.uppsok | H | |
local.programme | Engineering mathematics and computational science (MPENM), MSc |