Interpreting Machine Learning Models using Conditional Counterfactual Generation

dc.contributor.authorMartinsson, Samuel
dc.contributor.departmentChalmers tekniska högskola / Institutionen för fysiksv
dc.contributor.departmentChalmers University of Technology / Department of Physicsen
dc.contributor.examinerBjerkeli, Per
dc.contributor.supervisorGillgren, Andreas
dc.date.accessioned2026-06-17T08:17:08Z
dc.date.issued2026
dc.date.submitted
dc.description.abstractWith the rapid development and application of complex machine learning models, the need to interpret the internal processes of such models have become increasingly relevant. In this thesis, a novel method for interpreting black box machine learning models is proposed, where an autoencoder is used to generate reconstructions of data to visualize in an interpretable way what patterns a model has learned to detect. The method is first shown to work for a simple constructed problem, being able to interpret a model that has learned to predict the mean of an underlying normal distribution from samples. It is then evaluated for a more complex problem, where a model has learned to classify the existence of disease in images from the CheXpert dataset of X-ray images. It is demonstrated that naively implementing the method to interpret this model leads to the autoencoder generating adversarial patterns to trick the model, instead of showing the an interpretable explanation of what the model has learned. To mitigate this issue, the thesis explores adding an additional model in the latent space of the conditional autoencoder and demonstrates that this can provide a certain degree of interpretability. Because of this, the method shows promise for interpreting black box models and with further research it might become viable for practical use.
dc.identifier.coursecodeSEEX30
dc.identifier.urihttps://hdl.handle.net/20.500.12380/311335
dc.language.isoeng
dc.setspec.uppsokPhysicsChemistryMaths
dc.subjectmachine learning, interpretability, autoencoders, counterfactual, chest X-ray images.
dc.titleInterpreting Machine Learning Models using Conditional Counterfactual Generation
dc.type.degreeExamensarbete för masterexamensv
dc.type.degreeMaster's Thesisen
dc.type.uppsokH
local.programmeEngineering mathematics and computational science (MPENM), MSc

Ladda ner

Original bundle

Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
Samuel_Martinsson.pdf
Size:
13.83 MB
Format:
Adobe Portable Document Format

License bundle

Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
license.txt
Size:
2.35 KB
Format:
Item-specific license agreed upon to submission
Description: