Interpreting Machine Learning Models using Conditional Counterfactual Generation

Martinsson, Samuel

Interpreting Machine Learning Models using Conditional Counterfactual Generation

dc.contributor.author	Martinsson, Samuel
dc.contributor.department	Chalmers tekniska högskola / Institutionen för fysik	sv
dc.contributor.department	Chalmers University of Technology / Department of Physics	en
dc.contributor.examiner	Bjerkeli, Per
dc.contributor.supervisor	Gillgren, Andreas
dc.date.accessioned	2026-06-17T08:17:08Z
dc.date.issued	2026
dc.date.submitted
dc.description.abstract	With the rapid development and application of complex machine learning models, the need to interpret the internal processes of such models have become increasingly relevant. In this thesis, a novel method for interpreting black box machine learning models is proposed, where an autoencoder is used to generate reconstructions of data to visualize in an interpretable way what patterns a model has learned to detect. The method is first shown to work for a simple constructed problem, being able to interpret a model that has learned to predict the mean of an underlying normal distribution from samples. It is then evaluated for a more complex problem, where a model has learned to classify the existence of disease in images from the CheXpert dataset of X-ray images. It is demonstrated that naively implementing the method to interpret this model leads to the autoencoder generating adversarial patterns to trick the model, instead of showing the an interpretable explanation of what the model has learned. To mitigate this issue, the thesis explores adding an additional model in the latent space of the conditional autoencoder and demonstrates that this can provide a certain degree of interpretability. Because of this, the method shows promise for interpreting black box models and with further research it might become viable for practical use.
dc.identifier.coursecode	SEEX30
dc.identifier.uri	https://hdl.handle.net/20.500.12380/311335
dc.language.iso	eng
dc.setspec.uppsok	PhysicsChemistryMaths
dc.subject	machine learning, interpretability, autoencoders, counterfactual, chest X-ray images.
dc.title	Interpreting Machine Learning Models using Conditional Counterfactual Generation
dc.type.degree	Examensarbete för masterexamen	sv
dc.type.degree	Master's Thesis	en
dc.type.uppsok	H
local.programme	Engineering mathematics and computational science (MPENM), MSc

Ladda ner

Original bundle

Visar 1 - 1 av 1

Namn:: Samuel_Martinsson.pdf
Size:: 13.83 MB
Format:: Adobe Portable Document Format

Ladda ner

License bundle

Visar 1 - 1 av 1

Namn:: license.txt
Size:: 2.35 KB
Format:: Item-specific license agreed upon to submission
Description:

Ladda ner

Samlingar

Examensarbeten för masterexamen