Interpreting Machine Learning Models using Conditional Counterfactual Generation
| dc.contributor.author | Martinsson, Samuel | |
| dc.contributor.department | Chalmers tekniska högskola / Institutionen för fysik | sv |
| dc.contributor.department | Chalmers University of Technology / Department of Physics | en |
| dc.contributor.examiner | Bjerkeli, Per | |
| dc.contributor.supervisor | Gillgren, Andreas | |
| dc.date.accessioned | 2026-06-17T08:17:08Z | |
| dc.date.issued | 2026 | |
| dc.date.submitted | ||
| dc.description.abstract | With the rapid development and application of complex machine learning models, the need to interpret the internal processes of such models have become increasingly relevant. In this thesis, a novel method for interpreting black box machine learning models is proposed, where an autoencoder is used to generate reconstructions of data to visualize in an interpretable way what patterns a model has learned to detect. The method is first shown to work for a simple constructed problem, being able to interpret a model that has learned to predict the mean of an underlying normal distribution from samples. It is then evaluated for a more complex problem, where a model has learned to classify the existence of disease in images from the CheXpert dataset of X-ray images. It is demonstrated that naively implementing the method to interpret this model leads to the autoencoder generating adversarial patterns to trick the model, instead of showing the an interpretable explanation of what the model has learned. To mitigate this issue, the thesis explores adding an additional model in the latent space of the conditional autoencoder and demonstrates that this can provide a certain degree of interpretability. Because of this, the method shows promise for interpreting black box models and with further research it might become viable for practical use. | |
| dc.identifier.coursecode | SEEX30 | |
| dc.identifier.uri | https://hdl.handle.net/20.500.12380/311335 | |
| dc.language.iso | eng | |
| dc.setspec.uppsok | PhysicsChemistryMaths | |
| dc.subject | machine learning, interpretability, autoencoders, counterfactual, chest X-ray images. | |
| dc.title | Interpreting Machine Learning Models using Conditional Counterfactual Generation | |
| dc.type.degree | Examensarbete för masterexamen | sv |
| dc.type.degree | Master's Thesis | en |
| dc.type.uppsok | H | |
| local.programme | Engineering mathematics and computational science (MPENM), MSc |
