Predictive AI for Hepatic Safety: A dual analysis of CYP450 time-dependent inhibition and trapping assays using supervised learning models

dc.contributor.authorRamos Marca, Maria Virginia
dc.contributor.departmentChalmers tekniska högskola / Institutionen för data och informationstekniksv
dc.contributor.departmentChalmers University of Technology / Department of Computer Science and Engineeringen
dc.contributor.examinerEngkvist, Ola
dc.contributor.supervisorSubramanian, Vigneshwari
dc.date.accessioned2025-10-20T13:54:13Z
dc.date.issued2025
dc.date.submitted
dc.description.abstractThis work explores the development and evaluation of machine learning models for predicting toxicity-related endpoints, focusing on time-dependent inhibition of cytochrome P450 enzymes and reactivity in trapping assays (glutathione, potassium cyanide, and methoxylamine). A variety of modeling strategies were assessed, including decision trees and Chemprop neural networks in both single-task and multitask configurations. Model performances were estimated using temporally split datasets to better reflect real-world prediction scenarios. While tree-based models consistently delivered more stable and balanced results, Chemprop models showed greater sensitivity to class imbalance, data partitioning, and representation. Attempts to mitigate these issues using data resampling techniques, additional molecular descriptors, and scaffold-based data reduction led to limited improvements. Further analysis of feature distributions and chemical space connectivity highlighted key challenges, such as weak class separation in descriptor values and structural isolation of test compounds, especially under temporal splits. In the case of trapping assays, multitask learning failed to improve generalization, likely due to the biological heterogeneity of the endpoints. Overall, results emphasize that data limitations are the primary bottleneck. Enhancing chemical diversity, improving feature representations, and tailoring models to specific endpoint properties appear critical for achieving more robust predictions in toxicity modeling.
dc.identifier.coursecodeDATX05
dc.identifier.urihttp://hdl.handle.net/20.500.12380/310656
dc.language.isoeng
dc.setspec.uppsokTechnology
dc.subjectCYP450 inhibition
dc.subjectTrapping Assays
dc.subjectMachine Learning
dc.subjectDecision Trees
dc.subjectChemprop
dc.subjectToxicology Prediction
dc.subjectImbalanced Data
dc.titlePredictive AI for Hepatic Safety: A dual analysis of CYP450 time-dependent inhibition and trapping assays using supervised learning models
dc.type.degreeExamensarbete för masterexamensv
dc.type.degreeMaster's Thesisen
dc.type.uppsokH
local.programmeBiomedical engineering (MPMED), MSc

Ladda ner

Original bundle

Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
CSE 25-98 MR.pdf
Storlek:
5.72 MB
Format:
Adobe Portable Document Format

License bundle

Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
license.txt
Storlek:
2.35 KB
Format:
Item-specific license agreed upon to submission
Beskrivning: