Prediction of Drug Metabolites Using a Deep Learning Language Model
dc.contributor.author | Dehlén, Amanda | |
dc.contributor.author | Aronsson, Pär | |
dc.contributor.department | Chalmers tekniska högskola / Institutionen för data och informationsteknik | sv |
dc.contributor.department | Chalmers University of Technology / Department of Computer Science and Engineering | en |
dc.contributor.examiner | Engkvist, Ola | |
dc.contributor.supervisor | Mercado Oropeza, Rocío | |
dc.date.accessioned | 2025-02-28T14:30:17Z | |
dc.date.available | 2025-02-28T14:30:17Z | |
dc.date.issued | 2024 | |
dc.date.submitted | ||
dc.description.abstract | The understanding of metabolism is essential in drug development, but conducting drug metabolism experiments is resource-intensive. To support this, in silico experiments using machine learning have been explored, with several tools available, but these rely on rule-based assessments and are restricted in their scalability. To build a better model for metabolite prediction in drug discovery, a deep neural network model called the Focused Transformer has been explored. For the model, metabolite data was gathered and curated. Several strategies were explored to improve the model’s performance, including a novel pretraining strategy involving pairs of structurally analogous molecules termed matched molecular pairs. The best derived model managed to find one true metabolite and had a validity of 4.5% when evaluated on an internal test set. While the model shows reasonable prediction for metabolite prediction, there is potential to achieve higher performance in future work and we conclude by suggesting several potential strategies that can be explored further, such as handling of data during training. | |
dc.identifier.coursecode | DATX05 | |
dc.identifier.uri | http://hdl.handle.net/20.500.12380/309172 | |
dc.language.iso | eng | |
dc.setspec.uppsok | Technology | |
dc.subject | drug development | |
dc.subject | deep learning | |
dc.subject | drug metabolites | |
dc.subject | focused transformer | |
dc.subject | language model | |
dc.subject | metabolism | |
dc.subject | neural network | |
dc.title | Prediction of Drug Metabolites Using a Deep Learning Language Model | |
dc.type.degree | Examensarbete för masterexamen | sv |
dc.type.degree | Master's Thesis | en |
dc.type.uppsok | H | |
local.programme | Computer science – algorithms, languages and logic (MPALG), MSc | |
local.programme | Data science and AI (MPDSC), MSc |