Prediction of Drug Metabolites Using a Deep Learning Language Model

dc.contributor.authorDehlén, Amanda
dc.contributor.authorAronsson, Pär
dc.contributor.departmentChalmers tekniska högskola / Institutionen för data och informationstekniksv
dc.contributor.departmentChalmers University of Technology / Department of Computer Science and Engineeringen
dc.contributor.examinerEngkvist, Ola
dc.contributor.supervisorMercado Oropeza, Rocío
dc.date.accessioned2025-02-28T14:30:17Z
dc.date.available2025-02-28T14:30:17Z
dc.date.issued2024
dc.date.submitted
dc.description.abstractThe understanding of metabolism is essential in drug development, but conducting drug metabolism experiments is resource-intensive. To support this, in silico experiments using machine learning have been explored, with several tools available, but these rely on rule-based assessments and are restricted in their scalability. To build a better model for metabolite prediction in drug discovery, a deep neural network model called the Focused Transformer has been explored. For the model, metabolite data was gathered and curated. Several strategies were explored to improve the model’s performance, including a novel pretraining strategy involving pairs of structurally analogous molecules termed matched molecular pairs. The best derived model managed to find one true metabolite and had a validity of 4.5% when evaluated on an internal test set. While the model shows reasonable prediction for metabolite prediction, there is potential to achieve higher performance in future work and we conclude by suggesting several potential strategies that can be explored further, such as handling of data during training.
dc.identifier.coursecodeDATX05
dc.identifier.urihttp://hdl.handle.net/20.500.12380/309172
dc.language.isoeng
dc.setspec.uppsokTechnology
dc.subjectdrug development
dc.subjectdeep learning
dc.subjectdrug metabolites
dc.subjectfocused transformer
dc.subjectlanguage model
dc.subjectmetabolism
dc.subjectneural network
dc.titlePrediction of Drug Metabolites Using a Deep Learning Language Model
dc.type.degreeExamensarbete för masterexamensv
dc.type.degreeMaster's Thesisen
dc.type.uppsokH
local.programmeComputer science – algorithms, languages and logic (MPALG), MSc
local.programmeData science and AI (MPDSC), MSc

Ladda ner

Original bundle

Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
CSE 24-121 AD PA.pdf
Storlek:
2.34 MB
Format:
Adobe Portable Document Format

License bundle

Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
license.txt
Storlek:
2.35 KB
Format:
Item-specific license agreed upon to submission
Beskrivning: