Discovering Novel Chemical Reactions

dc.contributor.authorRydholm, Emma
dc.contributor.authorSvensson, Emma
dc.contributor.departmentChalmers tekniska högskola / Institutionen för data och informationstekniksv
dc.contributor.examinerDamaschke, Peter
dc.contributor.supervisorHaghir Chehreghani, Morteza
dc.date.accessioned2021-06-17T06:52:52Z
dc.date.available2021-06-17T06:52:52Z
dc.date.issued2021sv
dc.date.submitted2020
dc.description.abstractAccurately predicting chemical reactions can facilitate the search for optimal synthe sis routes in a chemical reaction network and as a consequence expedite the lengthy drug discovery process. As an effort in this direction, this work aims to explore AstraZeneca’s chemical knowledge graph by two complementary analyses. In a first part, graph theory related statistics is employed as a means to gain insights about the chemical reaction graph at AstraZeneca. Significant differences are observed be tween this internal reaction graph and the one based on the public dataset of United States patents as well as other reaction graphs discussed in literature. Secondly, a link prediction model is applied to and evaluated on AstraZeneca’s chemical reaction graph, in order to suggest new potential chemical reactions. In order to successfully accomplish this task, an existing link prediction model is adapted and trained. The test results are then compared to heuristic baselines, showing that the proposed implementation substantially exceeds what can be achieved with heuristic methods. One of the contribution from this research is a comparison between different ways to sample the ground truth class of non-existing links for training and evaluation. The choice of method for this task is shown to have an impact on the final predictions. Finally, a set of promising, predicted reactions are suggested and is currently under further investigation at AstraZeneca.sv
dc.identifier.coursecodeMPCASsv
dc.identifier.urihttps://hdl.handle.net/20.500.12380/302579
dc.language.isoengsv
dc.setspec.uppsokTechnology
dc.subjectLink Predictionsv
dc.subjectGraph Neural Networkssv
dc.subjectKnowledge Graphsv
dc.subjectChemical Reaction Graphsv
dc.subjectGraph Analysissv
dc.subjectSynthesis Predictionsv
dc.subjectDrug Discoverysv
dc.titleDiscovering Novel Chemical Reactionssv
dc.type.degreeExamensarbete för masterexamensv
dc.type.uppsokH
local.programmeComplex adaptive systems (MPCAS), MSc
Ladda ner
Original bundle
Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
CSE 21-43 Rydholm Svensson.pdf
Storlek:
10.44 MB
Format:
Adobe Portable Document Format
Beskrivning:
License bundle
Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
license.txt
Storlek:
1.14 KB
Format:
Item-specific license agreed upon to submission
Beskrivning: