Molecular Optimization using Deep Learning Extensions of the Transformer for Molecular Optimization

dc.contributor.authorForsberg, Marcus
dc.contributor.authorMattsson, Felix
dc.contributor.departmentChalmers tekniska högskola / Institutionen för data och informationstekniksv
dc.date.accessioned2021-04-14T06:24:03Z
dc.date.available2021-04-14T06:24:03Z
dc.date.issued2020sv
dc.date.submitted2020
dc.description.abstractOver the recent years, the development in deep learning has provided new approaches to molecular optimization. Molecular optimization aims to find structurally similar molecules to a given starting molecule, yielding specified improvements in terms of different molecular properties. By representing molecules as SMILES, an ap proach to encode molecules as strings of tokens, molecular optimization can be framed as a machine translation problem, where starting molecules are translated to molecules with optimized properties. Previous work has shown success for the Transformer known from natural language processing [1, 2] in the area of molecular optimization. The thesis covers two extensions of the developed Transformer model in [1] through curriculum learning and Core-Fixed formulation. Through curriculum learning, training is structured through a sequence of tasks (curriculum) based on increasing difficulty. The curriculum could either be determined while training a model (machine-based) or manually (human heuristic-based). The thesis explores various approaches to human-based curriculum learning. For the other extension, Core-Fixed formulation, the thesis provides an approach to reformulating the input and output of the original model [1], which involves specifying in the input to the translation model which part that should be fixed (core) and which part that should be exchanged (R-group) to optimize the complete molecule’s properties. The results show advantages both in training time and molecule generation performance using the Core-Fixed formulation. For curriculum learning, the results do not indicate a clear improvement. The thesis suggests looking into more sophisticated curriculum learning approaches.sv
dc.identifier.urihttps://hdl.handle.net/20.500.12380/302297
dc.language.isoengsv
dc.setspec.uppsokTechnology
dc.subjectMolecular Optimizationsv
dc.subjectMatched Molecular Pairssv
dc.subjectTransformersv
dc.subjectAD-METsv
dc.subjectMaster’s Thesissv
dc.titleMolecular Optimization using Deep Learning Extensions of the Transformer for Molecular Optimizationsv
dc.type.degreeExamensarbete för masterexamensv
dc.type.uppsokH
Ladda ner
Original bundle
Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
CSE 21-12 Forsberg Mattsson.pdf
Storlek:
2.65 MB
Format:
Adobe Portable Document Format
Beskrivning:
License bundle
Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
license.txt
Storlek:
1.14 KB
Format:
Item-specific license agreed upon to submission
Beskrivning: