Molecular Optimization using Deep Learning Extensions of the Transformer for Molecular Optimization

Forsberg, Marcus; Mattsson, Felix

Molecular Optimization using Deep Learning Extensions of the Transformer for Molecular Optimization

dc.contributor.author	Forsberg, Marcus
dc.contributor.author	Mattsson, Felix
dc.contributor.department	Chalmers tekniska högskola / Institutionen för data och informationsteknik	sv
dc.date.accessioned	2021-04-14T06:24:03Z
dc.date.available	2021-04-14T06:24:03Z
dc.date.issued	2020	sv
dc.date.submitted	2020
dc.description.abstract	Over the recent years, the development in deep learning has provided new approaches to molecular optimization. Molecular optimization aims to find structurally similar molecules to a given starting molecule, yielding specified improvements in terms of different molecular properties. By representing molecules as SMILES, an ap proach to encode molecules as strings of tokens, molecular optimization can be framed as a machine translation problem, where starting molecules are translated to molecules with optimized properties. Previous work has shown success for the Transformer known from natural language processing [1, 2] in the area of molecular optimization. The thesis covers two extensions of the developed Transformer model in [1] through curriculum learning and Core-Fixed formulation. Through curriculum learning, training is structured through a sequence of tasks (curriculum) based on increasing difficulty. The curriculum could either be determined while training a model (machine-based) or manually (human heuristic-based). The thesis explores various approaches to human-based curriculum learning. For the other extension, Core-Fixed formulation, the thesis provides an approach to reformulating the input and output of the original model [1], which involves specifying in the input to the translation model which part that should be fixed (core) and which part that should be exchanged (R-group) to optimize the complete molecule’s properties. The results show advantages both in training time and molecule generation performance using the Core-Fixed formulation. For curriculum learning, the results do not indicate a clear improvement. The thesis suggests looking into more sophisticated curriculum learning approaches.	sv
dc.identifier.uri	https://hdl.handle.net/20.500.12380/302297
dc.language.iso	eng	sv
dc.setspec.uppsok	Technology
dc.subject	Molecular Optimization	sv
dc.subject	Matched Molecular Pairs	sv
dc.subject	Transformer	sv
dc.subject	AD-MET	sv
dc.subject	Master’s Thesis	sv
dc.title	Molecular Optimization using Deep Learning Extensions of the Transformer for Molecular Optimization	sv
dc.type.degree	Examensarbete för masterexamen	sv
dc.type.uppsok	H

Ladda ner

Original bundle

Visar 1 - 1 av 1

Namn:: CSE 21-12 Forsberg Mattsson.pdf
Storlek:: 2.65 MB
Format:: Adobe Portable Document Format
Beskrivning:

Ladda ner

License bundle

Visar 1 - 1 av 1

Namn:: license.txt
Storlek:: 1.14 KB
Format:: Item-specific license agreed upon to submission
Beskrivning:

Ladda ner

Samlingar

Examensarbeten för masterexamen