Reinforcement Learning-Guided Generative Design of Surfactant Molecules - Exploring Reward Functions for Multi-Objective Molecular Optimisation

dc.contributor.authorÖhman, Hannes
dc.contributor.departmentChalmers tekniska högskola / Institutionen för data och informationstekniksv
dc.contributor.departmentChalmers University of Technology / Department of Computer Science and Engineeringen
dc.contributor.examinerMercado Oropeza, Rocío
dc.contributor.supervisorBeckmann, Richard
dc.date.accessioned2026-06-30T08:50:33Z
dc.date.issued
dc.date.submitted
dc.description.abstractSurfactants are chemical compounds with amphiphilic molecular structures that find application across a wide range of industries, from cleaning products and food processing to pharmaceuticals and waterproof textiles. Despite their ubiquity, many surfactants in current industrial use raise significant environmental concerns, and the physicochemical performance of even commonly used compounds often falls far short of what is theoreti cally achievable. Designing better surfactants through conventional experimental methods is slow and costly, motivating the development of computational approaches capable of navigating the vast space of possible molecular structures more intelligently. This thesis presents a computational framework for the inverse design of surfactant molecules with improved physicochemical properties, combining deep generative modelling with reinforcement learning and active learning. The framework is built on the REINVENT4 platform, employing a recurrent neural network pretrained on a large chemical database and subsequently fine-tuned on the SurfPro surfactant dataset through transfer learn ing. Reinforcement learning is then applied to guide generation toward molecules with favourable values of critical micelle concentration (pCMC) and surface tension, using surrogate models as the property oracle. Three distinct optimisation strategies are in vestigated and compared: iterative retraining, Pareto Gradient scoring, and Origin Pull scoring. The results demonstrate that the framework reliably generates chemically valid and novel molecules that extend beyond the Pareto front of the SurfPro training dataset according to surrogate model predictions. A key finding is that balanced multi-objective scoring functions both outperform single-objective weightings and better preserve chemical diver sity throughout training. However, a recurring failure mode termed here the hydrocarbon trap is identified, in which the optimisation converges on structurally degenerate molecules that lack the amphiphilic head-tail architecture required of functional surfactants. A post-processing filter based on physicochemical descriptors derived from the SurfPro dataset is introduced to screen out these spurious candidates and recover the viable subset of generated molecules. Taken together, the results constitute a proof of concept that reinforcement learning guided generative models are capable of exploring surfactant chemical space in a principled and targeted manner. While the generated candidates require experimental validation before their practical utility can be confirmed, this framework represents a promising foundation for future, more refined computational campaigns aimed at discovering next generation surfactants.
dc.identifier.coursecodeDATX05
dc.identifier.urihttps://hdl.handle.net/20.500.12380/311650
dc.language.isoeng
dc.setspec.uppsokTechnology
dc.subjectIndustrial Chemistry, Data Science, Generative Models, AI, Computer Science, Surfactants, Molecular Engineering, Molecular Design
dc.titleReinforcement Learning-Guided Generative Design of Surfactant Molecules - Exploring Reward Functions for Multi-Objective Molecular Optimisation
dc.type.degreeExamensarbete för masterexamensv
dc.type.degreeMaster's Thesisen
dc.type.uppsokH
local.programmeComplex adaptive systems (MPCAS), MSc

Ladda ner

Original bundle

Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
CSE 26-39 HÖ.pdf
Size:
2.82 MB
Format:
Adobe Portable Document Format

License bundle

Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
license.txt
Size:
2.35 KB
Format:
Item-specific license agreed upon to submission
Description: