Human-in-the-loop control of molecular reinforcement learning with online adaptive classifiers

Holst, Edwin; Mutharasu, Preetha

Human-in-the-loop control of molecular reinforcement learning with online adaptive classifiers

dc.contributor.author	Holst, Edwin
dc.contributor.author	Mutharasu, Preetha
dc.contributor.department	Chalmers tekniska högskola / Institutionen för data och informationsteknik	sv
dc.contributor.department	Chalmers University of Technology / Department of Computer Science and Engineering	en
dc.contributor.examiner	Engkvist, Ola
dc.contributor.supervisor	Mercado, Rocío
dc.date.accessioned	2023-12-22T13:39:54Z
dc.date.available	2023-12-22T13:39:54Z
dc.date.issued	2023
dc.date.submitted	2023
dc.description.abstract	The early stage of drug discovery faces significant challenges of screening through a vast number of compounds to identify potential drug candidates for specific diseases. Amidst a range of AI-based systems employed in efficiently identifying or generating potential drug candidates, this thesis focuses on REINVENT, a prominent production-ready tool for de novo design. Despite being advanced with multiple scoring options, it is challenging for REINVENT to capture human intuitions for generating desired outcomes. This thesis explores the significance of integrating human feedback to REINVENT through interactive visualization and online learning models. A range of methods have been employed during the development, First to enhance users’ understanding of generated compounds, diverse compound generation was studied, leading to an interactive visualization platform. We aim to offer a platform enabling effective user guidance. Second, to capture human preference, human feedback was integrated as a separate scoring function using online learning models. Considering the time and resources, surrogate user models were employed to represent real chemists, allowing for efficient development. During this testing, various aspects of the proposed system, including different online learning models, rating frequencies, sampling methods, and the number of rated molecules were tested and estimated. An evaluation experiment involving eight human participants demonstrated that integrating the HITL system to REINVENT can accelerate the drug discovery process by integrating AI capabilities with human expertise. It can effectively enhance the identification of valuable molecules, reduces compound analysis time, and ultimately results in improved patient outcomes and cost-effectiveness.
dc.identifier.coursecode	DATX05
dc.identifier.uri	http://hdl.handle.net/20.500.12380/307482
dc.language.iso	eng
dc.setspec.uppsok	Technology
dc.subject	Human-in-the-loop
dc.subject	drug discovery
dc.subject	generative AI
dc.subject	REINVENT
dc.subject	visualization
dc.subject	de novo
dc.title	Human-in-the-loop control of molecular reinforcement learning with online adaptive classifiers
dc.type.degree	Examensarbete för masterexamen	sv
dc.type.degree	Master's Thesis	en
dc.type.uppsok	H
local.programme	Biomedical engineering (MPBME), MSc

Ladda ner

Original bundle

Visar 1 - 1 av 1

Namn:: CSE 23-109 EH PM.pdf
Storlek:: 8.64 MB
Format:: Adobe Portable Document Format

Ladda ner

License bundle

Visar 1 - 1 av 1

Namn:: license.txt
Storlek:: 2.35 KB
Format:: Item-specific license agreed upon to submission
Beskrivning:

Ladda ner

Samlingar

Examensarbeten för masterexamen