Human-in-the-loop control of molecular reinforcement learning with online adaptive classifiers

dc.contributor.author: Holst, Edwin
dc.contributor.author: Mutharasu, Preetha
dc.contributor.department: Chalmers tekniska högskola / Institutionen för data och informationsteknik [sv]
dc.contributor.department: Chalmers University of Technology / Department of Computer Science and Engineering [en]
dc.contributor.examiner: Engkvist, Ola
dc.contributor.supervisor: Mercado, Rocío
dc.date.accessioned: 2023-12-22T13:39:54Z
dc.date.available: 2023-12-22T13:39:54Z
dc.date.issued: 2023
dc.date.submitted: 2023
dc.description.abstract: The early stage of drug discovery faces the significant challenge of screening a vast number of compounds to identify potential drug candidates for specific diseases. Among the range of AI-based systems used to identify or generate potential drug candidates efficiently, this thesis focuses on REINVENT, a prominent production-ready tool for de novo design. Although REINVENT offers multiple scoring options, it struggles to capture the human intuition needed to generate desired outcomes. This thesis explores the value of integrating human feedback into REINVENT through interactive visualization and online learning models. A range of methods was employed during development. First, to enhance users' understanding of generated compounds, diverse compound generation was studied, leading to an interactive visualization platform intended to enable effective user guidance. Second, to capture human preference, human feedback was integrated as a separate scoring function using online learning models. Given the constraints on time and resources, surrogate user models were employed to represent real chemists, allowing for efficient development. During this testing, various aspects of the proposed system were evaluated, including different online learning models, rating frequencies, sampling methods, and the number of rated molecules. An evaluation experiment involving eight human participants demonstrated that integrating the human-in-the-loop (HITL) system into REINVENT can accelerate the drug discovery process by combining AI capabilities with human expertise. It can effectively enhance the identification of valuable molecules, reduce compound analysis time, and ultimately result in improved patient outcomes and cost-effectiveness.
dc.identifier.coursecode: DATX05
dc.identifier.uri: http://hdl.handle.net/20.500.12380/307482
dc.language.iso: eng
dc.setspec.uppsok: Technology
dc.subject: Human-in-the-loop
dc.subject: drug discovery
dc.subject: generative AI
dc.subject: REINVENT
dc.subject: visualization
dc.subject: de novo
dc.title: Human-in-the-loop control of molecular reinforcement learning with online adaptive classifiers
dc.type.degree: Examensarbete för masterexamen [sv]
dc.type.degree: Master's Thesis [en]
dc.type.uppsok: H
local.programme: Biomedical engineering (MPBME), MSc
Download
Original bundle
Name: CSE 23-109 EH PM.pdf
Size: 8.64 MB
Format: Adobe Portable Document Format
License bundle
Name: license.txt
Size: 2.35 KB
Format: Item-specific license agreed upon to submission