Human-in-the-loop control of molecular reinforcement learning with online adaptive classifiers
dc.contributor.author | Holst, Edwin | |
dc.contributor.author | Mutharasu, Preetha | |
dc.contributor.department | Chalmers tekniska högskola / Institutionen för data och informationsteknik | sv |
dc.contributor.department | Chalmers University of Technology / Department of Computer Science and Engineering | en |
dc.contributor.examiner | Engkvist, Ola | |
dc.contributor.supervisor | Mercado, RocĂo | |
dc.date.accessioned | 2023-12-22T13:39:54Z | |
dc.date.available | 2023-12-22T13:39:54Z | |
dc.date.issued | 2023 | |
dc.date.submitted | 2023 | |
dc.description.abstract | The early stage of drug discovery faces significant challenges of screening through a vast number of compounds to identify potential drug candidates for specific diseases. Amidst a range of AI-based systems employed in efficiently identifying or generating potential drug candidates, this thesis focuses on REINVENT, a prominent production-ready tool for de novo design. Despite being advanced with multiple scoring options, it is challenging for REINVENT to capture human intuitions for generating desired outcomes. This thesis explores the significance of integrating human feedback to REINVENT through interactive visualization and online learning models. A range of methods have been employed during the development, First to enhance users’ understanding of generated compounds, diverse compound generation was studied, leading to an interactive visualization platform. We aim to offer a platform enabling effective user guidance. Second, to capture human preference, human feedback was integrated as a separate scoring function using online learning models. Considering the time and resources, surrogate user models were employed to represent real chemists, allowing for efficient development. During this testing, various aspects of the proposed system, including different online learning models, rating frequencies, sampling methods, and the number of rated molecules were tested and estimated. An evaluation experiment involving eight human participants demonstrated that integrating the HITL system to REINVENT can accelerate the drug discovery process by integrating AI capabilities with human expertise. It can effectively enhance the identification of valuable molecules, reduces compound analysis time, and ultimately results in improved patient outcomes and cost-effectiveness. | |
dc.identifier.coursecode | DATX05 | |
dc.identifier.uri | http://hdl.handle.net/20.500.12380/307482 | |
dc.language.iso | eng | |
dc.setspec.uppsok | Technology | |
dc.subject | Human-in-the-loop | |
dc.subject | drug discovery | |
dc.subject | generative AI | |
dc.subject | REINVENT | |
dc.subject | visualization | |
dc.subject | de novo | |
dc.title | Human-in-the-loop control of molecular reinforcement learning with online adaptive classifiers | |
dc.type.degree | Examensarbete för masterexamen | sv |
dc.type.degree | Master's Thesis | en |
dc.type.uppsok | H | |
local.programme | Biomedical engineering (MPBME), MSc |