Generating stable molecules using imitation and reinforcement learning

Ager Meldgaard, Søren and Köhler, Jonas and Lund Mortensen, Henrik and Christiansen, Mads-Peter V and Noé, Frank and Hammer, Bjørk (2022) Generating stable molecules using imitation and reinforcement learning. Machine Learning: Science and Technology, 3 (1). 015008. ISSN 2632-2153

[thumbnail of Ager_Meldgaard_2022_Mach._Learn.__Sci._Technol._3_015008.pdf] Text
Ager_Meldgaard_2022_Mach._Learn.__Sci._Technol._3_015008.pdf - Published Version

Download (14MB)

Abstract

Chemical space is routinely explored by machine learning methods to discover interesting molecules, before time-consuming experimental synthesizing is attempted. However, these methods often rely on a graph representation, ignoring 3D information necessary for determining the stability of the molecules. We propose a reinforcement learning (RL) approach for generating molecules in Cartesian coordinates allowing for quantum chemical prediction of the stability. To improve sample-efficiency we learn basic chemical rules from imitation learning (IL) on the GDB-11 database to create an initial model applicable for all stoichiometries. We then deploy multiple copies of the model conditioned on a specific stoichiometry in a RL setting. The models correctly identify low energy molecules in the database and produce novel isomers not found in the training set. Finally, we apply the model to larger molecules to show how RL further refines the IL model in domains far from the training data.

Item Type: Article
Subjects: OA Library Press > Multidisciplinary
Depositing User: Unnamed user with email support@oalibrarypress.com
Date Deposited: 06 Jul 2023 04:16
Last Modified: 17 May 2024 10:27
URI: http://archive.submissionwrite.com/id/eprint/1364

Actions (login required)

View Item
View Item