Abstract
Predicting the properties of a molecule from its structure is a challenging task. Recently, deep learning methods have improved the state of the art for this task because of their ability to learn useful features from the given data. By treating molecule structure as graphs, where atoms and bonds are modeled as nodes and edges, graph neural networks (GNNs) have been widely used to predict molecular properties. However, the design and development of GNNs for a given dataset rely on labor-intensive design and tuning of the network architectures. Neural architecture search (NAS) is a promising approach to discover high-performing neural network architectures automatically. To that end, we develop an NAS approach to automate the design and development of GNNs for molecular property prediction. Specifically, w e f ocus o n a utomated d evelopment of message-passing neural networks (MPNNs) to predict the molecular properties of small molecules in quantum mechanics and physical chemistry datasets from the MoleculeNet benchmark. We demonstrate the superiority of the automatically discovered MPNNs by comparing them with manually designed GNNs from the MoleculeNet benchmark. We demonstrate that customizing the architecture is critical to enhancing performance in molecular property prediction and that the proposed approach can perform customization automatically with minimal manual effort.
Original language | English |
---|---|
Title of host publication | Proceedings - 2020 IEEE International Conference on Big Data, Big Data 2020 |
Editors | Xintao Wu, Chris Jermaine, Li Xiong, Xiaohua Tony Hu, Olivera Kotevska, Siyuan Lu, Weijia Xu, Srinivas Aluru, Chengxiang Zhai, Eyhab Al-Masri, Zhiyuan Chen, Jeff Saltz |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 1346-1353 |
Number of pages | 8 |
ISBN (Electronic) | 9781728162515 |
DOIs | |
State | Published - Dec 10 2020 |
Externally published | Yes |
Event | 8th IEEE International Conference on Big Data, Big Data 2020 - Virtual, Atlanta, United States Duration: Dec 10 2020 → Dec 13 2020 |
Publication series
Name | Proceedings - 2020 IEEE International Conference on Big Data, Big Data 2020 |
---|
Conference
Conference | 8th IEEE International Conference on Big Data, Big Data 2020 |
---|---|
Country/Territory | United States |
City | Virtual, Atlanta |
Period | 12/10/20 → 12/13/20 |
Funding
This material is based on work supported by the U.S. Department of Energy (DOE), Office of Science, Office of Advanced Scientific Computing Research, under Contract DE-AC02-06CH11357. We gratefully acknowledge the computing resources provided on Bebop, a high-performance computing cluster operated by the Laboratory Computing Resource Center at Argonne National Laboratory.
Keywords
- AutoML
- deep learning
- graph neural networks
- neural architecture search
- regularized evolution