Extending machine learning beyond interatomic potentials for predicting molecular properties

Nikita Fedik, Roman Zubatyuk, Maksim Kulichenko, Nicholas Lubbers, Justin S. Smith, Benjamin Nebgen, Richard Messerly, Ying Wai Li, Alexander I. Boldyrev, Kipton Barros, Olexandr Isayev, Sergei Tretiak

Research output: Contribution to journalReview articlepeer-review

60 Scopus citations

Abstract

Machine learning (ML) is becoming a method of choice for modelling complex chemical processes and materials. ML provides a surrogate model trained on a reference dataset that can be used to establish a relationship between a molecular structure and its chemical properties. This Review highlights developments in the use of ML to evaluate chemical properties such as partial atomic charges, dipole moments, spin and electron densities, and chemical bonding, as well as to obtain a reduced quantum-mechanical description. We overview several modern neural network architectures, their predictive capabilities, generality and transferability, and illustrate their applicability to various chemical properties. We emphasize that learned molecular representations resemble quantum-mechanical analogues, demonstrating the ability of the models to capture the underlying physics. We also discuss how ML models can describe non-local quantum effects. Finally, we conclude by compiling a list of available ML toolboxes, summarizing the unresolved challenges and presenting an outlook for future development. The observed trends demonstrate that this field is evolving towards physics-based models augmented by ML, which is accompanied by the development of new methods and the rapid growth of user-friendly ML frameworks for chemistry. [Figure not available: see fulltext.]

Original languageEnglish
Pages (from-to)653-672
Number of pages20
JournalNature Reviews Chemistry
Volume6
Issue number9
DOIs
StatePublished - Sep 2022
Externally publishedYes

Funding

The work at Los Alamos National Laboratory (LANL) was supported by the LANL Directed Research and Development Funds (LDRD) and performed in part at the Center for Nonlinear Studies (CNLS) and the Center for Integrated Nanotechnologies (CINT), a US Department of Energy (DOE) Office of Science user facility at LANL. N.F. and M.K. acknowledge financial support from the Director’s Postdoctoral Fellowship at LANL funded by LDRD. K.B. and S.T. acknowledge support from the US DOE, Office of Science, Basic Energy Sciences, Chemical Sciences, Geosciences, and Biosciences Division under Triad National Security (Triad) contract grant number 89233218CNA000001 (FWP: LANLE3F2). This research used resources provided by the LANL Institutional Computing Program. LANL is managed by Triad National Security for the US DOE’s NNSA, under contract number 89233218CNA000001. A.I.B. acknowledges the R. Gaurth Hansen Professorship. O.I. acknowledges support from the National Science Foundation (NSF) grants CHE-1802789 and CHE-2041108. The work performed by O.I. and R.Z. in part was made possible by the Office of Naval Research (ONR) through support provided by the Energetic Materials Program (MURI grant number N00014-21-1-2476).

FundersFunder number
Center for Nonlinear Studies
LANL Directed Research and Development Funds
National Science FoundationCHE-1802789, CHE-2041108
Office of Naval ResearchN00014-21-1-2476
U.S. Department of Energy
Office of Science
Basic Energy Sciences
Los Alamos National Laboratory
Chemical Sciences, Geosciences, and Biosciences Division89233218CNA000001, LANLE3F2
Center for Integrated Nanotechnologies

    Fingerprint

    Dive into the research topics of 'Extending machine learning beyond interatomic potentials for predicting molecular properties'. Together they form a unique fingerprint.

    Cite this