Abstract
Machine learning (ML) is becoming a method of choice for modelling complex chemical processes and materials. ML provides a surrogate model trained on a reference dataset that can be used to establish a relationship between a molecular structure and its chemical properties. This Review highlights developments in the use of ML to evaluate chemical properties such as partial atomic charges, dipole moments, spin and electron densities, and chemical bonding, as well as to obtain a reduced quantum-mechanical description. We overview several modern neural network architectures, their predictive capabilities, generality and transferability, and illustrate their applicability to various chemical properties. We emphasize that learned molecular representations resemble quantum-mechanical analogues, demonstrating the ability of the models to capture the underlying physics. We also discuss how ML models can describe non-local quantum effects. Finally, we conclude by compiling a list of available ML toolboxes, summarizing the unresolved challenges and presenting an outlook for future development. The observed trends demonstrate that this field is evolving towards physics-based models augmented by ML, which is accompanied by the development of new methods and the rapid growth of user-friendly ML frameworks for chemistry. [Figure not available: see fulltext.]
Original language | English |
---|---|
Pages (from-to) | 653-672 |
Number of pages | 20 |
Journal | Nature Reviews Chemistry |
Volume | 6 |
Issue number | 9 |
DOIs | |
State | Published - Sep 2022 |
Externally published | Yes |
Funding
The work at Los Alamos National Laboratory (LANL) was supported by the LANL Directed Research and Development Funds (LDRD) and performed in part at the Center for Nonlinear Studies (CNLS) and the Center for Integrated Nanotechnologies (CINT), a US Department of Energy (DOE) Office of Science user facility at LANL. N.F. and M.K. acknowledge financial support from the Director’s Postdoctoral Fellowship at LANL funded by LDRD. K.B. and S.T. acknowledge support from the US DOE, Office of Science, Basic Energy Sciences, Chemical Sciences, Geosciences, and Biosciences Division under Triad National Security (Triad) contract grant number 89233218CNA000001 (FWP: LANLE3F2). This research used resources provided by the LANL Institutional Computing Program. LANL is managed by Triad National Security for the US DOE’s NNSA, under contract number 89233218CNA000001. A.I.B. acknowledges the R. Gaurth Hansen Professorship. O.I. acknowledges support from the National Science Foundation (NSF) grants CHE-1802789 and CHE-2041108. The work performed by O.I. and R.Z. in part was made possible by the Office of Naval Research (ONR) through support provided by the Energetic Materials Program (MURI grant number N00014-21-1-2476).
Funders | Funder number |
---|---|
Center for Nonlinear Studies | |
LANL Directed Research and Development Funds | |
National Science Foundation | CHE-1802789, CHE-2041108 |
Office of Naval Research | N00014-21-1-2476 |
U.S. Department of Energy | |
Office of Science | |
Basic Energy Sciences | |
Los Alamos National Laboratory | |
Chemical Sciences, Geosciences, and Biosciences Division | 89233218CNA000001, LANLE3F2 |
Center for Integrated Nanotechnologies |