Abstract
Data-driven machine learning methods have the potential to dramatically accelerate the rate of materials design over conventional human-guided approaches. These methods would help identify or, in the case of generative models, even create novel crystal structures of materials with a set of specified functional properties to then be synthesized or isolated in the laboratory. For crystal structure generation, a key bottleneck lies in developing suitable atomic structure fingerprints or representations for the machine learning model, analogous to the graph-based or SMILES representations used in molecular generation. However, finding data-efficient representations that are invariant to translations, rotations, and permutations, while remaining invertible to the Cartesian atomic coordinates remains an ongoing challenge. Here, we propose an alternative approach to this problem by taking existing non-invertible representations with the desired invariances and developing an algorithm to reconstruct the atomic coordinates through gradient-based optimization using automatic differentiation. This can then be coupled to a generative machine learning model which generates new materials within the representation space, rather than in the data-inefficient Cartesian space. In this work, we implement this end-to-end structure generation approach using atom-centered symmetry functions as the representation and conditional variational autoencoders as the generative model. We are able to successfully generate novel and valid atomic structures of sub-nanometer Pt nanoparticles as a proof of concept. Furthermore, this method can be readily extended to any suitable structural representation, thereby providing a powerful, generalizable framework towards structure-based generation.
Original language | English |
---|---|
Article number | 045018 |
Journal | Machine Learning: Science and Technology |
Volume | 3 |
Issue number | 4 |
DOIs | |
State | Published - Dec 1 2022 |
Funding
This work was supported by the Artificial Intelligence Initiative at the Oak Ridge National Laboratory (ORNL). Work was performed at Oak Ridge National Laboratory’s Center for Nanophase Materials Sciences and used resources of the Oak Ridge Leadership Computing Facility (OLCF), which are US Department of Energy Office of Science User Facilities. VF was also supported by a Eugene P Wigner Fellowship at Oak Ridge National Laboratory. ORNL is operated by UT-Battelle, LLC., for the U.S. Department of Energy under Contract DEAC05-00OR22725. This research used resources of the National Energy Research Scientific Computing Center, supported by the Office of Science of the U.S. Department of Energy under Contract No. DE-AC02-05CH11231. This work was supported by the Artificial Intelligence Initiative at the Oak Ridge National Laboratory (ORNL). Work was performed at Oak Ridge National Laboratory’s Center for Nanophase Materials Sciences and used resources of the Oak Ridge Leadership Computing Facility (OLCF), which are US Department of Energy Office of Science User Facilities. VF was also supported by a Eugene P Wigner Fellowship at Oak Ridge National Laboratory. ORNL is operated by UT-Battelle, LLC., for the U.S. Department of Energy under Contract DEAC05-00OR22725. This research used resources of the National Energy Research Scientific Computing Center, supported by the Office of Science of the U.S. Department of Energy under Contract No. DE-AC02-05CH11231.
Funders | Funder number |
---|---|
Oak Ridge National Laboratory | |
U.S. Department of Energy | DEAC05-00OR22725 |
Office of Science | DE-AC02-05CH11231 |
Oak Ridge National Laboratory |
Keywords
- atomic structure
- generative modelling
- machine learning
- materials discovery
- structure representations