RMG Database for Chemical Property Prediction

Matthew S. Johnson, Xiaorui Dong, Alon Grinberg Dana, Yunsie Chung, David Farina, Ryan J. Gillis, Mengjie Liu, Nathan W. Yee, Katrin Blondal, Emily Mazeau, Colin A. Grambow, A. Mark Payne, Kevin A. Spiekermann, Hao Wei Pang, C. Franklin Goldsmith, Richard H. West, William H. Green

Research output: Contribution to journalArticlepeer-review

67 Scopus citations

Abstract

The Reaction Mechanism Generator (RMG) database for chemical property prediction is presented. The RMG database consists of curated datasets and estimators for accurately predicting the parameters necessary for constructing a wide variety of chemical kinetic mechanisms. These datasets and estimators are mostly published and enable prediction of thermodynamics, kinetics, solvation effects, and transport properties. For thermochemistry prediction, the RMG database contains 45 libraries of thermochemical parameters with a combination of 4564 entries and a group additivity scheme with 9 types of corrections including radical, polycyclic, and surface absorption corrections with 1580 total curated groups and parameters for a graph convolutional neural network trained using transfer learning from a set of >130 000 DFT calculations to 10 000 high-quality values. Correction schemes for solvent-solute effects, important for thermochemistry in the liquid phase, are available. They include tabulated values for 195 pure solvents and 152 common solutes and a group additivity scheme for predicting the properties of arbitrary solutes. For kinetics estimation, the database contains 92 libraries of kinetic parameters containing a combined 21000 reactions and contains rate rule schemes for 87 reaction classes trained on 8655 curated training reactions. Additional libraries and estimators are available for transport properties. All of this information is easily accessible through the graphical user interface at https://rmg.mit.edu. Bulk or on-the-fly use can be facilitated by interfacing directly with the RMG Python package which can be installed from Anaconda. The RMG database provides kineticists with easy access to estimates of the many parameters they need to model and analyze kinetic systems. This helps to speed up and facilitate kinetic analysis by enabling easy hypothesis testing on pathways, by providing parameters for model construction, and by providing checks on kinetic parameters from other sources.

Original languageEnglish
Pages (from-to)4906-4915
Number of pages10
JournalJournal of Chemical Information and Modeling
Volume62
Issue number20
DOIs
StatePublished - Oct 24 2022
Externally publishedYes

Funding

Funding from the Gas Phase Chemical Physics Program of the US Department of Energy, Office of Basic Energy Sciences, Division of Chemical Sciences, Geosciences, and Biosciences (under award number DESC0014901), is appreciated. The work was also supported by the U.S. Department of Energy, Office of Science, Basic Energy Sciences, through the Exascale Catalytic Chemistry (ECC) Project as part of the Computational Chemical Sciences Program, and by the National Science Foundation under grant no. 1751720. A.G.D. was supported by The George J. Elbaum Scholarship in Engineering, The Ed Satell Foundation, and The Zuckerman STEM Leadership Program. H.P. was supported by the Think Global Education Trust Scholarship.

FundersFunder number
Exascale Catalytic Chemistry
Satell Foundation
National Science Foundation1751720
U.S. Department of Energy
Office of Science
Basic Energy Sciences
Chemical Sciences, Geosciences, and Biosciences DivisionDESC0014901

    Fingerprint

    Dive into the research topics of 'RMG Database for Chemical Property Prediction'. Together they form a unique fingerprint.

    Cite this