Improving materials property predictions for graph neural networks with minimal feature engineering

Guojing Cong, Victor Fung

Research output: Contribution to journalArticlepeer-review

5 Scopus citations

Abstract

Graph neural networks (GNNs) have been employed in materials research to predict physical and functional properties, and have achieved superior performance in several application domains over prior machine learning approaches. Recent studies incorporate features of increasing complexity such as Gaussian radial functions, plane wave functions, and angular terms to augment the neural network models, with the expectation that these features are critical for achieving a high performance. Here, we propose a GNN that adopts edge convolution where hidden edge features evolve during training and extensive attention mechanisms, and operates on simple graphs with atoms as nodes and distances between them as edges. As a result, the same model can be used for very different tasks as no other domain-specific features are used. With a model that uses no feature engineering, we achieve performance comparable with state-of-the-art models with elaborate features for formation energy and band gap prediction with standard benchmarks; we achieve even better performance when the dataset size increases. Although some domain-specific datasets still require hand-crafted features to achieve state-of-the-art results, our selected architecture choices greatly reduce the need for elaborate feature engineering and still maintain predictive power in comparison.

Original languageEnglish
Article number035030
JournalMachine Learning: Science and Technology
Volume4
Issue number3
DOIs
StatePublished - Sep 1 2023

Funding

This research used resources of the Compute and Data Environment for Science (CADES) at the Oak Ridge National Laboratory, which is supported by the Office of Science of the U.S. Department of Energy under Contract No. DE-AC05-00OR22725. This material is based upon work supported in part by the U.S. Department of Energy, Office of Science, Office of Advanced Scientific Computing Research, under Contract Number DE-AC05-00OR22725, and in part by the Laboratory Directed Research and Development Program of Oak Ridge National Laboratory, managed by UT-Battelle, LLC.

FundersFunder number
CADES
Data Environment for Science
U.S. Department of Energy
Office of Science
Advanced Scientific Computing ResearchDE-AC05-00OR22725
Oak Ridge National Laboratory
UT-Battelle

    Keywords

    • GNN
    • feature engineering
    • materials

    Fingerprint

    Dive into the research topics of 'Improving materials property predictions for graph neural networks with minimal feature engineering'. Together they form a unique fingerprint.

    Cite this