Implicit Graph Neural Networks: A Monotone Operator Viewpoint

Justin Baker, Qingsong Wang, Cory Hauck, Bao Wang

Research output: Contribution to journalConference articlepeer-review

3 Scopus citations

Abstract

Implicit graph neural networks (IGNNs) - that solve a fixed-point equilibrium equation using Picard iteration for representation learning - have shown remarkable performance in learning long-range dependencies (LRD) in the underlying graphs. However, IGNNs suffer from several issues, including 1) their expressivity is limited by their parameterizations for the well-posedness guarantee, 2) IGNNs are unstable in learning LRD, and 3) IGNNs become computationally inefficient when learning LRD. In this paper, we provide a new well-posedness characterization for IGNNs leveraging monotone operator theory, resulting in a much more expressive parameterization than the existing one. We also propose an orthogonal parameterization for IGNN based on Cayley transform to stabilize learning LRD. Furthermore, we leverage Anderson-accelerated operator splitting schemes to efficiently solve for the fixed point of the equilibrium equation of IGNN with monotone or orthogonal parameterization. We verify the computational efficiency and accuracy of the new models over existing IGNNs on various graph learning tasks at both graph and node levels. Code is available at https://github.com/Utah-Math-Data-Science/MIGNN.

Original languageEnglish
Pages (from-to)1521-1548
Number of pages28
JournalProceedings of Machine Learning Research
Volume202
StatePublished - 2023
Event40th International Conference on Machine Learning, ICML 2023 - Honolulu, United States
Duration: Jul 23 2023Jul 29 2023

Funding

This material is based on research sponsored by NSF grants DMS-1952339, DMS-2152762, and DMS-2208361, DOE grant DE-SC0021142 and DE-SC0023490. Moreover, this material is based, in part, upon work supported by the U.S. Department of Energy, Office of Science, Office of Advanced Scientific Computing Research, as part of their Applied Mathematics Research Program. The work was performed at the Oak Ridge National Laboratory, which is managed by UT-Battelle, LLC under Contract No. DeAC05-00OR22725. The United States Government retains and the publisher, by accepting the article for publication, acknowledges that the United States Government retains a non-exclusive, paid-up, irrevocable, world-wide license to publish or reproduce the published form of this manuscript, or allow others to do so, for the United States Government purposes. The Department of Energy will provide public access to these results of federally sponsored research in accordance with the DOE Public Access Plan (http://energy.gov/downloads/doe-public-access-plan). This material is based on research sponsored by NSF grants DMS-1952339, DMS-2152762, and DMS-2208361, DOE grant DE-SC0021142 and DE-SC0023490. Moreover, this material is based, in part, upon work supported by the U.S. Department of Energy, Office of Science, Office of Advanced Scientific Computing Research, as part of their Applied Mathematics Research Program. The work was performed at the Oak Ridge National Laboratory, which is managed by UT-Battelle, LLC under Contract No. De-AC05-00OR22725. The United States Government retains and the publisher, by accepting the article for publication, acknowledges that the United States Government retains a non-exclusive, paid-up, irrevocable, world-wide license to publish or reproduce the published form of this manuscript, or allow others to do so, for the United States Government purposes. The Department of Energy will provide public access to these results of federally sponsored research in accordance with the DOE Public Access Plan (http://energy.gov/downloads/doe-public-access-plan).

FundersFunder number
DOE Public Access Plan
United States Government
National Science FoundationDMS-2208361, DMS-1952339, DMS-2152762
U.S. Department of EnergyDE-SC0021142, DE-SC0023490
Office of Science
Advanced Scientific Computing ResearchDe-AC05-00OR22725

    Fingerprint

    Dive into the research topics of 'Implicit Graph Neural Networks: A Monotone Operator Viewpoint'. Together they form a unique fingerprint.

    Cite this