Zero coordinate shift: Whetted automatic differentiation for physics-informed operator learning

Kuangdai Leng, Mallikarjun Shankar, Jeyan Thiyagalingam

Research output: Contribution to journalArticlepeer-review

Abstract

Automatic differentiation (AD) is a critical step in physics-informed machine learning, required for computing the high-order derivatives of network output w.r.t. coordinates of collocation points. In this paper, we present a novel and lightweight algorithm to conduct AD for physics-informed operator learning, which we call the trick of Zero Coordinate Shift (ZCS). Instead of making all sampled coordinates as leaf variables, ZCS introduces only one scalar-valued leaf variable for each spatial or temporal dimension, simplifying the wanted derivatives from “many-roots-many-leaves” to “one-root-many-leaves” whereby reverse-mode AD becomes directly utilisable. It has led to an outstanding performance leap by avoiding the duplication of the computational graph along the dimension of functions (physical parameters). ZCS is easy to implement with current deep learning libraries; our own implementation is achieved by extending the DeepXDE package. We carry out a comprehensive benchmark analysis and several case studies, training physics-informed DeepONets to solve partial differential equations (PDEs) without data. The results show that ZCS has persistently reduced GPU memory consumption and wall time for training by an order of magnitude, and such reduction factor scales with the number of functions. As a low-level optimisation technique, ZCS imposes no restrictions on data, physics (PDE) or network architecture and does not compromise training results from any aspect.

Original languageEnglish
Article number112904
JournalJournal of Computational Physics
Volume505
DOIs
StatePublished - May 15 2024

Funding

We thank the two reviewers of this paper for their constructive suggestions. We thank Lu Lu for supporting us on building ZCS into DeepXDE. This work is supported by the EPSRC grant, Blueprinting for AI for Science at Exascale (BASE-II, EP/X019918/1 ), and by the International Science Partnerships Fund (ISPF), most specifically through the AI for Realistic Science (AIRS) programme in collaboration with the Department of Energy (DOE) supported by the Oak Ridge Leadership Computing Facility (OLCF) under DOE Contract No. DE-AC05-00OR22725 .

FundersFunder number
International Science Partnerships Fund
Oak Ridge National LaboratoryDE-AC05-00OR22725
U.S. Department of Energy
Engineering and Physical Sciences Research CouncilEP/X019918/1

    Keywords

    • Automatic differentiation
    • Deep learning
    • Partial differential equations
    • Physics-informed

    Fingerprint

    Dive into the research topics of 'Zero coordinate shift: Whetted automatic differentiation for physics-informed operator learning'. Together they form a unique fingerprint.

    Cite this