Abstract
Despite their wide and successful applications, deep learning (DL) models are prone to overfitting for small training datasets, produce a poor predictive performance for uncertain data, and provide point estimations without any indication of the accuracy and credibility. These limitations of the deterministic DL models hinder their effective application in Earth system science where the labelled data are sparse, noisy and incomplete with large uncertainty and where the predictive uncertainty quantification is needed for scientific understanding and policy decision making. Integration of Bayesian inference into DL models adds an estimate of uncertainty and regularization in the predictions. However, traditional Bayesian methods are computationally unaffordable and inflexible for high-dimensional problems, which limits their application to DL systems that typically have millions of model parameters. In this effort, we propose an efficient and general-purpose Bayesian inference method to advance DL model optimization and uncertainty quantification, so as to facilitate the adoption of DL in Earth sciences. In a demonstration, we integrate the proposed Bayesian method with a feedforward neural network (NN) to build a fast-to-evaluate surrogate of the complex Energy Exascale Earth System Land Model for efficient modeling. The formulated Bayesian NN, using a small number of training data, produces an accurate prediction with high credibility, whereas with the same small training size, the deterministic NN cannot yield a reasonable estimation and does not provide confidence information. The proposed Bayesian method is computationally efficient and flexible, capable of integration with diverse network variants such as convolutional NNs and recurrent NNs to advance the application of DL in Earth sciences.
Original language | English |
---|---|
Title of host publication | Proceedings - 19th IEEE International Conference on Data Mining Workshops, ICDMW 2019 |
Editors | Panagiotis Papapetrou, Xueqi Cheng, Qing He |
Publisher | IEEE Computer Society |
Pages | 270-278 |
Number of pages | 9 |
ISBN (Electronic) | 9781728146034 |
DOIs | |
State | Published - Nov 2019 |
Event | 19th IEEE International Conference on Data Mining Workshops, ICDMW 2019 - Beijing, China Duration: Nov 8 2019 → Nov 11 2019 |
Publication series
Name | IEEE International Conference on Data Mining Workshops, ICDMW |
---|---|
Volume | 2019-November |
ISSN (Print) | 2375-9232 |
ISSN (Electronic) | 2375-9259 |
Conference
Conference | 19th IEEE International Conference on Data Mining Workshops, ICDMW 2019 |
---|---|
Country/Territory | China |
City | Beijing |
Period | 11/8/19 → 11/11/19 |
Funding
ACKNOWLEDGMENT Primary support for this work was provided by the Scientific Discovery through Advanced Computing (SciDAC) program, funded by the U.S. Department of Energy (DOE), Office of Advanced Scientific Computing Research (ASCR) and Office of Biological and EnvironmentalResearch (BER). Additional support was provided by BER’s TerrestrialEcosystem Science Scientific Focus Area (TES-SFA) project and Oak Ridge National Laboratory (ORNL) AI initiative project. The authors are supported by ORNL, which is supported by the DOE under contract DE-AC05-00OR22725. Primary support for this work was provided by the Scientific Discovery through Advanced Computing (SciDAC) program, funded by the U.S. Department of Energy (DOE), Office of Advanced Scientific Computing Research (ASCR) and Office of Biological and Environmental Research (BER). Additional support was provided by BER's Terrestrial Ecosystem Science Scientific Focus Area (TES-SFA) project and Oak Ridge National Laboratory (ORNL) AI initiative project. The authors are supported by ORNL, which is supported by the DOE under contract DE-AC05-00OR22725.
Keywords
- Bayesian inference
- Deep learning
- Earth system science application
- Uncertainty quantification