Abstract
Long short-term memory (LSTM) networks have demonstrated successful applications in accurately and efficiently predicting reservoir releases from hydrometeorological drivers including reservoir storage, inflow, precipitation, and temperature. However, due to its black-box nature and lack of process-based implementation, we are unsure whether LSTM makes good predictions for the right reasons. In this work, we use an explainable machine learning (ML) method, called SHapley Additive exPlanations (SHAP), to evaluate the variable importance and variable-wise temporal importance in the LSTM model prediction. In application to 30 reservoirs over the Upper Colorado River Basin, United States, we show that LSTM can accurately predict the reservoir releases with NSE ≥ 0.69 for all the considered reservoirs despite of their diverse storage sizes, functionality, elevations, etc. Additionally, SHAP indicates that storage and inflow are more influential than precipitation and temperature. Moreover, the storage and inflow show a relatively long-term influence on the release up to 7 days and this influence decreases as the lag time increases for most reservoirs. These findings from SHAP are consistent with our physical understanding. However, in a few reservoirs, SHAP gives some temporal importances that are difficult to interpret from a hydrological point of view, probably because of its ignorance of the variable interactions. SHAP is a useful tool for black-box ML model explanations, but the hydrological processes inferred from its results should be interpreted cautiously. More investigations of SHAP and its applications in hydrological modeling is needed and will be pursued in our future study.
Original language | English |
---|---|
Article number | 1112970 |
Journal | Frontiers in Water |
Volume | 5 |
DOIs | |
State | Published - 2023 |
Funding
The major funding support for this work is provided by the waterpower project funded by the U.S. Department of Energy (DOE), Water Power Technologies Office. Additional support was from the Artificial Intelligence Initiative as part of the Laboratory Directed Research and Development Program of Oak Ridge National Laboratory, managed by UT-Battelle, LLC, for the US DOE under contract DE-AC05-00OR22725. This research was also sponsored by the Data-Driven Decision Control for Complex Systems (DnC2S) project funded by the U.S. DOE, Office of Advanced Scientific Computing Research. The work was also partially supported by the National Science Foundation under Grant No. OIA-1946093 and its subaward No. EPSCoR-2020-3, and the Oklahoma EPSCoR Research Seed Grand supported by NSF. This work was also partially supported by U.S. Department of Defense, Army Corps of Engineers (DOD-COR) Engineering With Nature (EWN) Program (Award No. W912HZ-21-2-0038).
Funders | Funder number |
---|---|
Artificial Intelligence Initiative | |
DOD-COR | W912HZ-21-2-0038 |
Oklahoma EPSCoR | |
National Science Foundation | EPSCoR-2020-3, OIA-1946093 |
U.S. Department of Defense | |
U.S. Department of Energy | DE-AC05-00OR22725 |
Advanced Scientific Computing Research | |
Oak Ridge National Laboratory | |
U.S. Army Corps of Engineers | |
Water Power Technologies Office |
Keywords
- SHAP
- deep learning
- hydrometeorological factor
- long short-term memory network
- reservoir release
- temporal importance