Abstract
A growing number of physical objects with embedded sensors with typically high volume and frequently updated data sets has accentuated the need to develop methodologies to extract useful information from big data for supporting decision making. This study applies a suite of data analytics and core principles of data science to characterize near real-time meteorological data with a focus on extreme weather events. To highlight the applicability of this work and make it more accessible from a risk management perspective, a foundation for a software platform with an intuitive Graphical User Interface (GUI) was developed to access and analyze data from a decommissioned nuclear production complex operated by the U.S. Department of Energy (DOE, Richland, USA). Exploratory data analysis (EDA), involving classical non-parametric statistics, and machine learning (ML) techniques, were used to develop statistical summaries and learn characteristic features of key weather patterns and signatures. The new approach and GUI provide key insights into using big data and ML to assist site operation related to safety management strategies for extreme weather events. Specifically, this work offers a practical guide to analyzing long-term meteorological data and highlights the integration of ML and classical statistics to applied risk and decision science.
| Original language | English |
|---|---|
| Article number | 136 |
| Journal | Atmosphere |
| Volume | 13 |
| Issue number | 1 |
| DOIs | |
| State | Published - Jan 2022 |
| Externally published | Yes |
Funding
Acknowledgments: Pacific Northwest National Laboratory (PNNL) is operated for the U.S. DOE by Battelle Memorial Institute under Contract No. DE-AC05-76RL01830. Funding: The authors are grateful for the programmatic support from the Department of Energy (DOE) NNSA AU31 program. The opinions expressed are solely based on the research results of the authors. Computational resources are from R and Python.
Keywords
- Classification
- Exploratory data analysis
- Graphical User Interface (GUI)
- Hanford site
- Heatwave
- High wind
- Machine learning
- Meteorological data
- Random forest