Big Data Analytics for Long-Term Meteorological Observations at Hanford Site

  • Huifen Zhou
  • , Huiying Ren
  • , Patrick Royer
  • , Hongfei Hou
  • , Xiao Ying Yu

Research output: Contribution to journalArticlepeer-review

13 Scopus citations

Abstract

A growing number of physical objects with embedded sensors with typically high volume and frequently updated data sets has accentuated the need to develop methodologies to extract useful information from big data for supporting decision making. This study applies a suite of data analytics and core principles of data science to characterize near real-time meteorological data with a focus on extreme weather events. To highlight the applicability of this work and make it more accessible from a risk management perspective, a foundation for a software platform with an intuitive Graphical User Interface (GUI) was developed to access and analyze data from a decommissioned nuclear production complex operated by the U.S. Department of Energy (DOE, Richland, USA). Exploratory data analysis (EDA), involving classical non-parametric statistics, and machine learning (ML) techniques, were used to develop statistical summaries and learn characteristic features of key weather patterns and signatures. The new approach and GUI provide key insights into using big data and ML to assist site operation related to safety management strategies for extreme weather events. Specifically, this work offers a practical guide to analyzing long-term meteorological data and highlights the integration of ML and classical statistics to applied risk and decision science.

Original languageEnglish
Article number136
JournalAtmosphere
Volume13
Issue number1
DOIs
StatePublished - Jan 2022
Externally publishedYes

Funding

Acknowledgments: Pacific Northwest National Laboratory (PNNL) is operated for the U.S. DOE by Battelle Memorial Institute under Contract No. DE-AC05-76RL01830. Funding: The authors are grateful for the programmatic support from the Department of Energy (DOE) NNSA AU31 program. The opinions expressed are solely based on the research results of the authors. Computational resources are from R and Python.

Keywords

  • Classification
  • Exploratory data analysis
  • Graphical User Interface (GUI)
  • Hanford site
  • Heatwave
  • High wind
  • Machine learning
  • Meteorological data
  • Random forest

Fingerprint

Dive into the research topics of 'Big Data Analytics for Long-Term Meteorological Observations at Hanford Site'. Together they form a unique fingerprint.

Cite this