Using machine learning to discern eruption in noisy environments: A case study using CO 2 -driven cold-water geyser in Chimayó, New Mexico

Baichuan Yuan, Yen Joe Tan, Maruti K. Mudunuru, Omar E. Marcillo, Andrew A. Delorey, Peter M. Roberts, Jeremy D. Webster, Christine N.L. Gammans, Satish Karra, George D. Guthrie, Paul A. Johnson

Research output: Contribution to journalArticlepeer-review

15 Scopus citations

Abstract

We present an approach based on machine learning (ML) to distinguish eruption and precursory signals of Chimayó geyser (New Mexico, U.S.A.) under noisy environmental conditions. This geyser can be considered a natural analog of CO 2 intrusion into shallow water aquifers. By studying this geyser, we can understand upwelling of CO 2 -rich fluids from depth, which has relevance to leak monitoring in a CO 2 sequestration project. ML methods such as random forests (RFs) are known to be robust multiclass classifiers and perform well under unfavorable, noisy conditions. However, the extent of the RF method's accuracy is poorly understood for this CO 2 -driven geysering application. The current study aims to quantify the performance of RF classifiers to discern the geyser state. Toward this goal, we first present the data collected from the seismometer that is installed near the Chimayó geyser. The seismic signals collected at this site contain different types of noises such as daily temperature variations, animal movement near the geyser, and human activity. First, we filter the signals from these noises by combining the Butterworth high-pass (BH) filter and an autoregressive (AR) method in a multilevel fashion. We show that by combining these filtering techniques in a hierarchical fashion leads to a reduction in noise in the seismic data without removing the precursors and eruption event signals. We then use RF on the filtered data to classify the state of geyser into three classes: remnant noise, precursor, and eruption states. RF classifier is constructed based on the comprehensive features extracted using the Tsfresh Python package. We show that the classification accuracy using RF on the filtered data is greater than 90%.We also evaluate the accuracy of other classical time-series methods such as dynamic time warping (DTW) on filtered data along with RF on partially filtered data in which we remove the daily trends. Classification accuracy shows that DTW performs poorly (44%) and RF on partially filtered data performs decently (87%). Denoising seismic signals from both daily trends and human activity enhances RF classifier performance by 7%. These aspects make the proposed ML framework attractive for event discrimination and signal enhancement under noisy conditions, with strong potential for application to monitor leaks in CO 2 sequestration.

Original languageEnglish
Pages (from-to)591-603
Number of pages13
JournalSeismological Research Letters
Volume90
Issue number2 A
DOIs
StatePublished - Mar 2019
Externally publishedYes

Funding

The authors thank the support of the Los Alamos National Laboratory (LANL) Directed Research and Development Directed Research Award 20170004DR. The authors would like to extend their thanks to Jim Roberts of Chimayó, New Mexico, for the use of his property to perform the measurements used in this article. B. Y. and Y. J. T. like to thank the support of 2018 LANL Applied Machine Learning Summer School Fellowship. M. K. M. gratefully acknowledges the support of LANL Chick-Keller Postdoctoral Fellowship through Center for Space and Earth Sciences (CSES) and University of California (UC)/ LANL Entrepreneurial Postdoctoral Fellowship through Richard P. Feynman Center for Innovation. M. K. M., B. Y., and Y. J. T. thank Youzuo Lin for many useful discussions. M. K. M. also thanks Ting Chen, Molly Cernicek, and Don Hickmott for their inputs and feedback during the course of the project. The authors also want to acknowledge the comments provided by Karianne Bergen and the two anonymous reviewers that substantially improved the article. The authors thank the support of the Los Alamos National Laboratory (LANL) Directed Research and Development Directed Research Award 20170004DR. The authors would like to extend their thanks to Jim Roberts of Chimayó, New Mexico, for the use of his property to perform the measurements used in this article. B. Y. and Y. J. T. like to thank the support of 2018 LANL Applied Machine Learning Summer School Fellowship. M. K. M. gratefully acknowledges the support of LANL Chick- Keller Postdoctoral Fellowship through Center for Space and Earth Sciences (CSES) and University of California (UC)/ LANL Entrepreneurial Postdoctoral Fellowship through Richard P. Feynman Center for Innovation. M. K. M., B. Y., and Y. J. T. thank Youzuo Lin for many useful discussions. M. K. M. also thanks Ting Chen, Molly Cernicek, and Don Hickmott for their inputs and feedback during the course of the project. The authors also want to acknowledge the comments provided by Karianne Bergen and the two anonymous reviewers that substantially improved the article.

FundersFunder number
Center for Space and Earth Sciences
Los Alamos National Laboratory
University of California
Los Alamos National Laboratory20170004DR

    Fingerprint

    Dive into the research topics of 'Using machine learning to discern eruption in noisy environments: A case study using CO 2 -driven cold-water geyser in Chimayó, New Mexico'. Together they form a unique fingerprint.

    Cite this