TY - GEN
T1 - Spatiotemporally Adaptive Compression for Scientific Dataset with Feature Preservation - A Case Study on Simulation Data with Extreme Climate Events Analysis
AU - Gong, Qian
AU - Zhang, Chengzhu
AU - Liang, Xin
AU - Reshniak, Viktor
AU - Chen, Jieyang
AU - Rangarajan, Anand
AU - Ranka, Sanjay
AU - Vidal, Nicolas
AU - Wan, Lipeng
AU - Ullrich, Paul
AU - Podhorszki, Norbert
AU - Jacob, Robert
AU - Klasky, Scott
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - Scientific discoveries are increasingly constrained by limited storage space and I/O capacities. For time-series simulations and experiments, their data often need to be decimated over timesteps to accommodate storage and I/O limitations. In this paper, we propose a technique that addresses storage costs while improving post-analysis accuracy through spatiotemporal adaptive, error-controlled lossy compression. We investigate the trade-off between data precision and temporal output rates, revealing that reducing data precision and increasing timestep frequency lead to more accurate analysis outcomes. Additionally, we integrate spatiotemporal feature detection with data compression and demonstrate that performing adaptive error-bounded compression in higher dimensional space enables greater compression ratios, leveraging the error propagation theory of a transformation-based compressor. To evaluate our approach, we conduct experiments using the well-known E3SM climate simulation code and apply our method to compress variables used for cyclone tracking. Our results show a significant reduction in storage size while enhancing the quality of cyclone tracking analysis, both quantitatively and qualitatively, in comparison to the prevalent timestep decimation approach. Compared to three state-of-the-art lossy compressors lacking feature preservation capabilities, our adaptive compression framework improves perfectly matched cases in TC tracking by 26.4-51.3% at medium compression ratios and by 77.3-571.1% at large compression ratios, with a merely 5-11% computational overhead.
AB - Scientific discoveries are increasingly constrained by limited storage space and I/O capacities. For time-series simulations and experiments, their data often need to be decimated over timesteps to accommodate storage and I/O limitations. In this paper, we propose a technique that addresses storage costs while improving post-analysis accuracy through spatiotemporal adaptive, error-controlled lossy compression. We investigate the trade-off between data precision and temporal output rates, revealing that reducing data precision and increasing timestep frequency lead to more accurate analysis outcomes. Additionally, we integrate spatiotemporal feature detection with data compression and demonstrate that performing adaptive error-bounded compression in higher dimensional space enables greater compression ratios, leveraging the error propagation theory of a transformation-based compressor. To evaluate our approach, we conduct experiments using the well-known E3SM climate simulation code and apply our method to compress variables used for cyclone tracking. Our results show a significant reduction in storage size while enhancing the quality of cyclone tracking analysis, both quantitatively and qualitatively, in comparison to the prevalent timestep decimation approach. Compared to three state-of-the-art lossy compressors lacking feature preservation capabilities, our adaptive compression framework improves perfectly matched cases in TC tracking by 26.4-51.3% at medium compression ratios and by 77.3-571.1% at large compression ratios, with a merely 5-11% computational overhead.
KW - feature preservation
KW - region-wise error-controlled lossy compression
KW - spatiotemporal data
KW - timestep decimation
UR - http://www.scopus.com/inward/record.url?scp=85174228641&partnerID=8YFLogxK
U2 - 10.1109/e-Science58273.2023.10254796
DO - 10.1109/e-Science58273.2023.10254796
M3 - Conference contribution
AN - SCOPUS:85174228641
T3 - Proceedings 2023 IEEE 19th International Conference on e-Science, e-Science 2023
BT - Proceedings 2023 IEEE 19th International Conference on e-Science, e-Science 2023
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 19th IEEE International Conference on e-Science, e-Science 2023
Y2 - 9 October 2023 through 14 October 2023
ER -