Abstract
Scientific simulations on high-performance computing systems produce vast amounts of data that need to be stored and analyzed efficiently. Lossy compression significantly reduces the data volume by trading accuracy for performance. Despite the recent success of lossy compressions, such as ZFP and SZ, the compression performance is still far from being able to keep up with the exponential growth of data. This article aims to further take advantage of application characteristics, an area that is often under-explored, to improve the compression ratios of adaptive mesh refinement (AMR) - a widely used numerical solver that allows for an improved resolution in limited regions. We propose a level reordering technique zMeshto reduce the storage footprint of AMR applications. In particular, we group the data points that are mapped to the same or adjacent geometric coordinates such that the dataset is smoother and more compressible. Unlike the prior work where the compression performance is affected by the overhead of metadata, this work re-generates the restore recipe using a chained tree structure, thus involving no extra storage overhead for compressed data, which substantially improves the compression ratios. We further derive a mathematical proof that lays the foundation for our method. The results demonstrate that zMesh can improve the smoothness of data by 67.9% and 71.3% for Z-ordering and Hilbert, respectively. Overall, zMesh improves the compression ratios by up to 16.5% and 133.7% for ZFP and SZ, respectively. Despite that zMesh involves additional compute overhead for tree and restore recipe construction, we show that the cost can be amortized as the number of quantities to be compressed increases.
Original language | English |
---|---|
Pages (from-to) | 3702-3717 |
Number of pages | 16 |
Journal | IEEE Transactions on Parallel and Distributed Systems |
Volume | 33 |
Issue number | 12 |
DOIs | |
State | Published - Dec 1 2022 |
Funding
This work was supported in part by the Key-Area Research and Development Program of Guangdong Province under Grant 2021B0101190004, in part by the National Natural Science Foundation of China under Grant 62102141, and in part by US NSF under Grants CCF-1718297 and CCF-1812861. This research used resources of the Oak Ridge Leadership Computing Facility at the Oak Ridge National Laboratory, in part by the Office of Science of the U.S. Department of Energy under Grant DE-AC05-00OR22725.
Funders | Funder number |
---|---|
National Science Foundation | CCF-1718297, CCF-1812861 |
U.S. Department of Energy | DE-AC05-00OR22725 |
Office of Science | |
Oak Ridge National Laboratory | |
National Natural Science Foundation of China | 62102141 |
Special Project for Research and Development in Key areas of Guangdong Province | 2021B0101190004 |
Keywords
- High-performance computing
- adaptive mesh refinement (AMR)
- data reduction
- lossy compression