Abstract
Discrete wavelet methods, originally formulated in the setting of regularly sampled signals, can be adapted to data defined on a point cloud if some multiresolution structure is imposed on the cloud. A wide variety of hierarchical clustering algorithms can be used for this purpose, and the multiresolution structure obtained can be encoded by a hierarchical tree of subsets of the cloud. Prior work introduced the use of Haar-like bases defined with respect to such trees for approximation and learning tasks on unstructured data. This paper builds on that work in two directions. First, we present an algorithm for constructing Haar-like bases on general discrete hierarchical trees. Second, with an eye towards data compression, we present thresholding techniques for data defined on a point cloud with error controlled in the L∞ norm and in a Hölder-type norm. In a concluding trio of numerical examples, we apply our methods to compress a point cloud dataset, study the tightness of the L∞ error bound, and use thresholding to identify MNIST classifiers with good generalizability.
Original language | English |
---|---|
Article number | 3 |
Journal | Journal of Scientific Computing |
Volume | 99 |
Issue number | 1 |
DOIs | |
State | Published - Apr 2024 |
Funding
This material is based upon work supported by the US Department of Energy, Office of Science, Office of Advanced Scientific Computing Research, Scientific Discovery through Advanced Computing (SciDAC) program under the FASTMath institute and the scientific data compression project.
Funders | Funder number |
---|---|
FASTMath Institute | |
U.S. Department of Energy | |
Office of Science | |
Advanced Scientific Computing Research | DE-SC0022297 - (FASTMath) |
Keywords
- 65T60
- Euclidean metric approximation
- Lossy compression
- Unstructured data