Abstract
Neural networks are increasingly integrated into scientific discovery, where input data reduction and model quantization play a key role in accelerating inference. However, understanding and mitigating the impact of these techniques on output error is critical for ensuring reliable results, particularly in tasks demanding high numerical precision. This paper introduces a comprehensive framework for optimizing neural network inference in scientific computing by combining data reduction and model quantization while maintaining error-controlled outcomes. We develop theoretical analyses to bound error propagation under these techniques and propose a framework that balances computational performance with error constraints. Evaluation on real-world learning-based combustion simulations and satellite image classification shows that our derived error bounds accurately predict observed errors while enabling significant computational speedup under our framework. This work highlights the potential for further leveraging advancements in modern lossy compression algorithms and hardware accelerators that support lower-precision formats.
| Original language | English |
|---|---|
| Title of host publication | Proceedings - 2025 IEEE 41st International Conference on Data Engineering, ICDE 2025 |
| Publisher | IEEE Computer Society |
| Pages | 1869-1881 |
| Number of pages | 13 |
| ISBN (Electronic) | 9798331536039 |
| DOIs | |
| State | Published - 2025 |
| Event | 41st IEEE International Conference on Data Engineering, ICDE 2025 - Hong Kong, China Duration: May 19 2025 → May 23 2025 |
Publication series
| Name | Proceedings - International Conference on Data Engineering |
|---|---|
| ISSN (Print) | 1084-4627 |
| ISSN (Electronic) | 2375-0286 |
Conference
| Conference | 41st IEEE International Conference on Data Engineering, ICDE 2025 |
|---|---|
| Country/Territory | China |
| City | Hong Kong |
| Period | 05/19/25 → 05/23/25 |
Funding
This manuscript has been authored in part by UT-Battelle, LLC, under contract DE-AC05-00OR22725 with the US Department of Energy (DOE). The publisher, by accepting the article for publication, acknowledges that the U.S. Government retains a non-exclusive, paid up, irrevocable, worldwide license to publish or reproduce the published form of the manuscript, or allow others to do so, for U.S. Government purposes. The DOE will provide public access to these results in accordance with the DOE Public Access Plan (http://energy.gov/downloads/doe-public-access-plan).