Skip to main navigation Skip to search Skip to main content

Foundation Model for Lossy Compression of Spatiotemporal Scientific Data

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

We present a foundation model (FM) for lossy scientific data compression, combining a variational autoencoder (VAE) with a hyper-prior structure and a super-resolution (SR) module. The VAE framework uses hyper-priors to model latent space dependencies, enhancing compression efficiency. The SR module refines low-resolution representations into high-resolution outputs, improving reconstruction quality. By alternating between 2D and 3D convolutions, the model efficiently captures spatiotemporal correlations in scientific data while maintaining low computational cost. Experimental results demonstrate that the FM generalizes well to unseen domains and varying data shapes, achieving up to 4× higher compression ratios than state-of-the-art methods after domain-specific fine-tuning. The SR module improves compression ratio by 30% compared to simple upsampling techniques. This approach significantly reduces storage and transmission costs for large-scale scientific simulations while preserving data integrity and fidelity.

Original languageEnglish
Title of host publicationData Science
Subtitle of host publicationFoundations and Applications - 29th Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2025, Sydney, NSW, Australia, June 10-13, 2025, Proceedings
EditorsXintao Wu, Myra Spiliopoulou, Can Wang, Vipin Kumar, Longbing Cao, Xiangmin Zhou, Guansong Pang, Joao Gama
PublisherSpringer Science and Business Media Deutschland GmbH
Pages368-380
Number of pages13
ISBN (Print)9789819682942
DOIs
StatePublished - 2025
Externally publishedYes
Event29th Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2025 - Sydney, Australia
Duration: Jun 10 2025Jun 13 2025

Publication series

NameLecture Notes in Computer Science
Volume15875 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference29th Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2025
Country/TerritoryAustralia
CitySydney
Period06/10/2506/13/25

Keywords

  • Data Compression
  • Foundation Models
  • Spatiotemporal Scientific Data

Fingerprint

Dive into the research topics of 'Foundation Model for Lossy Compression of Spatiotemporal Scientific Data'. Together they form a unique fingerprint.

Cite this