TY - GEN
T1 - Entropy-driven Optimal Sub-sampling of Fluid Dynamics for Developing Machine-learned Surrogates
AU - Brewer, Wesley
AU - Martinez, Daniel
AU - Gopalakrishnan Meena, Muralikrishnan
AU - Kashi, Aditya
AU - Borowiec, Katarzyna
AU - Liu, Siyan
AU - Pilmaier, Christopher
AU - Burgreen, Greg
AU - Bhushan, Shanti
N1 - Publisher Copyright:
© 2023 ACM.
PY - 2023/11/12
Y1 - 2023/11/12
N2 - Optimal sub-sampling of large datasets from fluid dynamics simulations is essential for training reduced-order machine learned models. A method using Shannon entropy was developed to weight flow features according to their level of information content, such that the most informative features can be extracted and used for training a surrogate model. The method is demonstrated in the canonical flow over a cylinder problem simulated with OpenFOAM. Both time-independent predictions and temporal forecasting were investigated as well as two types of prediction targets: local per-grid-point predictions and global per-time-step predictions. When tested on training a surrogate model, results indicate that our entropy-based sampling method typically outperforms random sampling and yields more reproducible results in less iterations. Finally, the method was used to train a surrogate model for modeling turbulence in magnetohydrodynamic flows, which revealed various challenges and opportunities for future research.
AB - Optimal sub-sampling of large datasets from fluid dynamics simulations is essential for training reduced-order machine learned models. A method using Shannon entropy was developed to weight flow features according to their level of information content, such that the most informative features can be extracted and used for training a surrogate model. The method is demonstrated in the canonical flow over a cylinder problem simulated with OpenFOAM. Both time-independent predictions and temporal forecasting were investigated as well as two types of prediction targets: local per-grid-point predictions and global per-time-step predictions. When tested on training a surrogate model, results indicate that our entropy-based sampling method typically outperforms random sampling and yields more reproducible results in less iterations. Finally, the method was used to train a surrogate model for modeling turbulence in magnetohydrodynamic flows, which revealed various challenges and opportunities for future research.
KW - clustering
KW - maximum entropy
KW - reduced-order
KW - sampling
KW - surrogate
UR - http://www.scopus.com/inward/record.url?scp=85178160342&partnerID=8YFLogxK
U2 - 10.1145/3624062.3626084
DO - 10.1145/3624062.3626084
M3 - Conference contribution
AN - SCOPUS:85178160342
T3 - ACM International Conference Proceeding Series
SP - 73
EP - 80
BT - Proceedings of 2023 SC Workshops of the International Conference on High Performance Computing, Network, Storage, and Analysis, SC Workshops 2023
PB - Association for Computing Machinery
T2 - 2023 International Conference on High Performance Computing, Network, Storage, and Analysis, SC Workshops 2023
Y2 - 12 November 2023 through 17 November 2023
ER -