TY - GEN
T1 - Using MPI file caching to improve parallel write performance for large-scale scientific applications
AU - Liao, Wei Keng
AU - Ching, Avery
AU - Coloma, Kenin
AU - Nisar, Arifa
AU - Choudhary, Alok
AU - Chen, Jacqueline
AU - Sankaran, Ramanan
AU - Klasky, Scott
PY - 2007
Y1 - 2007
N2 - Typical large-scale scientific applications periodically write checkpoint files to save the computational state throughout execution. Existing parallel file systems improve such write-only I/O patterns through the use of client-side file caching and write-behind strategies. In distributed environments where files are rarely accessed by more than one client concurrently, file caching has achieved significant success; however, in parallel applications where multiple clients manipulate a shared file, cache coherence control can serialize I/O. We have designed a thread based caching layer for the MPI I/O library, which adds a portable caching system closer to user applications so more information about the application's I/O patterns is available for better coherence control. We demonstrate the impact of our caching solution on parallel write performance with a comprehensive evaluation that includes a set of widely used I/O benchmarks and production application I/O kernels. (c) 2007 ACM.
AB - Typical large-scale scientific applications periodically write checkpoint files to save the computational state throughout execution. Existing parallel file systems improve such write-only I/O patterns through the use of client-side file caching and write-behind strategies. In distributed environments where files are rarely accessed by more than one client concurrently, file caching has achieved significant success; however, in parallel applications where multiple clients manipulate a shared file, cache coherence control can serialize I/O. We have designed a thread based caching layer for the MPI I/O library, which adds a portable caching system closer to user applications so more information about the application's I/O patterns is available for better coherence control. We demonstrate the impact of our caching solution on parallel write performance with a comprehensive evaluation that includes a set of widely used I/O benchmarks and production application I/O kernels. (c) 2007 ACM.
UR - http://www.scopus.com/inward/record.url?scp=56749160536&partnerID=8YFLogxK
U2 - 10.1145/1362622.1362634
DO - 10.1145/1362622.1362634
M3 - Conference contribution
AN - SCOPUS:56749160536
SN - 9781595937643
T3 - Proceedings of the 2007 ACM/IEEE Conference on Supercomputing, SC'07
BT - Proceedings of the 2007 ACM/IEEE Conference on Supercomputing, SC'07
T2 - 2007 ACM/IEEE Conference on Supercomputing, SC'07
Y2 - 10 November 2007 through 16 November 2007
ER -