TY - GEN
T1 - Exploiting lustre file joining for effective collective IO
AU - Yu, Weikuan
AU - Vetter, Jeffrey
AU - Canon, R. Shane
AU - Jiang, Song
PY - 2007
Y1 - 2007
N2 - Lustre is a parallel file system that presents high aggregated IO bandwidth by striping file extents across many storage devices. However, our experiments indicate excessively wide striping can cause performance degradation. Lustre supports an innovative file joining feature that joins files in place. To mitigate striping overhead and benefit collective 10, we propose two techniques: split writing and hierarchical striping. In split writing, a file is created as separate subfiles, each of which is striped to only a few storage devices. They are joined as a single file at the file close time. Hierarchical striping builds on top of split writing and orchestrates the span of subfiles in a hierarchical manner to avoid overlapping and achieve the appropriate coverage of storage devices. Together, these techniques can avoid the overhead associated with large stripe width, while still being able to combine bandwidth available from many storage devices. We have prototyped these techniques in the ROMIO implementation of MPI-IO. Experimental results indicate that split writing and hierarchical striping can significantly improve the performance of Lustre collective IO in terms of both data transfer and management operations. On a Lustre file system configured with 46 object storage targets, our implementation improves collective write performance of a 16-process job by as much as 220%.
AB - Lustre is a parallel file system that presents high aggregated IO bandwidth by striping file extents across many storage devices. However, our experiments indicate excessively wide striping can cause performance degradation. Lustre supports an innovative file joining feature that joins files in place. To mitigate striping overhead and benefit collective 10, we propose two techniques: split writing and hierarchical striping. In split writing, a file is created as separate subfiles, each of which is striped to only a few storage devices. They are joined as a single file at the file close time. Hierarchical striping builds on top of split writing and orchestrates the span of subfiles in a hierarchical manner to avoid overlapping and achieve the appropriate coverage of storage devices. Together, these techniques can avoid the overhead associated with large stripe width, while still being able to combine bandwidth available from many storage devices. We have prototyped these techniques in the ROMIO implementation of MPI-IO. Experimental results indicate that split writing and hierarchical striping can significantly improve the performance of Lustre collective IO in terms of both data transfer and management operations. On a Lustre file system configured with 46 object storage targets, our implementation improves collective write performance of a 16-process job by as much as 220%.
UR - http://www.scopus.com/inward/record.url?scp=34548361907&partnerID=8YFLogxK
U2 - 10.1109/CCGRID.2007.51
DO - 10.1109/CCGRID.2007.51
M3 - Conference contribution
AN - SCOPUS:34548361907
SN - 0769528333
SN - 9780769528335
T3 - Proceedings - Seventh IEEE International Symposium on Cluster Computing and the Grid, CCGrid 2007
SP - 267
EP - 274
BT - Proceedings - Seventh IEEE International Symposium on Cluster Computing and the Grid, CCGrid 2007
T2 - 7th IEEE International Symposium on Cluster Computing and the Grid, CCGrid 2007
Y2 - 14 May 2007 through 17 May 2007
ER -