TY - JOUR
T1 - Data locality optimization for synthesis of efficient out-of-core algorithms
AU - Krishnan, Sandhya
AU - Krishnamoorthy, Sriram
AU - Baumgartner, Gerald
AU - Cociorva, Daniel
AU - Lam, Chi Chung
AU - Sadayappan, P.
AU - Ramanujam, J.
AU - Bernholdt, David E.
AU - Choppella, Venkatesh
PY - 2003
Y1 - 2003
N2 - This paper describes an approach to synthesis of efficient out-of-core code for a class of imperfectly nested loops that represent tensor contraction computations. Tensor contraction expressions arise in many accurate computational models of electronic structure. The developed approach combines loop fusion with loop tiling and uses a performance-model driven approach to loop tiling for the generation of out-of-core code. Experimental measurements are provided that show a good match with model-based predictions and demonstrate the effectiveness of the proposed algorithm.
AB - This paper describes an approach to synthesis of efficient out-of-core code for a class of imperfectly nested loops that represent tensor contraction computations. Tensor contraction expressions arise in many accurate computational models of electronic structure. The developed approach combines loop fusion with loop tiling and uses a performance-model driven approach to loop tiling for the generation of out-of-core code. Experimental measurements are provided that show a good match with model-based predictions and demonstrate the effectiveness of the proposed algorithm.
UR - http://www.scopus.com/inward/record.url?scp=21144446087&partnerID=8YFLogxK
U2 - 10.1007/978-3-540-24596-4_44
DO - 10.1007/978-3-540-24596-4_44
M3 - Article
AN - SCOPUS:21144446087
SN - 0302-9743
VL - 2913
SP - 406
EP - 417
JO - Lecture Notes in Computer Science
JF - Lecture Notes in Computer Science
ER -