Abstract
High productivity to the end user is critical in harnessing the power of high performance computing systems to solve science and engineering problems. It is a challenge to bridge the gap between the hardware complexity and the software limitations. Despite significant progress in language, compiler, and performance tools, tuning an application remains largely a manual task, and is done mostly by experts. In this paper we propose a holistic approach towards automated performance analysis and tuning that we expect to greatly improve the productivity of performance debugging. Our approach seeks to build a framework that facilitates the combination of expert knowledge, compiler techniques, and performance research for performance diagnosis and solution discovery. With our framework, once a diagnosis and tuning strategy has been developed, it can be stored in an open and extensible database and thus be reused in the future. We demonstrate the effectiveness of our approach through the automated performance analysis and tuning of two scientific applications. We show that the tuning process is highly automated, and the performance improvement is significant.
Original language | English |
---|---|
Title of host publication | Euro-Par 2009 Parallel Processing - 15th International Euro-Par Conference, Proceedings |
Pages | 33-44 |
Number of pages | 12 |
DOIs | |
State | Published - 2009 |
Externally published | Yes |
Event | Euro-Par 2009 Parallel Processing - 15th International Euro-Par Conference, Proceedings - Delft, Netherlands Duration: Aug 25 2009 → Aug 28 2009 |
Publication series
Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|
Volume | 5704 LNCS |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | Euro-Par 2009 Parallel Processing - 15th International Euro-Par Conference, Proceedings |
---|---|
Country/Territory | Netherlands |
City | Delft |
Period | 08/25/09 → 08/28/09 |
Funding
This material is based upon work supported by the Defense Advanced Research Projects Agency under its Agreement No. HR0011-07-9-0002.