Abstract
The frequency of hardware errors in HPC systems continues to grow as system designs evolve toward exascale. Tolerating these errors efficiently and effectively will require software-based resilience solutions. With this requirement in mind, recent research has increasingly employed LLVM-based tools to simulate transient hardware faults in order to study the resilience characteristics of specific applications. However, such tools require researchers to configure their experiments at the level of the LLVM intermediate representation (LLVM IR) rather than at the source level of the applications under study. In this paper, we present FITL1 (Fault-Injection Toolkit for LLVM), a set of LLVM extensions to which it is straightforward to translate source-level pragmas that specify fault injection. While we have designed FITL not to be tied to any particular compiler front end or high-level language, we also describe how we have extended our OpenARC compiler to translate a novel set of fault-injection pragmas for C to FITL. Finally, we present several resilience studies we have conducted using FITL, including a comparison with a source-level fault injector we have built as part of OpenARC.
Original language | English |
---|---|
Title of host publication | Proceedings of LLVM-HPC 2015 |
Subtitle of host publication | 2nd Workshop on the LLVM Compiler Infrastructure in HPC - Held in conjunction with SC 2015: The International Conference for High Performance Computing, Networking, Storage and Analysis |
Publisher | Association for Computing Machinery, Inc |
ISBN (Electronic) | 9781450340052 |
DOIs | |
State | Published - Nov 15 2015 |
Event | 2nd Workshop on the LLVM Compiler Infrastructure in HPC, LLVM-HPC 2015 - Austin, United States Duration: Nov 15 2015 → … |
Publication series
Name | Proceedings of LLVM-HPC 2015: 2nd Workshop on the LLVM Compiler Infrastructure in HPC - Held in conjunction with SC 2015: The International Conference for High Performance Computing, Networking, Storage and Analysis |
---|
Conference
Conference | 2nd Workshop on the LLVM Compiler Infrastructure in HPC, LLVM-HPC 2015 |
---|---|
Country/Territory | United States |
City | Austin |
Period | 11/15/15 → … |
Bibliographical note
Publisher Copyright:© 2015 ACM.
Keywords
- C pragma
- Compiler intermediate representation
- Directive-based programming
- Fault injection
- LLVM
- Ope-nARC
- Resiliency