Abstract
Resilience investigations often employ fault injection (FI) tools to study the effects of simulated errors on a target system. It is important to keep the target system under test (SUT) isolated from the controlling environment in order to maintain control of the experiement. Virtual machines (VMs) have been used to aid these investigations due to the strong isolation properties of system-level virtualization. A key challenge in fault injection tools is to gain proper insight and context about the SUT. In VM-based FI tools, this challenge of target context is increased due to the separation between host and guest (VM). We discuss an approach to VM-based FI that leverages virtual machine introspection (VMI) methods to gain insight into the target’s context running within the VM. The key to this environment is the ability to provide basic information to the FI system that can be used to create a map of the target environment. We describe a proof-of-concept implementation and a demonstration of its use to introduce simulated soft errors into an iterative solver benchmark running in user-space of a guest VM.
Original language | English |
---|---|
Title of host publication | Euro-Par 2016 |
Subtitle of host publication | Parallel Processing Workshops - Euro-Par 2016 International Workshops, Revised Selected Papers |
Editors | Pierre-Francois Dutot, Frederic Desprez |
Publisher | Springer Verlag |
Pages | 671-682 |
Number of pages | 12 |
ISBN (Print) | 9783319589428 |
DOIs | |
State | Published - 2017 |
Event | 22nd International Conference on Parallel and Distributed Computing, Euro-Par 2016 - Grenoble, France Duration: Aug 24 2016 → Aug 26 2016 |
Publication series
Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|
Volume | 10104 LNCS |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | 22nd International Conference on Parallel and Distributed Computing, Euro-Par 2016 |
---|---|
Country/Territory | France |
City | Grenoble |
Period | 08/24/16 → 08/26/16 |
Funding
T. Naughton—This manuscript has been authored by UT-Battelle, LLC under Contract No.DE-AC05-00OR22725 with the U.S. Department of Energy. The United States Government retains and the publisher, by accepting the article for publication, acknowledges that the United States Government retains a non-exclusive, paid-up, irrevocable, world-wide license to publish or reproduce the published form of this manuscript, or allow others to do so, for United States Government purposes. The Department of Energy will provide public access to these results of federally sponsored research in accordance with the DOE Public Access Plan ( http://energy.gov/downloads/doe-public-access-plan ). This material is based upon work supported by the U.S. Department of Energy, Office of Science, Advanced Scientific Computing Research program.
Keywords
- Fault injection
- Resilience tools
- Virtual machine introspection
- Virtualization