A cooperative approach to virtual machine based fault injection

Thomas Naughton, Christian Engelmann, Geoffroy Vallée, Ferrol Aderholdt, Stephen L. Scott

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Resilience investigations often employ fault injection (FI) tools to study the effects of simulated errors on a target system. It is important to keep the target system under test (SUT) isolated from the controlling environment in order to maintain control of the experiement. Virtual machines (VMs) have been used to aid these investigations due to the strong isolation properties of system-level virtualization. A key challenge in fault injection tools is to gain proper insight and context about the SUT. In VM-based FI tools, this challenge of target context is increased due to the separation between host and guest (VM). We discuss an approach to VM-based FI that leverages virtual machine introspection (VMI) methods to gain insight into the target’s context running within the VM. The key to this environment is the ability to provide basic information to the FI system that can be used to create a map of the target environment. We describe a proof-of-concept implementation and a demonstration of its use to introduce simulated soft errors into an iterative solver benchmark running in user-space of a guest VM.

Original languageEnglish
Title of host publicationEuro-Par 2016
Subtitle of host publicationParallel Processing Workshops - Euro-Par 2016 International Workshops, Revised Selected Papers
EditorsPierre-Francois Dutot, Frederic Desprez
PublisherSpringer Verlag
Pages671-682
Number of pages12
ISBN (Print)9783319589428
DOIs
StatePublished - 2017
Event22nd International Conference on Parallel and Distributed Computing, Euro-Par 2016 - Grenoble, France
Duration: Aug 24 2016Aug 26 2016

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume10104 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference22nd International Conference on Parallel and Distributed Computing, Euro-Par 2016
Country/TerritoryFrance
CityGrenoble
Period08/24/1608/26/16

Funding

T. Naughton—This manuscript has been authored by UT-Battelle, LLC under Contract No.DE-AC05-00OR22725 with the U.S. Department of Energy. The United States Government retains and the publisher, by accepting the article for publication, acknowledges that the United States Government retains a non-exclusive, paid-up, irrevocable, world-wide license to publish or reproduce the published form of this manuscript, or allow others to do so, for United States Government purposes. The Department of Energy will provide public access to these results of federally sponsored research in accordance with the DOE Public Access Plan ( http://energy.gov/downloads/doe-public-access-plan ). This material is based upon work supported by the U.S. Department of Energy, Office of Science, Advanced Scientific Computing Research program.

Keywords

  • Fault injection
  • Resilience tools
  • Virtual machine introspection
  • Virtualization

Fingerprint

Dive into the research topics of 'A cooperative approach to virtual machine based fault injection'. Together they form a unique fingerprint.

Cite this