Improved Accuracy for Automated Communication Pattern Characterization Using Communication Graphs and Aggressive Search Space Pruning

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

An understanding of a parallel application’s communication behavior is useful for a range of activities including debugging and optimization, job scheduling, target system selection, and system design. Because it can be challenging to understand communication behavior, especially for those who lack expertise or who are not familiar with the application, I and two colleagues recently developed an automated, search-based approach for recognizing and parameterizing application communication behavior using a library of common communication patterns. This initial approach was effective for characterizing the behavior of many workloads, but I identified some combinations of communication patterns for which the method was inefficient or would fail. In this paper, I discuss one such troublesome pattern combination and propose modifications to the recognition method to handle it. Specifically, I propose an alternative approach that uses communication graphs instead of traditional communication matrices to improve recognition accuracy for collective communication operations, and that uses a non-greedy recognition technique to avoid search space dead-ends that trap the original greedy recognition approach. My modified approach uses aggressive search space pruning and heuristics to control the potential for state explosion caused by its non-greedy pattern recognition method. I demonstrate the improved recognition accuracy and pruning efficacy of the modified approach using several synthetic and real-world communication pattern combinations.

Original languageEnglish
Title of host publicationProgramming and Performance Visualization Tools - International Workshops, ESPT 2017 and VPA 2017, Revised Selected Papers
EditorsAbhinav Bhatele, David Boehme, Joshua A. Levine, Allen D. Malony, Martin Schulz
PublisherSpringer Verlag
Pages38-55
Number of pages18
ISBN (Print)9783030178710
DOIs
StatePublished - 2019
Event6th Workshop on Extreme-Scale Programming Tools, ESPT 2017 and 4th International Workshop on Visual Performance Analysis, VPA 2017 and Workshop on Extreme-Scale Programming Tools, ESPT 2018 and 5th International Workshop on Visual Performance Analysis, VPA 2018 - Dallas, United States
Duration: Nov 11 2018Nov 16 2018

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume11027 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference6th Workshop on Extreme-Scale Programming Tools, ESPT 2017 and 4th International Workshop on Visual Performance Analysis, VPA 2017 and Workshop on Extreme-Scale Programming Tools, ESPT 2018 and 5th International Workshop on Visual Performance Analysis, VPA 2018
Country/TerritoryUnited States
CityDallas
Period11/11/1811/16/18

Funding

This manuscript has been authored by UT-Battelle, LLC, under contract DE-AC05-00OR22725 with the US Department of Energy (DOE). The US government retains and the publisher, by accepting the article for publication, acknowledges that the US government retains a nonexclusive, paid-up, irrevocable, worldwide license to publish or reproduce the published form of this manuscript, or allow others to do so, for US government purposes. DOE will provide public access to these results of federally sponsored research in accordance with the DOE Public Access Plan (http://energy. gov/downloads/doe-public-access-plan). This research is sponsored by the Office of Advanced Scientific Computing Research in the U.S. Department of Energy. Acknowledgements. This research used resources of the Oak Ridge Leadership Computing Facility at the Oak Ridge National Laboratory, which is supported by the Office of Science of the U.S. Department of Energy under Contract No. DE-AC05-00OR22725.

FundersFunder number
U.S. Department of EnergyDE-AC05-00OR22725
Office of Science
Advanced Scientific Computing Research

    Fingerprint

    Dive into the research topics of 'Improved Accuracy for Automated Communication Pattern Characterization Using Communication Graphs and Aggressive Search Space Pruning'. Together they form a unique fingerprint.

    Cite this