Abstract
We consider high performance computing infrastructures consisting of multiple sites connected over a wide-area network. These sites house heterogeneous computing systems, network elements and local-area connections, and the wide-area network plays a critical, asymmetric role of providing the vital connectivity between them. We model this infrastructure as a recursive system of systems, wherein the basic systems at the finest levels are composed of discrete cyber and physical components. These components may be disabled by cyber and physical disruptions, and their reinforcements protect against them albeit at a cost. We utilize results from a game formulation based on (i) characterizing the disruptions between systems at each level of recursion using aggregate failure correlation functions, and within basic systems using the multiplier functions, and (ii) using utility functions expressed in terms of the number of basic system components attacked and reinforced. At Nash Equilibrium, we derive expressions for the expected capacity of the infrastructure given by the number of computing nodes that are operational and connected to the network.
Original language | English |
---|---|
Title of host publication | ICDCN 2019 - Proceedings of the 2019 International Conference on Distributed Computing and Networking |
Publisher | Association for Computing Machinery |
Pages | 143-147 |
Number of pages | 5 |
ISBN (Electronic) | 9781450360944 |
DOIs | |
State | Published - Jan 4 2019 |
Event | 20th International Conference on Distributed Computing and Networking, ICDCN 2019 - Bangalore, India Duration: Jan 4 2019 → Jan 7 2019 |
Publication series
Name | ACM International Conference Proceeding Series |
---|
Conference
Conference | 20th International Conference on Distributed Computing and Networking, ICDCN 2019 |
---|---|
Country/Territory | India |
City | Bangalore |
Period | 01/4/19 → 01/7/19 |
Funding
This work is funded by the Mathematics of Complex, Distributed, Interconnected Systems Program and RAMSES project, Office of Advanced Computing Research, U.S. Department of Energy, and by Extreme Scale Systems Center, sponsored by U. S. Department of Defense, and performed at Oak Ridge National Laboratory managed by UT-Battelle, LLC for U.S. Department of Energy under Contract No. DE-AC05-00OR22725.
Keywords
- Aggregated correlation function
- Composite utility functions
- Game theory
- High performance computing infrastructure
- Nash Equilibrium
- Networked systems