Abstract
In complex phenomena such as epidemiological outbreaks, the intensity of inherent feedback effects and the significant role of transients in the dynamics make simulation the only effective method for proactive, reactive or post facto analysis. The spatial scale, runtime speed, and behavioral detail needed in detailed simulations of epidemic outbreaks cannot be supported by sequential or small-scale parallel execution, making it necessary to use large-scale parallel processing. Here, an optimistic parallel execution of a new discrete event formulation of a reaction–diffusion simulation model of epidemic propagation is presented to facilitate a dramatic increase in the fidelity and speed by which epidemiological simulations can be performed. Rollback support needed during optimistic parallel execution is achieved by combining reverse computation with a small amount of incremental state saving. Parallel speedup of over 5,500 and other runtime performance metrics of the system are observed with weak-scaling execution on a small (8,192-core) Blue Gene/P system, while scalability with a weak-scaling speedup of over 10,000 is demonstrated on 65,536 cores of a large Cray XT5 system. Scenarios representing large population sizes, with mobility and detailed state evolution modeled at the level of each individual, exceeding several hundreds of millions of individuals in the largest cases, are successfully exercised to verify model scalability.
Original language | English |
---|---|
Pages (from-to) | 768-783 |
Number of pages | 16 |
Journal | SIMULATION |
Volume | 88 |
Issue number | 7 |
DOIs | |
State | Published - Jul 2012 |
Funding
This paper has been authored by UT-Battelle, LLC, under contract DE-AC05-00OR22725 with the U.S. Department of Energy. Accordingly, the United States Government retains and the publisher, by accepting the article for publication, acknowledges that the United States Government retains a non-exclusive, paid-up, irrevocable, world-wide license to publish or reproduce the published form of this manuscript, or allow others to do so, for United States Government purposes. This effort has been partly supported by the Laboratory Directed Research and Development Program of Oak Ridge National Laboratory (ORNL), and in part by the DOE Office of Science, Advanced Scientific Computing Research, Career Research Program. This research utilized resources of the National Center for Computational Sciences at ORNL.
Funders | Funder number |
---|---|
U.S. Department of Energy | |
Office of Science | |
Advanced Scientific Computing Research | |
Oak Ridge National Laboratory |
Keywords
- discrete event
- epidemiology
- high performance computing
- reverse computation