Abstract
This paper presents an application-level non-blocking multicast scheme for dynamic DAG scheduling on large-scale distributed-memory systems. To achieve scalability, the multicast scheme takes into account both the network topology and the space requirements of the routing tables. We prove that the scheme is deadlock-free and takes at most log N steps to complete. Each routing table selects which neighbors to store based on topology IDs and requires only O(log N) space. Although the scheme is built upon MPI point-to-point operations, experimental results show that it performs significantly better than the simple flat-tree method and is comparable to the vendor's collective MPI operations.
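To illustrate why a tree-based multicast can finish in about log N steps while a flat tree needs N - 1, the following minimal sketch builds the send schedule for a generic binomial tree. This is not the scheme from the paper: it ignores topology IDs, routing tables, and non-blocking progress, and the function names `binomial_multicast_schedule` and `flat_tree_schedule` are hypothetical names chosen for this example. It assumes that ranks are numbered 0 to N - 1, that rank 0 is the root, and that each process sends at most one message per step.

```python
def binomial_multicast_schedule(n):
    """Return the rounds of a binomial-tree multicast rooted at rank 0.

    Each round is a list of (sender, receiver) pairs. In every round,
    each rank that already holds the message forwards it to one new rank,
    so the number of holders doubles until all n ranks are covered.
    """
    rounds = []
    have = 1  # ranks 0 .. have-1 currently hold the message
    while have < n:
        pairs = [(s, s + have) for s in range(have) if s + have < n]
        rounds.append(pairs)
        have *= 2
    return rounds


def flat_tree_schedule(n):
    """Return the rounds of a flat-tree multicast: the root sends to
    every other rank itself, one message per round."""
    return [[(0, r)] for r in range(1, n)]


if __name__ == "__main__":
    n = 8
    for step, pairs in enumerate(binomial_multicast_schedule(n), start=1):
        print(f"step {step}: {pairs}")
    print("binomial steps:", len(binomial_multicast_schedule(n)))
    print("flat-tree steps:", len(flat_tree_schedule(n)))
```

For N = 8 this sketch yields three rounds for the binomial tree and seven for the flat tree. In an MPI program, each (sender, receiver) pair would correspond to one point-to-point send and its matching receive.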
Original language | English |
---|---|
Title of host publication | Computational Science - ICCS 2009 - 9th International Conference, Proceedings |
Pages | 195-204 |
Number of pages | 10 |
Edition | PART 1 |
State | Published - 2009 |
Event | 9th International Conference on Computational Science, ICCS 2009, Baton Rouge, LA, United States. Duration: May 25, 2009 → May 27, 2009 |
Publication series
Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|
Number | PART 1 |
Volume | 5544 LNCS |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | 9th International Conference on Computational Science, ICCS 2009 |
---|---|
Country/Territory | United States |
City | Baton Rouge, LA |
Period | 05/25/09 → 05/27/09 |
Funding
This material is based upon work supported by the Department of Energy Office of Science under grant No. DE-FC02-06ER25761 and by Microsoft Research.