Abstract
Community detection is a fundamental operation in graph mining, and by uncovering hidden structures and patterns within complex systems it helps solve fundamental problems pertaining to social networks, such as information diffusion, epidemics, and recommender systems. Scaling graph algorithms for massive networks becomes challenging on modern distributed-memory multi-GPU (Graphics Processing Unit) systems due to limitations such as irregular memory access patterns, load imbalances, higher communication-computation ratios, and cross-platform support. We present a novel algorithm HiPDPL-GPU (Distributed Parallel Louvain) to address these challenges. We conduct experiments involving different partitioning techniques to achieve an optimized performance of HiPDPL-GPU on the two largest supercomputers: Frontier and Summit. Remarkably, HiPDPL-GPU processes a graph with 4.2 billion edges in less than 3 minutes using 1024 GPUs. Qualitatively, the performance of HiPDPL-GPU is similar or better compared to other state-of-the-art CPU- and GPU-based implementations. While prior GPU implementations have predominantly employed CUDA, our first-of-its-kind implementation for community detection is cross-platform, accommodating both AMD and NVIDIA GPUs.
| Original language | English |
|---|---|
| Title of host publication | 2024 IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2024 |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 815-824 |
| Number of pages | 10 |
| ISBN (Electronic) | 9798350364606 |
| DOIs | |
| State | Published - 2024 |
| Event | 2024 IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2024 - San Francisco, United States Duration: May 27 2024 → May 31 2024 |
Publication series
| Name | 2024 IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2024 |
|---|
Conference
| Conference | 2024 IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2024 |
|---|---|
| Country/Territory | United States |
| City | San Francisco |
| Period | 05/27/24 → 05/31/24 |
Funding
This work has been supported by UT-Battelle, LLC under Contract No. DEAC05-00OR22725 with the U.S. Department of Energy (DOE) and DOE s Exascale Computing Project s (ECP) (17-SC-20-SC) ExaGraph codesign center at the Pacific Northwest National Laboratory. We used resources of the Oak Ridge Leadership Computing Facility located in the National Center for Computational Sciences at ORNL, which is managed by UT Battelle, LLC for the U.S. DOE (under the contract No. DE-AC05-00OR22725).
Keywords
- HIP
- Louvain
- MPI
- clustering
- community detection
- hybrid
- multi-GPU