Abstract
Jaccard weights are a popular metric for identifying communities in social network analytics. In this paper we present a kernel for efficiently computing the Jaccard weight matrix on G PU s. The kernel design is guided by fine-grained parallelism and the independent thread scheduling supported by NVIDIA's Volta architecture. This technology makes it possible to interleave the execution of divergent branches for enhanced data reuse and a higher instruction per cycle rate for memory-bound algorithms. In a performance evaluation using a set of publicly available social networks, we report the kernel execution time and analyze the built-in hardware counters on different GPU architectures. The findings have implications beyond the specific algorithm and suggest a reformulation of other data-sparse algorithms.
Original language | English |
---|---|
Title of host publication | Proceedings - 2018 30th International Symposium on Computer Architecture and High Performance Computing, SBAC-PAD 2018 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 229-232 |
Number of pages | 4 |
ISBN (Electronic) | 9781538677698 |
DOIs | |
State | Published - Jul 2 2018 |
Event | 30th International Symposium on Computer Architecture and High Performance Computing, SBAC-PAD 2018 - Lyon, France Duration: Sep 24 2018 → Sep 27 2018 |
Publication series
Name | Proceedings - 2018 30th International Symposium on Computer Architecture and High Performance Computing, SBAC-PAD 2018 |
---|
Conference
Conference | 30th International Symposium on Computer Architecture and High Performance Computing, SBAC-PAD 2018 |
---|---|
Country/Territory | France |
City | Lyon |
Period | 09/24/18 → 09/27/18 |
Funding
ACKNOWLEDGMENT This material is based upon work supported by the U.S. Department of Energy Office of Science, Office of Advanced Scientific Computing Research, Applied Mathematics program under Award Number DE-SC0016513. H. Anzt was supported by the “Impuls und Vernetzungsfondof the Helmholtz Association” under grant VH-NG-1241. The authors would like to thank the High Performance Computing & Architectures (HPCA) group at the University of Jaume for granting access to the TITAN X GPU.