TY - GEN
T1 - An Efficient and Scalable Algorithmic Method for Generating Large-Scale Random Graphs
AU - Alam, Maksudul
AU - Khan, Maleq
AU - Vullikanti, Anil
AU - Marathe, Madhav
N1 - Publisher Copyright:
© 2016 IEEE.
PY - 2016/7/2
Y1 - 2016/7/2
N2 - Many real-world systems and networks are modeled and analyzed using various random graph models. These models must incorporate relevant properties such as degree distribution and clustering coefficient. Many models, such as the Chung-Lu (CL), stochastic Kronecker, stochastic block model (SBM), and block two-level Erds-Rényi (BTER) models have been devised to capture those properties. However, the generative algorithms for these models are mostly sequential and take prohibitively long time to generate large-scale graphs. In this paper, we present a novel time and space efficient algorithmic method to generate random graphs using CL, BTER, and SBM models. First, we present an efficient sequential algorithm and an efficient distributed-memory parallel algorithm for the CL model. Our sequential algorithm takes O(m) time and O(Λ) space, where m and Λ are the number of edges and distinct degrees, and our parallel algorithm takes O (m/p + Λ + P) time w.h.p. and O(Λ) space using P processors. These algorithms are almost time optimal since any sequential and parallel algorithms need at least Ω(m) and Ω(m/p) time, respectively. Our algorithms outperform the best known previous algorithms by a significant margin in terms of both time and space. Experimental results on various large-scale networks show that both of our sequential and parallel algorithms require 400-15000 times less memory than the existing sequential and parallel algorithms, respectively, making our algorithms suitable for generating very large-scale networks. Moreover, both of our algorithms are about 3-4 times faster than the existing sequential and parallel algorithms. Finally, we show how our algorithmic method also leads to efficient parallel and sequential algorithms for the SBM and BTER models.
AB - Many real-world systems and networks are modeled and analyzed using various random graph models. These models must incorporate relevant properties such as degree distribution and clustering coefficient. Many models, such as the Chung-Lu (CL), stochastic Kronecker, stochastic block model (SBM), and block two-level Erds-Rényi (BTER) models have been devised to capture those properties. However, the generative algorithms for these models are mostly sequential and take prohibitively long time to generate large-scale graphs. In this paper, we present a novel time and space efficient algorithmic method to generate random graphs using CL, BTER, and SBM models. First, we present an efficient sequential algorithm and an efficient distributed-memory parallel algorithm for the CL model. Our sequential algorithm takes O(m) time and O(Λ) space, where m and Λ are the number of edges and distinct degrees, and our parallel algorithm takes O (m/p + Λ + P) time w.h.p. and O(Λ) space using P processors. These algorithms are almost time optimal since any sequential and parallel algorithms need at least Ω(m) and Ω(m/p) time, respectively. Our algorithms outperform the best known previous algorithms by a significant margin in terms of both time and space. Experimental results on various large-scale networks show that both of our sequential and parallel algorithms require 400-15000 times less memory than the existing sequential and parallel algorithms, respectively, making our algorithms suitable for generating very large-scale networks. Moreover, both of our algorithms are about 3-4 times faster than the existing sequential and parallel algorithms. Finally, we show how our algorithmic method also leads to efficient parallel and sequential algorithms for the SBM and BTER models.
KW - Distributed computing
KW - Network theory
KW - Parallel programming
KW - Random graphs
UR - http://www.scopus.com/inward/record.url?scp=85017215570&partnerID=8YFLogxK
U2 - 10.1109/SC.2016.31
DO - 10.1109/SC.2016.31
M3 - Conference contribution
AN - SCOPUS:85017215570
T3 - International Conference for High Performance Computing, Networking, Storage and Analysis, SC
SP - 372
EP - 383
BT - Proceedings of SC 2016
PB - IEEE Computer Society
T2 - 2016 International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2016
Y2 - 13 November 2016 through 18 November 2016
ER -