Graph Processing Platforms at Scale: Practices and Experiences

Seung Hwan Lim, Sangkeun Lee, Gautam Ganesh, Tyler C. Brown, Sreenivas R. Sukumar

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

11 Scopus citations

Abstract

Graph analysis has revealed patterns and relationships hidden in data from a variety of domains such as transportation networks, social networks, clinical pathways, and collaboration networks. As these networks grow in size, variety and complexity, it is a challenge to find the right combination of tools and implementation of algorithms to discover new insights from the data. Addressing this challenge, our study presents an extensive empirical evaluation of three representative graph processing platforms: Pegasus, GraphX, and Urika. Each system represents a combination of options in data model, processing paradigm, and infrastructure. We benchmark each platform using three popular graph mining operations, degree distribution, connected components, and PageRank over real-world graphs. Our experiments show that each graph processing platform owns a particular strength for different types of graph operations. While Urika performs the best in non-iterative graph operations like degree distribution, GraphX outperforms iterative operations like connected components and PageRank. We conclude this paper by discussing options to optimize the performance of a graph-theoretic operation on each platform for large-scale real world graphs.

Original languageEnglish
Title of host publicationISPASS 2015 - IEEE International Symposium on Performance Analysis of Systems and Software
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages42-51
Number of pages10
ISBN (Electronic)9781479919567
DOIs
StatePublished - Apr 27 2015
Event2015 15th IEEE International Symposium on Performance Analysis of Systems and Software, ISPASS 2015 - Philidelphia, United States
Duration: Mar 29 2015Mar 31 2015

Publication series

NameISPASS 2015 - IEEE International Symposium on Performance Analysis of Systems and Software

Conference

Conference2015 15th IEEE International Symposium on Performance Analysis of Systems and Software, ISPASS 2015
Country/TerritoryUnited States
CityPhilidelphia
Period03/29/1503/31/15

Fingerprint

Dive into the research topics of 'Graph Processing Platforms at Scale: Practices and Experiences'. Together they form a unique fingerprint.

Cite this