Abstract
The Resource Description Framework (RDF) and SPARQL Protocol and RDF Query Language (SPARQL) were introduced about a decade ago to enable flexible schema-free data interchange on the Semantic Web. Today, data scientists use the framework as a scalable graph representation for integrating, querying, exploring and analyzing data sets hosted at different sources. With increasing adoption, the need for graph mining capabilities for the Semantic Web has emerged. We address that need through implementation of three popular iterative Graph Mining algorithms (Triangle count, Connected component analysis, and PageRank). We implement these algorithms as SPARQL queries, wrapped within Python scripts. We evaluate the performance of our implementation on 6 real world data sets and show graph mining algorithms (that have a linear-Algebra formulation) can indeed be unleashed on data represented as RDF graphs using the SPARQL query interface.
Original language | English |
---|---|
Title of host publication | ICDEW 2015 - 2015 IEEE 31st International Conference on Data Engineering Workshops |
Publisher | IEEE Computer Society |
Pages | 53-58 |
Number of pages | 6 |
ISBN (Electronic) | 9781479984411 |
DOIs | |
State | Published - Jun 19 2015 |
Event | 2015 31st IEEE International Conference on Data Engineering Workshops, ICDEW 2015 - Seoul, Korea, Republic of Duration: Apr 13 2015 → Apr 17 2015 |
Publication series
Name | Proceedings - International Conference on Data Engineering |
---|---|
Volume | 2015-June |
ISSN (Print) | 1084-4627 |
Conference
Conference | 2015 31st IEEE International Conference on Data Engineering Workshops, ICDEW 2015 |
---|---|
Country/Territory | Korea, Republic of |
City | Seoul |
Period | 04/13/15 → 04/17/15 |
Bibliographical note
Publisher Copyright:© 2015 IEEE.