Efficient Distributed SPARQL Queries on Apache Spark

Saleh Albahli

doi:10.14569/ijacsa.2019.0100874

Back

Efficient Distributed SPARQL Queries on Apache Spark

Journal article

Open access

Efficient Distributed SPARQL Queries on Apache Spark

Saleh Albahli

International journal of advanced computer science & applications, Vol.10(8), pp.564-568

2019

DOI: https://doi.org/10.14569/ijacsa.2019.0100874

Abstract

Computer Science

Computer Science, Theory & Methods

Science & Technology

Technology

RDF is a widely-accepted framework for describing metadata in the web due to its simplicity and universal graph-like data model. Owing to the abundance of RDF data, existing query techniques are rendered unsuitable. To this direction, we adopt the processing power of Apache Spark to load and query a large dataset much more quickly than classical approaches. In this paper, we have designed experiments to evaluate the performance of several queries ranging from single attribute selection to selection, filtering and sorting multiple attributes in the dataset. We further experimented with the performance of queries using distributed SPARQL query on Apache Spark GraphX and studied different stages involved in this pipeline. The execution of distributed SPARQL query on Apache Spark GraphX helped us study its performance and gave insights into which stages of the pipeline can be improved. The query pipeline comprised of Graph loading, Basic Graph Pattern and Result calculating. Our goal is to minimize the time during graph loading stage in order to improve overall performance and cut the costs of data loading.

Files and links (1)

url

https://doi.org/10.14569/ijacsa.2019.0100874View

Published (Version of record) Open

Metrics

1 Record Views

Details

Title: Efficient Distributed SPARQL Queries on Apache Spark
Creators - without role: Saleh Albahli - Qassim University
Publication Details: International journal of advanced computer science & applications, Vol.10(8), pp.564-568
Publisher: Science & Information Sai Organization Ltd
Number of pages: 5
Identifiers: 9928643808331
Academic Unit: Qassim University
Language: English
Resource Type: Journal article