Accelerating Partial Evaluation in Distributed SPARQL Query Evaluation

02/11/2019
by   Peng Peng, et al.
0

Partial evaluation has recently been used for processing SPARQL queries over a large resource description framework (RDF) graph in a distributed environment. However, the previous approach is inefficient when dealing with complex queries. In this study, we further improve the "partial evaluation and assembly" framework for answering SPARQL queries over a distributed RDF graph, while providing performance guarantees. Our key idea is to explore the intrinsic structural characteristics of partial matches to filter out irrelevant partial results, while providing performance guarantees on a network trace (data shipment) or the computational cost (response time). We also propose an efficient assembly algorithm to utilize the characteristics of partial matches to merge them and form final results. To improve the efficiency of finding partial matches further, we propose an optimization that communicates variables' candidates among sites to avoid redundant computations. In addition, although our approach is partitioning-tolerant, different partitioning strategies result in different performances, and we evaluate different partitioning strategies for our approach. Experiments over both real and synthetic RDF datasets confirm the superiority of our approach.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/22/2021

Load Balanced Semantic Aware Distributed RDF Graph

The modern day semantic applications store data as Resource Description ...
research
09/09/2022

Machine Learning-based Selection of Graph Partitioning Strategy Using the Characteristics of Graph Data and Algorithm

Analyzing large graph data is an essential part of many modern applicati...
research
03/28/2022

WawPart: Workload-Aware Partitioning of Knowledge Graphs

Large-scale datasets in the form of knowledge graphs are often used in n...
research
09/12/2022

Efficient query evaluation techniques over large amount of distributed linked data

As RDF becomes more widely established and the amount of linked data is ...
research
11/24/2021

Distributed Evaluation of Graph Queries using Recursive Relational Algebra

We present a system called Dist-μ-RA for the distributed evaluation of r...
research
02/11/2020

pSPICE: Partial Match Shedding for Complex Event Processing

Complex event processing (CEP) systems continuously process input event ...
research
04/23/2022

BatchHL: Answering Distance Queries on Batch-Dynamic Networks at Scale

Many real-world applications operate on dynamic graphs that undergo rapi...

Please sign up or login with your details

Forgot password? Click here to reset