How Many and What Types of SPARQL Queries can be Answered through Zero-Knowledge Link Traversal?

by Pavlos Fafalios, et al.

The current de facto way to query the Web of Data is through the SPARQL protocol, where a client sends queries to a server through a SPARQL endpoint. Unlike a plain HTTP server, a robust and reliable endpoint requires significant effort to provide and maintain, which not all publishers are willing or able to make. An alternative query evaluation method is link traversal, where a query is answered by dereferencing online web resources (URIs) in real time. While several approaches for such lookup-based query evaluation have been proposed, there exists no analysis of the types (patterns) of queries that can be directly answered on the live Web, without accessing local or remote endpoints and without a priori knowledge of available data sources. In this paper, we first provide a method for checking whether a SPARQL query (to be evaluated on a SPARQL endpoint) can be answered through zero-knowledge link traversal (without accessing the endpoint), and analyse a large corpus of real SPARQL query logs to determine the frequency and distribution of answerable and non-answerable query patterns. Subsequently, we provide an algorithm for transforming answerable queries into SPARQL-LD queries that bypass the endpoints. We report experimental results on the efficiency of the transformed queries and discuss the benefits and limitations of this query evaluation method.
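To make the idea of checking answerability more concrete, the following is a minimal Python sketch of one possible reachability test over triple patterns. It is an illustrative simplification, not the method of the paper: the function name `traversal_order`, the tuple encoding of triple patterns, and the rule that a pattern is reachable when its subject or object is a URI or an already-bound variable are all assumptions made here. It performs no network access and ignores whether the dereferenced documents actually contain the needed triples.

```python
from typing import List, Optional, Tuple

# A triple pattern as plain strings: variables start with "?",
# URIs are wrapped in angle brackets, anything else is treated as a literal.
Triple = Tuple[str, str, str]


def is_var(term: str) -> bool:
    return term.startswith("?")


def is_uri(term: str) -> bool:
    return term.startswith("<") and term.endswith(">")


def traversal_order(patterns: List[Triple]) -> Optional[List[Triple]]:
    """Return an evaluation order in which every pattern has a starting point
    (a URI, or a variable bound by an earlier pattern) in its subject or
    object position, or None if no such order exists.

    Hypothetical rule for illustration only; predicates are not dereferenced.
    """
    remaining = list(patterns)
    bound = set()   # variables bound by patterns already placed in the order
    order = []
    while remaining:
        for pattern in remaining:
            s, _, o = pattern
            if any(is_uri(t) or t in bound for t in (s, o)):
                break  # this pattern has a dereferenceable starting point
        else:
            return None  # no remaining pattern can be reached
        remaining.remove(pattern)
        order.append(pattern)
        bound.update(t for t in (s, o) if is_var(t))
    return order


if __name__ == "__main__":
    query = [
        ("?paper", "<http://purl.org/dc/terms/title>", "?title"),
        ("?paper", "<http://purl.org/dc/terms/creator>", "<http://example.org/alice>"),
    ]
    print(traversal_order(query))
```

Under these assumptions, a query whose patterns contain only variables and literals in subject and object positions has no URI to start from, so the sketch reports it as not answerable by returning `None`.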




