Ranking Archived Documents for Structured Queries on Semantic Layers

10/23/2018
by Pavlos Fafalios, et al.

Archived collections of documents (like newspaper and web archives) serve as important information sources in a variety of disciplines, including Digital Humanities, Historical Science, and Journalism. However, the absence of efficient and meaningful exploration methods remains a major hurdle to turning them into usable sources of information. A semantic layer is an RDF graph that describes metadata and semantic information about a collection of archived documents, and that can in turn be queried through a semantic query language (SPARQL). This allows running advanced queries that combine metadata of the documents (like publication date) with content-based semantic information (like entities mentioned in the documents). However, the results returned by such structured queries can be numerous, and they all match the query equally well, so the query itself gives no basis for ordering them. In this paper, we address this problem and formalize the task of "ranking archived documents for structured queries on semantic layers". We then propose two ranking models for the problem, which jointly consider: i) the relativeness of documents to entities, ii) the timeliness of documents, and iii) the temporal relations among the entities. The experimental results on a new evaluation dataset show the effectiveness of the proposed models and allow us to understand their limitations.
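
To make the ranking problem concrete, the following is a minimal Python sketch, not the models proposed in the paper. It assumes a toy in-memory collection instead of an RDF graph and SPARQL endpoint. The names `Doc`, `matches`, `rank`, and `DOCS`, the sample data, and both scoring formulas are hypothetical choices made for illustration only. The sketch shows that a structured query (an entity set plus a date range) returns an unordered set of equally matching documents, and that two of the signals named in the abstract, how strongly a document is about the query entities and how timely its publication date is, could be combined to order them. The third signal, temporal relations among entities, is left out for brevity.

```python
from collections import Counter
from dataclasses import dataclass
from datetime import date


@dataclass
class Doc:
    doc_id: str
    published: date
    entity_mentions: Counter  # entity name -> number of mentions in the document


def matches(doc, entities, start, end):
    """Boolean match, like a structured query: date in range, all entities mentioned."""
    in_range = start <= doc.published <= end
    return in_range and all(doc.entity_mentions[e] > 0 for e in entities)


def rank(docs, entities, start, end):
    """Order the matching documents by a toy score (an assumption, not the paper's model)."""
    hits = [d for d in docs if matches(d, entities, start, end)]
    if not hits:
        return []

    # Toy timeliness: days on which many matching documents appeared count as more important.
    per_day = Counter(d.published for d in hits)
    busiest = max(per_day.values())

    def score(doc):
        total = sum(doc.entity_mentions.values())
        # Toy entity signal: share of all entity mentions that refer to the query entities.
        entity_share = sum(doc.entity_mentions[e] for e in entities) / total
        timeliness = per_day[doc.published] / busiest
        return entity_share * timeliness

    return sorted(hits, key=score, reverse=True)


# Hypothetical sample collection.
DOCS = [
    Doc("d1", date(2010, 5, 1), Counter({"Barack Obama": 5, "Angela Merkel": 1})),
    Doc("d2", date(2010, 5, 1), Counter({"Barack Obama": 1, "NATO": 9})),
    Doc("d3", date(2010, 6, 15), Counter({"Barack Obama": 3})),
    Doc("d4", date(2012, 1, 1), Counter({"Barack Obama": 4})),  # outside the date range
    Doc("d5", date(2010, 5, 2), Counter({"NATO": 2})),          # query entity not mentioned
]

if __name__ == "__main__":
    ranked = rank(DOCS, ["Barack Obama"], date(2010, 1, 1), date(2010, 12, 31))
    print([d.doc_id for d in ranked])
```

In this sketch the two signals are simply multiplied; how the signals should be weighted and combined is exactly the kind of design decision a ranking model has to make, and the choice here is arbitrary.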
