Building and Querying Semantic Layers for Web Archives (Extended Version)

10/24/2018
by   Pavlos Fafalios, et al.
0

Web archiving is the process of collecting portions of the Web to ensure that the information is preserved for future exploitation. However, despite the increasing number of web archives worldwide, the absence of efficient and meaningful exploration methods still remains a major hurdle in the way of turning them into a usable and useful information source. In this paper, we focus on this problem and propose an RDF/S model and a distributed framework for building semantic profiles ("layers") that describe semantic information about the contents of web archives. A semantic layer allows describing metadata information about the archived documents, annotating them with useful semantic information (like entities, concepts and events), and publishing all this data on the Web as Linked Data. Such structured repositories offer advanced query and integration capabilities, and make web archives directly exploitable by other systems and tools. To demonstrate their query capabilities, we build and query semantic layers for three different types of web archives. An experimental evaluation showed that a semantic layer can answer information needs that existing keyword-based systems are not able to sufficiently satisfy.

READ FULL TEXT
research
10/23/2018

Ranking Archived Documents for Structured Queries on Semantic Layers

Archived collections of documents (like newspaper and web archives) serv...
research
03/05/2016

A Linked Data Scalability Challenge: Concept Reuse Leads to Semantic Decay

The increasing amount of available Linked Data resources is laying the f...
research
10/23/2018

Towards a Ranking Model for Semantic Layers over Digital Archives

Archived collections of documents (like newspaper archives) serve as imp...
research
12/17/2013

Semantic Annotation: The Mainstay of Semantic Web

Given that semantic Web realization is based on the critical mass of met...
research
11/05/2017

Semantic Web Today: From Oil Rigs to Panama Papers

The next leap on the internet has already started as Semantic Web. At it...
research
02/19/2016

Ordonnancement d'entités pour la rencontre du web des documents et du web des données

The advances of the Linked Open Data (LOD) initiative are giving rise to...
research
06/12/2020

High-Level ETL for Semantic Data Warehouses—Full Version

The popularity of the Semantic Web (SW) encourages organizations to orga...

Please sign up or login with your details

Forgot password? Click here to reset