Geo-Text Data and Data-Driven Geospatial Semantics

09/15/2018
by   Yingjie Hu, et al.
0

Many datasets nowadays contain links between geographic locations and natural language texts. These links can be geotags, such as geotagged tweets or geotagged Wikipedia pages, in which location coordinates are explicitly attached to texts. These links can also be place mentions, such as those in news articles, travel blogs, or historical archives, in which texts are implicitly connected to the mentioned places. This kind of data is referred to as geo-text data. The availability of large amounts of geo-text data brings both challenges and opportunities. On the one hand, it is challenging to automatically process this kind of data due to the unstructured texts and the complex spatial footprints of some places. On the other hand, geo-text data offers unique research opportunities through the rich information contained in texts and the special links between texts and geography. As a result, geo-text data facilitates various studies especially those in data-driven geospatial semantics. This paper discusses geo-text data and related concepts. With a focus on data-driven research, this paper systematically reviews a large number of studies that have discovered multiple types of knowledge from geo-text data. Based on the literature review, a generalized workflow is extracted and key challenges for future work are discussed.

READ FULL TEXT

page 8

page 12

research
02/12/2019

WikiLinkGraphs: A complete, longitudinal and multi-language dataset of the Wikipedia link networks

Wikipedia articles contain multiple links connecting a subject to other ...
research
09/17/2022

How can voting mechanisms improve the robustness and generalizability of toponym disambiguation?

A vast amount of geographic information exists in natural language texts...
research
05/11/2018

iLCM - A Virtual Research Infrastructure for Large-Scale Qualitative Data

The iLCM project pursues the development of an integrated research envir...
research
04/03/2022

Pragmatic constraints and pronoun reference disambiguation: the possible and the impossible

Pronoun disambiguation in understanding text and discourse often require...
research
02/04/2020

From Topic Networks to Distributed Cognitive Maps: Zipfian Topic Universes in the Area of Volunteered Geographic Information

Are nearby places (e.g. cities) described by related words? In this arti...
research
12/07/2020

Stylometry for Noisy Medieval Data: Evaluating Paul Meyer's Hagiographic Hypothesis

Stylometric analysis of medieval vernacular texts is still a significant...
research
07/04/2022

Location reference recognition from texts: A survey and comparison

A vast amount of location information exists in unstructured texts, such...

Please sign up or login with your details

Forgot password? Click here to reset