Reddit Entity Linking Dataset

01/04/2021
by Nicholas Botzer, et al.

We introduce and make publicly available an entity linking dataset from Reddit that contains 17,316 linked entities, each annotated by three human annotators and then grouped into Gold, Silver, and Bronze to indicate inter-annotator agreement. We analyze the different errors and disagreements made by annotators and suggest three types of corrections to the raw data. Finally, we test existing entity linking models that were trained and tuned on text from non-social media datasets. We find that, although these existing entity linking models perform very well on their original datasets, they perform poorly on this social media dataset. We also show that the majority of these errors can be attributed to poor performance on the mention detection subtask. These results indicate the need for better entity linking models that can be applied to the enormous amount of social media text.
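
The agreement grouping described above can be illustrated with a small sketch. The abstract does not state the tier thresholds, so the rule used here (Gold when all three annotators agree, Silver when two agree, Bronze otherwise) is an assumption, and the function name and entity labels are hypothetical.

```python
from collections import Counter
from typing import Optional, Sequence, Tuple


def agreement_tier(labels: Sequence[str]) -> Tuple[str, Optional[str]]:
    """Return (tier, majority_label) for one mention with three annotations.

    ASSUMPTION: the thresholds below are illustrative only; the abstract
    says the tiers indicate inter-annotator agreement but does not define them.
    """
    if len(labels) != 3:
        raise ValueError("expected exactly three annotations")
    # Most frequent label and how many annotators chose it.
    label, votes = Counter(labels).most_common(1)[0]
    if votes == 3:
        return "Gold", label      # unanimous agreement
    if votes == 2:
        return "Silver", label    # two of three agree
    return "Bronze", None         # no two annotators agree


# Hypothetical entity labels for a single mention.
print(agreement_tier(["Apple_Inc.", "Apple_Inc.", "Apple_Inc."]))
print(agreement_tier(["Apple_Inc.", "Apple_Inc.", "Apple"]))
print(agreement_tier(["Apple_Inc.", "Apple", "Apple_Records"]))
```

Returning the majority label alongside the tier keeps the disagreement cases visible, which matters when deciding whether to evaluate a model on all tiers or only on the most reliable one.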

