Semi-supervised Bootstrapping approach for Named Entity Recognition

11/21/2015
by   S. Thenmalar, et al.
0

The aim of Named Entity Recognition (NER) is to identify references of named entities in unstructured documents, and to classify them into pre-defined semantic categories. NER often aids from added background knowledge in the form of gazetteers. However using such a collection does not deal with name variants and cannot resolve ambiguities associated in identifying the entities in context and associating them with predefined categories. We present a semi-supervised NER approach that starts with identifying named entities with a small set of training data. Using the identified named entities, the word and the context features are used to define the pattern. This pattern of each named entity category is used as a seed pattern to identify the named entities in the test set. Pattern scoring and tuple value score enables the generation of the new patterns to identify the named entity categories. We have evaluated the proposed system for English language with the dataset of tagged (IEER) and untagged (CoNLL 2003) named entity corpus and for Tamil language with the documents from the FIRE corpus and yield an average f-measure of 75 the languages.

READ FULL TEXT
research
06/12/2018

Named Entity Recognition with Extremely Limited Data

Traditional information retrieval treats named entity recognition as a p...
research
12/15/2021

Named entity recognition architecture combining contextual and global features

Named entity recognition (NER) is an information extraction technique th...
research
07/09/2018

Constructing a Word Similarity Graph from Vector based Word Representation for Named Entity Recognition

In this paper, we discuss a method for identifying a seed word that woul...
research
04/25/2019

Terminologies augmented recurrent neural network model for clinical named entity recognition

We aimed to enhance the performance of a supervised model for clinical n...
research
07/01/2020

Improving NER for Clinical Texts by Ensemble Approach using Segment Representations

Clinical Named Entity Recognition (Clinical-NER), which aims at identify...
research
10/22/2018

Named Entity Disambiguation using Deep Learning on Graphs

We tackle NED by comparing entities in short sentences with graphs. Cre...
research
02/18/2019

"The Michael Jordan of Greatness": Extracting Vossian Antonomasia from Two Decades of the New York Times, 1987-2007

Vossian Antonomasia is a prolific stylistic device, in use since antiqui...

Please sign up or login with your details

Forgot password? Click here to reset