Knowledge-Augmented Language Model and its Application to Unsupervised Named-Entity Recognition

04/09/2019
by   Angli Liu, et al.
0

Traditional language models are unable to efficiently model entity names observed in text. All but the most popular named entities appear infrequently in text providing insufficient context. Recent efforts have recognized that context can be generalized between entity names that share the same type (e.g., person or location) and have equipped language models with access to an external knowledge base (KB). Our Knowledge-Augmented Language Model (KALM) continues this line of work by augmenting a traditional model with a KB. Unlike previous methods, however, we train with an end-to-end predictive objective optimizing the perplexity of text. We do not require any additional information such as named entity tags. In addition to improving language modeling performance, KALM learns to recognize named entities in an entirely unsupervised way by using entity type information latent in the model. On a Named Entity Recognition (NER) task, KALM achieves performance comparable with state-of-the-art supervised models. Our work demonstrates that named entities (and possibly other types of world knowledge) can be modeled successfully using predictive learning and training on large corpora of text without any additional information.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/24/2019

Man is to Person as Woman is to Location: Measuring Gender Bias in Named Entity Recognition

We study the bias in several state-of-the-art named entity recognition (...
research
05/13/2018

Building Language Models for Text with Named Entities

Text in many domains involves a significant amount of named entities. Pr...
research
01/25/2022

Distantly supervised end-to-end medical entity extraction from electronic health records with human-level quality

Medical entity extraction (EE) is a standard procedure used as a first s...
research
04/15/2020

Entities as Experts: Sparse Memory Access with Entity Supervision

We focus on the problem of capturing declarative knowledge in the learne...
research
04/16/2022

TASTEset – Recipe Dataset and Food Entities Recognition Benchmark

Food Computing is currently a fast-growing field of research. Natural la...
research
11/26/2018

Scalable graph-based individual named entity identification

Named entity discovery (NED) is an important information retrieval probl...
research
04/27/2021

Named Entity Recognition and Linking Augmented with Large-Scale Structured Data

In this paper we describe our submissions to the 2nd and 3rd SlavNER Sha...

Please sign up or login with your details

Forgot password? Click here to reset