That's sick dude!: Automatic identification of word sense change across different timescales

by Sunny Mitra, et al.

In this paper, we propose an unsupervised method to identify noun sense changes based on rigorous analysis of time-varying text data available in the form of millions of digitized books. We construct distributional-thesaurus-based networks from data at different time points and cluster each of them separately to obtain word-centric sense clusters corresponding to the different time points. Subsequently, we compare the sense clusters of two different time points to determine whether (i) a new sense has been born, (ii) an older sense has split into more than one sense, (iii) a newer sense has formed from the joining of older senses, or (iv) a particular sense has died. We conduct a thorough evaluation of the proposed methodology both manually and through comparison with WordNet. Manual evaluation indicates that the algorithm could correctly identify 60.4% of birth cases from a set of 48 randomly picked samples and 57% of split/join cases from a set of 21 randomly picked samples. Remarkably, in 44% of cases the birth of a novel sense is attested by WordNet, while in 46% and 43% of cases split and join, respectively, are confirmed by WordNet. Our approach can be applied to lexicography, as well as to applications like word sense disambiguation or semantic search.
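The comparison of sense clusters across two time points can be illustrated with a small sketch. This is not the authors' implementation: it assumes each sense cluster is simply a set of words, and it links an old and a new cluster when the fraction of shared words exceeds a hypothetical `threshold`. The function names (`overlap`, `compare_senses`), the overlap measure, and the threshold value are all assumptions made for illustration; the abstract does not specify the actual criteria.

```python
from typing import Dict, List, Set


def overlap(a: Set[str], b: Set[str]) -> float:
    """Fraction of the words in cluster a that also appear in cluster b."""
    return len(a & b) / len(a) if a else 0.0


def compare_senses(old: List[Set[str]], new: List[Set[str]],
                   threshold: float = 0.5) -> Dict[str, list]:
    """Label sense changes for one word between two time points.

    old, new  -- sense clusters (sets of words) at the earlier and later time
    threshold -- hypothetical minimum overlap for two clusters to count as
                 the same sense (an assumption, not a value from the paper)
    """
    def linked(o: Set[str], n: Set[str]) -> bool:
        # Use the larger of the two directed overlaps so that a small
        # cluster fully contained in a big one still counts as linked.
        return max(overlap(o, n), overlap(n, o)) >= threshold

    old_to_new = {i: [j for j, n in enumerate(new) if linked(o, n)]
                  for i, o in enumerate(old)}
    new_to_old = {j: [i for i, o in enumerate(old) if linked(o, n)]
                  for j, n in enumerate(new)}

    return {
        # (i) birth: a new cluster with no counterpart in the old data
        "birth": [j for j, parents in new_to_old.items() if not parents],
        # (ii) split: one old cluster linked to several new clusters
        "split": [(i, kids) for i, kids in old_to_new.items() if len(kids) >= 2],
        # (iii) join: one new cluster linked to several old clusters
        "join": [(j, parents) for j, parents in new_to_old.items() if len(parents) >= 2],
        # (iv) death: an old cluster with no counterpart in the new data
        "death": [i for i, kids in old_to_new.items() if not kids],
    }


# Toy clusters for the word "sick" (invented for illustration only).
old_sick = [{"ill", "unwell", "ailing", "nauseous"}]
new_sick = [{"ill", "unwell", "ailing"}, {"cool", "awesome", "great"}]
print(compare_senses(old_sick, new_sick))
```

In the toy data, the second later-period cluster shares no words with the earlier cluster, so it would be reported as a birth. A real system would need a more careful overlap criterion and filtering of noisy clusters.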



Adapting predominant and novel sense discovery algorithms for identifying corpus-specific sense differences

Word senses are not static and may have temporal, spatial or corpus-spec...

Detecting Reliable Novel Word Senses: A Network-Centric Approach

In this era of Big Data, due to expeditious exchange of information on t...

An Unsupervised Word Sense Disambiguation System for Under-Resourced Languages

In this paper, we present Watasense, an unsupervised system for word sen...

SCoT: Sense Clustering over Time: a tool for the analysis of lexical change

We present Sense Clustering over Time (SCoT), a novel network-based tool...

A Knowledge-Based Approach to Word Sense Disambiguation by distributional selection and semantic features

Word sense disambiguation improves many Natural Language Processing (NLP...

WiC-TSV: An Evaluation Benchmark for Target Sense Verification of Words in Context

In this paper, we present WiC-TSV (Target Sense Verification for Words i...

Towards Automation of Sense-type Identification of Verbs in OntoSenseNet(Telugu)

In this paper, we discuss the enrichment of a manually developed resourc...
