A Semantics-Based Measure of Emoji Similarity

07/14/2017
by   Sanjaya Wijeratne, et al.
0

Emoji have grown to become one of the most important forms of communication on the web. With its widespread use, measuring the similarity of emoji has become an important problem for contemporary text processing since it lies at the heart of sentiment analysis, search, and interface design tasks. This paper presents a comprehensive analysis of the semantic similarity of emoji through embedding models that are learned over machine-readable emoji meanings in the EmojiNet knowledge base. Using emoji descriptions, emoji sense labels and emoji sense definitions, and with different training corpora obtained from Twitter and Google News, we develop and test multiple embedding models to measure emoji similarity. To evaluate our work, we create a new dataset called EmoSim508, which assigns human-annotated semantic similarity scores to a set of 508 carefully selected emoji pairs. After validation with EmoSim508, we present a real-world use-case of our emoji embedding models using a sentiment analysis task and show that our models outperform the previous best-performing emoji embedding model on this task. The EmoSim508 dataset and our emoji embedding models are publicly released with this paper and can be downloaded from http://emojinet.knoesis.org/.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/04/2022

An LSTM model for Twitter Sentiment Analysis

Sentiment analysis on social media such as Twitter provides organization...
research
07/14/2017

EmojiNet: An Open Service and API for Emoji Sense Discovery

This paper presents the release of EmojiNet, the largest machine-readabl...
research
07/09/2018

Towards Enhancing Lexical Resource and Using Sense-annotations of OntoSenseNet for Sentiment Analysis

This paper illustrates the interface of the tool we developed for crowd ...
research
07/14/2017

Developing a concept-level knowledge base for sentiment analysis in Singlish

In this paper, we present Singlish sentiment lexicon, a concept-level kn...
research
04/22/2020

Preserving the Hypernym Tree of WordNet in Dense Embeddings

In this paper, we provide a novel way to generate low-dimension (dense) ...
research
08/30/2019

Keep Calm and Switch On! Preserving Sentiment and Fluency in Semantic Text Exchange

In this paper, we present a novel method for measurably adjusting the se...
research
10/25/2016

EmojiNet: Building a Machine Readable Sense Inventory for Emoji

Emoji are a contemporary and extremely popular way to enhance electronic...

Please sign up or login with your details

Forgot password? Click here to reset