Text Compression for Sentiment Analysis via Evolutionary Algorithms

09/20/2017
by Emmanuel Dufourq, et al.

Can textual data be compressed intelligently without losing accuracy in evaluating sentiment? In this study, we propose a novel evolutionary compression algorithm, PARSEC (PARts-of-Speech for sEntiment Compression), which uses Parts-of-Speech tags to compress text in a way that sacrifices minimal classification accuracy when used in conjunction with sentiment analysis algorithms. An analysis of PARSEC with eight commercial and non-commercial sentiment analysis algorithms on twelve English sentiment data sets reveals that accurate compression is possible, with little loss in sentiment classification accuracy, for PARSEC using LingPipe, the most accurate of the sentiment algorithms. Other sentiment analysis algorithms are more severely affected by compression. We conclude that significant compression of text data is possible for sentiment analysis, depending on the accuracy demands of the specific application and the specific sentiment analysis algorithm used.
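To make the idea of Parts-of-Speech-based compression concrete, here is a minimal Python sketch. It is not PARSEC itself: the abstract does not describe the algorithm's internals, so the toy lexicon, the tag names, and the functions `tag`, `compress`, and `compression_ratio` are all hypothetical, and the set of tags to drop is fixed by hand rather than found by an evolutionary search.

```python
# Toy illustration only. The lexicon, tag names, and function names are
# hypothetical and are NOT taken from the PARSEC paper.
TOY_LEXICON = {
    "the": "DET", "a": "DET", "movie": "NOUN", "was": "VERB",
    "really": "ADV", "great": "ADJ", "but": "CONJ", "ending": "NOUN",
    "felt": "VERB", "terrible": "ADJ",
}


def tag(tokens):
    """Pair each token with a Part-of-Speech tag from the toy lexicon."""
    return [(tok, TOY_LEXICON.get(tok.lower(), "UNK")) for tok in tokens]


def compress(text, drop_tags):
    """Remove every word whose tag is in drop_tags; keep word order."""
    kept = [tok for tok, pos in tag(text.split()) if pos not in drop_tags]
    return " ".join(kept)


def compression_ratio(original, compressed):
    """Fraction of words removed (0.0 means nothing was removed)."""
    n = len(original.split())
    return 0.0 if n == 0 else 1.0 - len(compressed.split()) / n


text = "The movie was really great but the ending felt terrible"
# Assumption: determiners and conjunctions carry little sentiment.
short = compress(text, drop_tags={"DET", "CONJ"})
ratio = compression_ratio(text, short)
print(short)
print(f"{ratio:.0%} of words removed")
```

In practice, the compressed text would then be passed to a sentiment classifier and its accuracy compared with the accuracy on the original text. That comparison is the trade-off the abstract describes.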


research
02/17/2018

Sentiment Analysis on Speaker Specific Speech Data

Sentiment analysis has evolved over past few decades, most of the work i...
research
04/05/2018

Automated Classification of Text Sentiment

The ability to identify sentiment in text, referred to as sentiment anal...
research
05/01/2020

Beneath the Tip of the Iceberg: Current Challenges and New Directions in Sentiment Analysis Research

Sentiment analysis as a field has come a long way since it was first int...
research
12/03/2017

Sentiment Classification using Images and Label Embeddings

In this project we analysed how much semantic information images carry, ...
research
05/01/2018

Word2Vec and Doc2Vec in Unsupervised Sentiment Analysis of Clinical Discharge Summaries

In this study, we explored application of Word2Vec and Doc2Vec for senti...
research
03/03/2021

EmoWrite: A Sentiment Analysis-Based Thought to Text Conversion

Brain Computer Interface (BCI) helps in processing and extraction of use...
research
08/01/2019

A compression based framework for the detection of anomalies in heterogeneous data sources

Nowadays, information and communications technology systems are fundamen...
