Study of sampling methods in sentiment analysis of imbalanced data

06/12/2021
by   Zeeshan Ali Sayyed, et al.
0

This work investigates the application of sampling methods for sentiment analysis on two different highly imbalanced datasets. One dataset contains online user reviews from the cooking platform Epicurious and the other contains comments given to the Planned Parenthood organization. In both these datasets, the classes of interest are rare. Word n-grams were used as features from these datasets. A feature selection technique based on information gain is first applied to reduce the number of features to a manageable space. A number of different sampling methods were then applied to mitigate the class imbalance problem which are then analyzed.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/27/2014

Persian Sentiment Analyzer: A Framework based on a Novel Feature Selection Method

In the recent decade, with the enormous growth of digital content in int...
research
12/23/2021

Making sense of electrical vehicle discussions using sentiment analysis on closely related news and user comments

We used a token-wise and document-wise sentiment analysis using both uns...
research
09/15/2021

Dialog speech sentiment classification for imbalanced datasets

Speech is the most common way humans express their feelings, and sentime...
research
06/05/2019

Prediction of Workplace Injuries

Workplace injuries result in substantial human and financial losses. As ...
research
09/25/2020

Empirical Study of Text Augmentation on Social Media Text in Vietnamese

In the text classification problem, the imbalance of labels in datasets ...
research
07/11/2022

Partial Resampling of Imbalanced Data

Imbalanced data is a frequently encountered problem in machine learning....
research
09/04/2017

From Review to Rating: Exploring Dependency Measures for Text Classification

Various text analysis techniques exist, which attempt to uncover unstruc...

Please sign up or login with your details

Forgot password? Click here to reset