Unsupervised Sentiment Analysis of Plastic Surgery Social Media Posts

07/05/2023
by   Alexandrea K. Ramnarine, et al.
0

The massive collection of user posts across social media platforms is primarily untapped for artificial intelligence (AI) use cases based on the sheer volume and velocity of textual data. Natural language processing (NLP) is a subfield of AI that leverages bodies of documents, known as corpora, to train computers in human-like language understanding. Using a word ranking method, term frequency-inverse document frequency (TF-IDF), to create features across documents, it is possible to perform unsupervised analytics, machine learning (ML) that can group the documents without a human manually labeling the data. For large datasets with thousands of features, t-distributed stochastic neighbor embedding (t-SNE), k-means clustering and Latent Dirichlet allocation (LDA) are employed to learn top words and generate topics for a Reddit and Twitter combined corpus. Using extremely simple deep learning models, this study demonstrates that the applied results of unsupervised analysis allow a computer to predict either negative, positive, or neutral user sentiment towards plastic surgery based on a tweet or subreddit post with almost 90 accuracy. Furthermore, the model is capable of achieving higher accuracy on the unsupervised sentiment task than on a rudimentary supervised document classification task. Therefore, unsupervised learning may be considered a viable option in labeling social media documents for NLP tasks.

READ FULL TEXT

page 4

page 5

page 12

page 16

page 19

research
07/04/2017

Sentiment Identification in Code-Mixed Social Media Text

Sentiment analysis is the Natural Language Processing (NLP) task dealing...
research
11/09/2015

Sentiment Expression via Emoticons on Social Media

Emoticons (e.g., :) and :( ) have been widely used in sentiment analysis...
research
06/05/2020

Sentiment Analysis Based on Deep Learning: A Comparative Study

The study of public opinion can provide us with valuable information. Th...
research
05/25/2023

Mapping ChatGPT in Mainstream Media: Early Quantitative Insights through Sentiment Analysis and Word Frequency Analysis

The exponential growth in user acquisition and popularity of ChatGPT, an...
research
05/04/2023

Curating corpora with classifiers: A case study of clean energy sentiment online

Well curated, large-scale corpora of social media posts containing broad...
research
11/21/2016

Unsupervised Learning for Lexicon-Based Classification

In lexicon-based classification, documents are assigned labels by compar...
research
03/08/2022

Which side are you on? Insider-Outsider classification in conspiracy-theoretic social media

Social media is a breeding ground for threat narratives and related cons...

Please sign up or login with your details

Forgot password? Click here to reset