"I can't keep it up anymore." The Voat.co dataset

01/15/2022
by   Amin Mekacher, et al.
0

Voat was a news aggregator website that shut down on December 25, 2020. The site had a troubled history and was known for hosting various banned subreddits. This paper presents a dataset with over 2.3M submissions and 16.2M comments posted from 113K users in 7.1K subverses (the equivalent of subreddit for Voat). Our dataset covers the whole lifetime of Voat, from its developing period starting on November 8, 2013, the day it was founded, April 2014, up until the day it shut down (December 25, 2020). This work presents the largest and most complete publicly available Voat dataset, to the best of our knowledge. We also present a preliminary analysis to cover posting activity and daily user and subverse registration on the platform so that researchers interested in our dataset can know what to expect. Our data may prove helpful to false news dissemination studies as we analyze the links users share on the platform, finding that many communities rely on alternative news press, like Breitbart and GatewayPundit, for their daily discussions. Last, we perform network analysis on user interactions finding that many users prefer not to interact with subverses outside their narrative interests, which could be helpful to researchers focusing on polarization and echo chambers. Also, since Voat was one of the platforms many Reddit users migrated to after a ban, we are confident that our dataset will motivate and assist researchers studying deplatforming. In addition, many hateful and conspiratorial communities seem to be very popular on Voat, which makes our work valuable for researchers focusing on toxicity, conspiracy theories, cross-platform studies of social networks, and natural language processing.

READ FULL TEXT
research
01/21/2020

Raiders of the Lost Kek: 3.5 Years of Augmented 4chan Posts from the Politically Incorrect Board

This paper presents a dataset with over 3.3M threads and 134.5M posts fr...
research
01/23/2020

The Pushshift Telegram Dataset

Messaging platforms, especially those with a mobile focus, have become i...
research
12/01/2022

FullBrain: a Social E-learning Platform

We present FullBrain, a social e-learning platform where students share ...
research
01/11/2021

An Early Look at the Parler Online Social Network

Parler is as an "alternative" social network promoting itself as a servi...
research
11/16/2018

Homogeneity-Based Transmissive Process to Model True and False News in Social Networks

An overwhelming number of true and false news stories are posted and sha...
research
09/10/2020

"Is it a Qoincidence?": A First Step Towards Understanding and Characterizing the QAnon Movement on Voat.co

Online fringe communities offer fertile grounds for users to seek and sh...
research
08/19/2021

Unsupervised Topic Discovery in User Comments

On social media platforms like Twitter, users regularly share their opin...

Please sign up or login with your details

Forgot password? Click here to reset