3MASSIV: Multilingual, Multimodal and Multi-Aspect dataset of Social Media Short Videos

03/28/2022
by Vikram Gupta, et al.

We present 3MASSIV, a multilingual, multimodal, and multi-aspect, expertly annotated dataset of diverse short videos extracted from the short-video social media platform Moj. 3MASSIV comprises 50K short videos (average duration of 20 seconds) and 100K unlabeled videos in 11 different languages. It captures popular short-video trends such as pranks, fails, romance, and comedy, expressed via unique audio-visual formats such as self-shot videos, reaction videos, lip-synching, and self-sung songs. 3MASSIV presents an opportunity for multimodal and multilingual semantic understanding of these videos by annotating them for concepts, affective states, media types, and audio language. We present a thorough analysis of 3MASSIV, provide strong baselines, and highlight the variety and unique aspects of our dataset compared to other popular contemporary datasets. We also show that the social media content in 3MASSIV is dynamic and temporal in nature, which can be used for semantic understanding tasks and cross-lingual analysis.


research
10/13/2020

BRUMS at SemEval-2020 Task 12 : Transformer based Multilingual Offensive Language Identification in Social Media

In this paper, we describe the team BRUMS entry to OffensEval 2: Multili...
research
07/14/2022

Estimating Emotion Contagion on Social Media via Localized Diffusion in Dynamic Graphs

We present a computational approach for estimating emotion contagion on ...
research
04/03/2022

Multilingual and Multimodal Abuse Detection

The presence of abusive content on social media platforms is undesirable...
research
03/09/2023

Seeing ChatGPT Through Students' Eyes: An Analysis of TikTok Data

Advanced large language models like ChatGPT have gained considerable att...
research
03/31/2016

The Open World of Micro-Videos

Micro-videos are six-second videos popular on social media networks with...
research
11/03/2020

Content-based Analysis of the Cultural Differences between TikTok and Douyin

Short-form video social media shifts away from the traditional media par...
research
04/12/2023

AutoShot: A Short Video Dataset and State-of-the-Art Shot Boundary Detection

The short-form videos have explosive popularity and have dominated the n...
