SEWA DB: A Rich Database for Audio-Visual Emotion and Sentiment Research in the Wild

by Jean Kossaifi, et al.

Natural human-computer interaction and audio-visual human behaviour sensing systems that achieve robust performance in the wild are needed more than ever, as digital devices are increasingly becoming an indispensable part of our lives. Accurately annotated real-world data are the crux in devising such systems. However, existing databases usually consider controlled settings, low demographic variability, and a single task. In this paper, we introduce the SEWA database of more than 2000 minutes of audio-visual data of 398 people coming from six cultures, 50% female, and uniformly spanning the age range of 18 to 65 years old. Subjects were recorded in two different contexts: while watching adverts and while discussing adverts in a video chat. The database includes rich annotations of the recordings in terms of facial landmarks, facial action units (FAU), various vocalisations, mirroring, and continuously valued valence, arousal, liking, agreement, and prototypic examples of (dis)liking. This database aims to be an extremely valuable resource for researchers in affective computing and automatic human sensing and is expected to push forward research in human behaviour analysis, including cultural studies. Along with the database, we provide extensive baseline experiments for automatic FAU detection and automatic valence, arousal, and (dis)liking intensity estimation.




RAF-AU Database: In-the-Wild Facial Expressions with Subjective Emotion Judgement and Objective AU Annotations

Much of the work on automatic facial expression recognition relies on da...

Aff-Wild2: Extending the Aff-Wild Database for Affect Recognition

Automatic understanding of human affect using visual signals is a proble...

MAFW: A Large-scale, Multi-modal, Compound Affective Database for Dynamic Facial Expression Recognition in the Wild

Dynamic facial expression recognition (FER) databases provide important ...

A Multi-Task Learning & Generation Framework: Valence-Arousal, Action Units & Primary Expressions

Over the past few years many research efforts have been devoted to the f...

Cross-Cultural and Cultural-Specific Production and Perception of Facial Expressions of Emotion in the Wild

Automatic recognition of emotion from facial expressions is an intense a...

Prior Aided Streaming Network for Multi-task Affective Recognition at the 2nd ABAW2 Competition

Automatic affective recognition has been an important research topic in ...

PTSD in the Wild: A Video Database for Studying Post-Traumatic Stress Disorder Recognition in Unconstrained Environments

Post-traumatic stress disorder (PTSD) is a chronic and debilitating ment...
