Facial Expression Recognition with Swin Transformer

03/25/2022
by   Jun-Hwa Kim, et al.
0

The task of recognizing human facial expressions plays a vital role in various human-related systems, including health care and medical fields. With the recent success of deep learning and the accessibility of a large amount of annotated data, facial expression recognition research has been mature enough to be utilized in real-world scenarios with audio-visual datasets. In this paper, we introduce Swin transformer-based facial expression approach for an in-the-wild audio-visual dataset of the Aff-Wild2 Expression dataset. Specifically, we employ a three-stream network (i.e., Visual stream, Temporal stream, and Audio stream) for the audio-visual videos to fuse the multi-modal information into facial expression recognition. Experimental results on the Aff-Wild2 dataset show the effectiveness of our proposed multi-modal approaches.

READ FULL TEXT

page 2

page 3

research
03/15/2023

Multi-Modal Facial Expression Recognition with Transformer-Based Fusion Networks and Dynamic Sampling

Facial expression recognition is important for various purpose such as e...
research
08/25/2023

Prompting Visual-Language Models for Dynamic Facial Expression Recognition

This paper presents a novel visual-language model called DFER-CLIP, whic...
research
10/26/2021

ViDA-MAN: Visual Dialog with Digital Humans

We demonstrate ViDA-MAN, a digital-human agent for multi-modal interacti...
research
06/02/2021

Domain Adaptation for Facial Expression Classifier via Domain Discrimination and Gradient Reversal

Bringing empathy to a computerized system could significantly improve th...
research
09/17/2021

Expression Snippet Transformer for Robust Video-based Facial Expression Recognition

The recent success of Transformer has provided a new direction to variou...
research
03/23/2023

FER-former: Multi-modal Transformer for Facial Expression Recognition

The ever-increasing demands for intuitive interactions in Virtual Realit...
research
11/30/2020

Detecting expressions with multimodal transformers

Developing machine learning algorithms to understand person-to-person en...

Please sign up or login with your details

Forgot password? Click here to reset