Vision Transformer for Action Units Detection

03/16/2023
by Tu Vu, et al.

Facial Action Unit (FAU) detection is a fine-grained classification problem that involves identifying the individual action units on the human face, as defined by the Facial Action Coding System. In this paper, we present a simple yet efficient Vision Transformer-based approach to Action Unit (AU) detection in the context of the Affective Behavior Analysis in-the-wild (ABAW) competition. We employ the Video Vision Transformer (ViViT) network to capture temporal facial changes in video. In addition, to reduce the massive size of the Vision Transformer model, we replace the ViViT feature extraction layers with a CNN backbone (RegNet). Our model outperforms the baseline model of the ABAW 2023 challenge by a notable margin of 14%. Furthermore, the achieved results are comparable to those of the top three teams in the previous ABAW 2022 challenge.
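The architecture described in the abstract (a CNN backbone producing per-frame features, followed by a transformer that models change across frames) can be sketched roughly as follows. This is a minimal illustration in PyTorch under several assumptions, not the authors' implementation: `FrameBackbone` is a tiny hypothetical stand-in for RegNet, the class names, layer sizes, clip length, and the default of 12 action units are illustrative choices, and emitting one set of AU logits per frame with a sigmoid on top is an assumed multi-label formulation that the abstract does not spell out.

```python
import torch
import torch.nn as nn


class FrameBackbone(nn.Module):
    """Tiny stand-in CNN (assumption); the paper uses RegNet in this role."""

    def __init__(self, embed_dim: int = 64):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(3, 32, kernel_size=3, stride=2, padding=1),
            nn.ReLU(inplace=True),
            nn.Conv2d(32, embed_dim, kernel_size=3, stride=2, padding=1),
            nn.ReLU(inplace=True),
            nn.AdaptiveAvgPool2d(1),  # pool to one feature vector per frame
        )

    def forward(self, frames: torch.Tensor) -> torch.Tensor:
        # frames: (N, 3, H, W) -> (N, embed_dim)
        return self.features(frames).flatten(1)


class ClipAUDetector(nn.Module):
    """CNN features per frame, then a transformer encoder over the time axis."""

    def __init__(self, num_aus: int = 12, embed_dim: int = 64,
                 num_frames: int = 8, num_layers: int = 2, num_heads: int = 4):
        super().__init__()
        self.backbone = FrameBackbone(embed_dim)
        # Learned temporal position embedding, one vector per frame index.
        self.pos_embed = nn.Parameter(torch.zeros(1, num_frames, embed_dim))
        layer = nn.TransformerEncoderLayer(
            d_model=embed_dim, nhead=num_heads,
            dim_feedforward=4 * embed_dim, batch_first=True,
        )
        self.temporal_encoder = nn.TransformerEncoder(layer, num_layers=num_layers)
        self.head = nn.Linear(embed_dim, num_aus)  # one logit per action unit

    def forward(self, clip: torch.Tensor) -> torch.Tensor:
        # clip: (B, T, 3, H, W)
        b, t = clip.shape[:2]
        # Fold time into the batch so the CNN sees individual frames.
        tokens = self.backbone(clip.flatten(0, 1)).reshape(b, t, -1)
        tokens = tokens + self.pos_embed[:, :t]
        tokens = self.temporal_encoder(tokens)  # frames attend to each other
        return self.head(tokens)  # (B, T, num_aus) logits


# Usage: two random clips of 8 frames at 64x64 resolution.
model = ClipAUDetector().eval()
clip = torch.randn(2, 8, 3, 64, 64)
with torch.no_grad():
    probs = torch.sigmoid(model(clip))  # independent probability per AU
print(probs.shape)  # torch.Size([2, 8, 12])
```

In this sketch each frame contributes a single token, so the transformer only has to model temporal relations while the CNN handles spatial feature extraction. That division of labor is one plausible reading of replacing the ViViT feature extraction layers with a CNN backbone.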


