Your "Attention" Deserves Attention: A Self-Diversified Multi-Channel Attention for Facial Action Analysis

by   Xiaotian Li, et al.

Visual attention has been extensively studied for learning fine-grained features in both facial expression recognition (FER) and Action Unit (AU) detection. A broad range of previous research has explored how to use attention modules to localize detailed facial parts (e,g. facial action units), learn discriminative features, and learn inter-class correlation. However, few related works pay attention to the robustness of the attention module itself. Through experiments, we found neural attention maps initialized with different feature maps yield diverse representations when learning to attend the identical Region of Interest (ROI). In other words, similar to general feature learning, the representational quality of attention maps also greatly affects the performance of a model, which means unconstrained attention learning has lots of randomnesses. This uncertainty lets conventional attention learning fall into sub-optimal. In this paper, we propose a compact model to enhance the representational and focusing power of neural attention maps and learn the "inter-attention" correlation for refined attention maps, which we term the "Self-Diversified Multi-Channel Attention Network (SMA-Net)". The proposed method is evaluated on two benchmark databases (BP4D and DISFA) for AU detection and four databases (CK+, MMI, BU-3DFE, and BP4D+) for facial expression recognition. It achieves superior performance compared to the state-of-the-art methods.


page 1

page 3

page 7


MGRR-Net: Multi-level Graph Relational Reasoning Network for Facial Action Units Detection

The Facial Action Coding System (FACS) encodes the action units (AUs) in...

Upper, Middle and Lower Region Learning for Facial Action Unit Detection

Facial action units (AUs) detection is fundamental to facial expression ...

Distract Your Attention: Multi-head Cross Attention Network for Facial Expression Recognition

We present a novel facial expression recognition network, called Distrac...

Attention Based Relation Network for Facial Action Units Recognition

Facial action unit (AU) recognition is essential to facial expression an...

Robust Facial Expression Recognition with Convolutional Visual Transformers

Facial Expression Recognition (FER) in the wild is extremely challenging...

EAC-Net: A Region-based Deep Enhancing and Cropping Approach for Facial Action Unit Detection

In this paper, we propose a deep learning based approach for facial acti...

Global-to-local Expression-aware Embeddings for Facial Action Unit Detection

Expressions and facial action units (AUs) are two levels of facial behav...

Please sign up or login with your details

Forgot password? Click here to reset