Speech Driven Talking Face Generation from a Single Image and an Emotion Condition

08/08/2020
by   Sefik Emre Eskimez, et al.
0

Visual emotion expression plays an important role in audiovisual speech communication. In this work, we propose a novel approach to rendering visual emotion expression in speech-driven talking face generation. Specifically, we design an end-to-end talking face generation system that takes a speech utterance, a single face image, and a categorical emotion label as input to render a talking face video in sync with the speech and expressing the condition emotion. Objective evaluation on image quality, audiovisual synchronization, and visual emotion expression shows that the proposed system outperforms a state-of-the-art baseline system. Subjective evaluation of visual emotion expression and video realness also demonstrates the superiority of the proposed system. Furthermore, we conduct a pilot study on human emotion recognition of generated videos with mismatched emotions between the audio and visual modalities, and results show that humans reply on the visual modality more significantly than the audio modality on this task.

READ FULL TEXT

page 3

page 5

page 6

page 8

page 9

research
02/12/2020

An End-to-End Visual-Audio Attention Network for Emotion Recognition in User-Generated Videos

Emotion recognition in user-generated videos plays an important role in ...
research
08/27/2019

EmoSense: Computational Intelligence Driven Emotion Sensing via Wireless Channel Data

Emotion is well-recognized as a distinguished symbol of human beings, an...
research
03/01/2023

READ Avatars: Realistic Emotion-controllable Audio Driven Avatars

We present READ Avatars, a 3D-based approach for generating 2D avatars t...
research
09/16/2021

Invertable Frowns: Video-to-Video Facial Emotion Translation

We present Wav2Lip-Emotion, a video-to-video translation architecture th...
research
10/26/2021

Emotion recognition in talking-face videos using persistent entropy and neural networks

The automatic recognition of a person's emotional state has become a ver...
research
01/31/2020

Detecting Emotion Primitives from Speech and their use in discerning Categorical Emotions

Emotion plays an essential role in human-to-human communication, enablin...
research
02/20/2023

Medical Face Masks and Emotion Recognition from the Body: Insights from a Deep Learning Perspective

The COVID-19 pandemic has undoubtedly changed the standards and affected...

Please sign up or login with your details

Forgot password? Click here to reset