We present a method that reconstructs and animates a 3D head avatar from...
We present SSOD, the first end-to-end analysis-by synthesis framework wi...
Training on synthetic data can be beneficial for label or data-scarce
sc...
Tracking segmentation masks of multiple instances has been intensively
s...
One central question for video action recognition is how to model motion...
With the growing attention on learning-to-learn new tasks using only a f...
Training deep neural networks to estimate the viewpoint of objects requi...
We learn a self-supervised, single-view 3D reconstruction model that pre...
Weakly-supervised semantic segmentation is a challenging task as no
pixe...
This paper proposes to learn reliable dense correspondence from videos i...
Processing an input signal that contains arbitrary structures, e.g.,
sup...
Viewpoint estimation for known categories of objects has been improved
s...
Parts provide a good intermediate representation of objects that is robu...
Affordance modeling plays an important role in visual understanding. In ...
Learning to insert an object instance into an image in a semantically
co...
Learning to insert an object instance into an image in a semantically
co...
Given a random pair of images, an arbitrary style transfer method extrac...
In this paper, we propose a general dual convolutional neural network
(D...
Videos contain highly redundant information between frames. Such redunda...
In this paper, we propose a learning-based method to compose a video-sto...
In this paper, we propose spatial propagation networks for learning the
...
We propose a deep learning-based framework for instance-level object
seg...
Despite rapid advances in face recognition, there remains a clear gap be...
Face parsing is an important problem in computer vision that finds numer...