Text-to-image diffusion models, e.g. Stable Diffusion (SD), lately have ...
This paper proposes a transformer-based learned image compression system...
In this paper, we tackle two challenges in multimodal learning for visua...
Online continual learning (OCL) aims to enable model learning from a
non...
For monocular depth estimation, acquiring ground truths for real data is...
Due to the rise of spherical cameras, monocular 360 depth estimation bec...
Generating images from hand-drawings is a crucial and fundamental task i...
Current image-to-image translation methods formulate the task with
condi...
Self-supervised learning on point clouds has gained a lot of attention
r...
Most of the existing algorithms for zero-shot classification problems
ty...
On playing video games, different players usually have their own playsty...
Deep networks for Monocular Depth Estimation (MDE) have achieved promisi...
Faces manifest large variations in many aspects, such as identity,
expre...
Model-agnostic meta-learning (MAML) is one of the most popular and
widel...
In this paper we propose a novel point cloud generator that is able to
r...
In this work, we aim to address the 3D scene stylization problem - gener...
This paper presents a novel preconditioning strategy for the classic 8-p...
Although significant progress has been made in room layout estimation, m...
In this paper we propose a new problem scenario in image processing,
wid...
We propose a Paired Few-shot GAN (PFS-GAN) model for learning generators...
The current neural networks for semantic segmentation usually predict th...
In this paper, we address the problem of distillation-based class-increm...
There exist many forms of deep latent variable models, such as the
varia...
Recently proposed normalizing flow models such as Glow have been shown t...
Inferring the information of 3D layout from a single equirectangular pan...
Recently, end-to-end trainable deep neural networks have significantly
i...
Stereo matching and flow estimation are two essential tasks for scene
un...
The complementary characteristics of active and passive depth sensing
te...
In this paper we tackle the problem of unsupervised domain adaptation fo...
Style transfer has been widely applied to give real-world images a new
a...
With the increasing amount of video data, it is desirable to highlight o...
While representation learning aims to derive interpretable features for
...
Are we ready to segment consumer stereo videos? The amount of this data ...
We propose a novel superpixel-based multi-view convolutional neural netw...
The Histogram of Oriented Gradient (HOG) descriptor has led to many adva...