Transparent objects are common in daily life. However, depth sensing for...
Vision Language Models (VLMs), which extend Large Language Models (LLM) ...
We propose a method that trains a neural radiance field (NeRF) to encode...
Given a 3D object, kinematic motion prediction aims to identify the mobi...
High-resolution images enable neural networks to learn richer visual
rep...
Generating lyrics and poems is one of the essential downstream tasks in ...
Human-centric perceptions include a variety of vision tasks, which have
...
Mainstream 3D representation learning approaches are built upon contrast...
Recent state-of-the-art source-free domain adaptation (SFDA) methods hav...
The success of deep learning heavily relies on large-scale data with
com...
Recent work on 4D point cloud sequences has attracted a lot of attention...
Learning descriptive 3D features is crucial for understanding 3D scenes ...
We present MoRig, a method that automatically rigs character meshes driv...
We propose AccoMontage2, a system capable of doing full-length song
harm...
This paper proposes a 4D backbone for long-term point cloud video
unders...
Humans with an average level of social cognition can infer the beliefs o...
Rigged puppets are one of the most prevalent representations to create 2...
This paper studies the problem of fixing malfunctional 3D objects. While...
Rotation equivariance has recently become a strongly desired property in...
Training a generalizable 3D part segmentation network is quite challengi...
Deep neural networks are able to memorize noisy labels easily with a sof...
A critical aspect of human visual perception is the ability to parse vis...
This paper proposes a method for representation learning of multimodal d...
Self-supervised representation learning is a critical problem in compute...
Localizing the camera in a known indoor environment is a key building bl...
Single-image 3D shape reconstruction is an important and long-standing
p...
We study an unsupervised domain adaptation problem for the semantic labe...
In this paper, we examine the long-neglected yet important effects of po...
When learning to sketch, beginners start with simple and flexible shapes...
Building home assistant robots has long been a pursuit for vision and
ro...
This paper addresses the task of category-level pose estimation for
arti...
Learning to encode differences in the geometry and (topological) structu...
The ability to generate novel, diverse, and realistic 3D shapes along wi...
We introduce CoSegNet, a deep neural network architecture for co-segment...
Surface-based geodesic topology provides strong cues for object semantic...
We introduce a novel 3D object proposal approach named Generative Shape
...
We present PartNet: a consistent, large-scale dataset of 3D objects anno...
We introduce, TextureNet, a neural network architecture designed to extr...
Fitting geometric primitives to 3D point cloud data bridges a gap betwee...
Object functionality is often expressed through part articulation -- as ...
We introduce a large-scale 3D shape understanding benchmark using data a...
Few prior works study deep learning on point sets. PointNet by Qi et al....
We propose a method for converting geometric shapes into hierarchically
...
Important high-level vision tasks such as human-object interaction, imag...
In this paper, we study the problem of semantic annotation on 3D models ...
In human-computer conversation systems, the context of a user-issued
utt...
We present ShapeNet: a richly-annotated, large-scale repository of shape...
Comparing two images in a view-invariant way has been a challenging prob...