Learning Articulated Shape with Keypoint Pseudo-labels from Web Images

04/27/2023
by Anastasis Stathopoulos, et al.

This paper shows that it is possible to learn models for monocular 3D reconstruction of articulated objects (e.g., horses, cows, sheep), using as few as 50-150 images labeled with 2D keypoints. Our proposed approach involves training category-specific keypoint estimators, generating 2D keypoint pseudo-labels on unlabeled web images, and using both the labeled and self-labeled sets to train 3D reconstruction models. It is based on two key insights: (1) 2D keypoint estimation networks trained on as few as 50-150 images of a given object category generalize well and generate reliable pseudo-labels; (2) a data selection mechanism can automatically create a "curated" subset of the unlabeled web images that can be used for training – we evaluate four data selection methods. Coupling these two insights enables us to train models that effectively utilize web images, resulting in improved 3D reconstruction performance for several articulated object categories beyond the fully-supervised baseline. Our approach can quickly bootstrap a model and requires only a few images labeled with 2D keypoints. This requirement can be easily satisfied for any new object category. To showcase the practicality of our approach for predicting the 3D shape of arbitrary object categories, we annotate 2D keypoints on giraffe and bear images from COCO – the annotation process takes less than 1 minute per image.
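The data selection step described above can be illustrated with a minimal sketch. The abstract does not name the four selection methods that were evaluated, so the criterion used here (ranking images by the mean confidence of their predicted keypoints and keeping the top few) is an assumption chosen only for illustration. It should not be read as one of the paper's methods. The names `PseudoLabel`, `mean_confidence`, and `select_curated_subset` are hypothetical, as are the sample values.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class PseudoLabel:
    """Keypoint pseudo-label for one unlabeled web image (hypothetical structure)."""
    image_id: str
    keypoints: List[Tuple[float, float]]  # predicted (x, y) per keypoint
    confidences: List[float]              # per-keypoint confidence in [0, 1]


def mean_confidence(label: PseudoLabel) -> float:
    """Score an image by the average confidence of its predicted keypoints."""
    if not label.confidences:
        return 0.0
    return sum(label.confidences) / len(label.confidences)


def select_curated_subset(labels: List[PseudoLabel], keep: int) -> List[PseudoLabel]:
    """Keep the `keep` highest-scoring pseudo-labeled images.

    Assumed criterion for illustration only; the paper evaluates four
    selection methods that are not specified in the abstract.
    """
    ranked = sorted(labels, key=mean_confidence, reverse=True)
    return ranked[:max(keep, 0)]


# Made-up pseudo-labels standing in for keypoint-estimator output on web images.
web_labels = [
    PseudoLabel("web_001", [(10.0, 20.0), (30.0, 40.0)], [0.9, 0.8]),
    PseudoLabel("web_002", [(12.0, 22.0), (31.0, 41.0)], [0.2, 0.3]),
    PseudoLabel("web_003", [(11.0, 21.0), (29.0, 39.0)], [0.7, 0.6]),
]

curated = select_curated_subset(web_labels, keep=2)
selected_ids = [label.image_id for label in curated]
```

In a pipeline of the kind the abstract outlines, the curated subset would then be combined with the small labeled set to train the 3D reconstruction model.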


