MoCap-guided Data Augmentation for 3D Pose Estimation in the Wild

by Grégory Rogez, et al.

This paper addresses the problem of 3D human pose estimation in the wild. A significant challenge is the lack of training data, i.e., 2D images of humans annotated with 3D poses. Such data is necessary to train state-of-the-art CNN architectures. Here, we propose a solution to generate a large set of photorealistic synthetic images of humans with 3D pose annotations. We introduce an image-based synthesis engine that artificially augments a dataset of real images with 2D human pose annotations using 3D Motion Capture (MoCap) data. Given a candidate 3D pose, our algorithm selects, for each joint, an image whose 2D pose locally matches the projected 3D pose. The selected images are then combined to generate a new synthetic image by stitching local image patches in a kinematically constrained manner. The resulting images are used to train an end-to-end CNN for full-body 3D pose estimation. We cluster the training data into a large number of pose classes and tackle pose estimation as a K-way classification problem. Such an approach is viable only with large training sets such as ours. Our method outperforms the state of the art in 3D pose estimation in controlled environments (Human3.6M) and shows promising results for in-the-wild images (LSP). This demonstrates that CNNs trained on artificial images generalize well to real images.
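The per-joint selection step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the orthographic projection, the squared-distance cost, the `neighbors` map that defines what "locally" means, and the function names `project_orthographic` and `select_images_per_joint` are all assumptions made for illustration. The abstract does not specify the camera model or the matching criterion, and the patch-stitching step is not shown.

```python
import numpy as np


def project_orthographic(pose_3d):
    """Project a (J, 3) pose to 2D by dropping depth (assumed camera model)."""
    return pose_3d[:, :2]


def select_images_per_joint(pose_3d, dataset_2d, neighbors):
    """Pick, for each joint, the dataset image with the best local 2D match.

    pose_3d:    (J, 3) candidate 3D pose (e.g. from MoCap).
    dataset_2d: (N, J, 2) 2D pose annotations of N real images.
    neighbors:  dict mapping joint index -> list of joint indices that form
                its local neighbourhood (hypothetical definition of "local").
    Returns a list of J image indices, one per joint.
    """
    target = project_orthographic(pose_3d)
    choices = []
    for j in range(target.shape[0]):
        idx = neighbors[j]
        # Centre both local poses on joint j so the match ignores translation.
        target_local = target[idx] - target[j]
        data_local = dataset_2d[:, idx, :] - dataset_2d[:, j:j + 1, :]
        # Sum of squared 2D distances over the neighbourhood (assumed cost).
        cost = np.sum((data_local - target_local) ** 2, axis=(1, 2))
        choices.append(int(np.argmin(cost)))
    return choices


# Toy usage: 3 joints, 3 annotated images; image 1 is a shifted copy of the
# projected target, so it should win for every joint.
pose_3d = np.array([[0.0, 0.0, 1.0], [1.0, 0.0, 1.0], [1.0, 1.0, 1.0]])
dataset_2d = np.array([
    [[0.0, 0.0], [2.0, 0.0], [2.0, 2.0]],
    [[5.0, 5.0], [6.0, 5.0], [6.0, 6.0]],
    [[0.0, 0.0], [0.0, 1.0], [1.0, 1.0]],
])
neighbors = {0: [0, 1], 1: [0, 1, 2], 2: [1, 2]}
selected = select_images_per_joint(pose_3d, dataset_2d, neighbors)
```

In this sketch each joint may select a different source image; the abstract states that patches from the selected images are then stitched under kinematic constraints to form one synthetic training image.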


Image-based Synthesis for Deep 3D Human Pose Estimation

This paper addresses the problem of 3D human pose estimation in the wild...

Towards Generalization of 3D Human Pose Estimation In The Wild

In this paper, we propose 3DBodyTex.Pose, a dataset that addresses the t...

Sim2real transfer learning for 3D pose estimation: motion to the rescue

Simulation is an anonymous, low-bias source of data where annotation can...

Post-Data Augmentation to Improve Deep Pose Estimation of Extreme and Wild Motions

Contributions of recent deep-neural-network (DNN) based techniques have ...

In the Wild Human Pose Estimation Using Explicit 2D Features and Intermediate 3D Representations

Convolutional Neural Network based approaches for monocular 3D human pos...

Of Mice and Pose: 2D Mouse Pose Estimation from Unlabelled Data and Synthetic Prior

Numerous fields, such as ecology, biology, and neuroscience, use animal ...

Exemplar Fine-Tuning for 3D Human Pose Fitting Towards In-the-Wild 3D Human Pose Estimation

We propose a method for building large collections of human poses with f...
