PoserNet: Refining Relative Camera Poses Exploiting Object Detections

07/19/2022
by   Matteo Taiana, et al.
11

The estimation of the camera poses associated with a set of images commonly relies on feature matches between the images. In contrast, we are the first to address this challenge by using objectness regions to guide the pose estimation problem rather than explicit semantic object detections. We propose Pose Refiner Network (PoserNet) a light-weight Graph Neural Network to refine the approximate pair-wise relative camera poses. PoserNet exploits associations between the objectness regions - concisely expressed as bounding boxes - across multiple views to globally refine sparsely connected view graphs. We evaluate on the 7-Scenes dataset across varied sizes of graphs and show how this process can be beneficial to optimisation-based Motion Averaging algorithms improving the median error on the rotation by 62 degrees with respect to the initial estimates obtained based on bounding boxes. Code and data are available at https://github.com/IIT-PAVIS/PoserNet.

READ FULL TEXT

page 2

page 19

page 21

research
04/24/2020

YCB-M: A Multi-Camera RGB-D Dataset for Object Recognition and 6DoF Pose Estimation

While a great variety of 3D cameras have been introduced in recent years...
research
04/08/2021

DSC-PoseNet: Learning 6DoF Object Pose Estimation via Dual-scale Consistency

Compared to 2D object bounding-box labeling, it is very difficult for hu...
research
05/02/2022

Dual networks based 3D Multi-Person Pose Estimation from Monocular Video

Monocular 3D human pose estimation has made progress in recent years. Mo...
research
05/24/2021

3D-Aware Ellipse Prediction for Object-Based Camera Pose Estimation

In this paper, we propose a method for coarse camera pose computation wh...
research
07/28/2020

Multi-camera Torso Pose Estimation using Graph Neural Networks

Estimating the location and orientation of humans is an essential skill ...
research
04/13/2020

End-to-End Estimation of Multi-Person 3D Poses from Multiple Cameras

We present an approach to estimate 3D poses of multiple people from mult...
research
08/18/2021

Pixel-Perfect Structure-from-Motion with Featuremetric Refinement

Finding local features that are repeatable across multiple views is a co...

Please sign up or login with your details

Forgot password? Click here to reset