Scale-aware Insertion of Virtual Objects in Monocular Videos

12/04/2020
by   Songhai Zhang, et al.
0

In this paper, we propose a scale-aware method for inserting virtual objects with proper sizes into monocular videos. To tackle the scale ambiguity problem of geometry recovery from monocular videos, we estimate the global scale objects in a video with a Bayesian approach incorporating the size priors of objects, where the scene objects sizes should strictly conform to the same global scale and the possibilities of global scales are maximized according to the size distribution of object categories. To do so, we propose a dataset of sizes of object categories: Metric-Tree, a hierarchical representation of sizes of more than 900 object categories with the corresponding images. To handle the incompleteness of objects recovered from videos, we propose a novel scale estimation method that extracts plausible dimensions of objects for scale optimization. Experiments have shown that our method for scale estimation performs better than the state-of-the-art methods, and has considerable validity and robustness for different video scenes. Metric-Tree has been made available at: https://metric-tree.github.io

READ FULL TEXT

page 1

page 4

page 5

page 7

research
02/10/2022

Scale Estimation with Dual Quadrics for Monocular Object SLAM

The scale ambiguity problem is inherently unsolvable to monocular SLAM w...
research
06/08/2023

2D Supervised Monocular 3D Object Detection by Global-to-Local 3D Reconstruction

With the advent of the big model era, the demand for data has become mor...
research
05/13/2021

SizeNet: Object Recognition via Object Real Size-based convolutional networks

Inspired by the conclusion that human choose the visual cortex regions w...
research
06/27/2018

Primary Object Segmentation in Aerial Videos via Hierarchical Temporal Slicing and Co-Segmentation

Primary object segmentation plays an important role in understanding vid...
research
05/27/2017

Probabilistic Global Scale Estimation for MonoSLAM Based on Generic Object Detection

This paper proposes a novel method to estimate the global scale of a 3D ...
research
12/06/2018

ROI-10D: Monocular Lifting of 2D Detection to 6D Pose and Metric Shape

We present a deep learning method for end-to-end monocular 3D object det...
research
05/10/2023

Reconstructing Animatable Categories from Videos

Building animatable 3D models is challenging due to the need for 3D scan...

Please sign up or login with your details

Forgot password? Click here to reset