Self-Supervised Learning of Image Scale and Orientation

06/15/2022
by Jongmin Lee, et al.

We study the problem of learning to assign a characteristic pose, i.e., scale and orientation, to an image region of interest. Despite its apparent simplicity, the problem is non-trivial: it is hard to obtain a large-scale set of image regions with explicit pose annotations that a model can learn from directly. To tackle this issue, we propose a self-supervised learning framework with a histogram alignment technique. The framework generates pairs of image patches by random rescaling/rotating and then trains an estimator to predict their scale/orientation values so that their relative difference is consistent with the rescaling/rotating applied. The estimator learns to predict a non-parametric histogram distribution of scale/orientation without any supervision. Experiments show that it significantly outperforms previous methods in scale/orientation estimation, and that incorporating our patch poses into the matching process also improves image matching and 6 DoF camera pose estimation.
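
The consistency idea in the abstract can be illustrated with a minimal sketch for the orientation case only. This is not the authors' implementation: the bin count, the shift direction, the cross-entropy loss, and the names `shift_histogram`, `alignment_loss`, and `one_hot` are all assumptions made for illustration. The sketch takes the orientation histogram predicted for one patch, circularly shifts it by the known rotation applied to produce the second patch, and compares the result with the histogram predicted for that second patch.

```python
import numpy as np

NUM_BINS = 36  # assumed bin count (10 degrees per bin); not taken from the paper


def shift_histogram(hist, angle_deg, num_bins=NUM_BINS):
    """Circularly shift an orientation histogram by a known rotation angle."""
    bin_width = 360.0 / num_bins
    shift = int(round(angle_deg / bin_width)) % num_bins
    return np.roll(hist, shift)


def alignment_loss(hist_a, hist_b, angle_deg, eps=1e-8):
    """Cross-entropy between hist_b and hist_a shifted by the known rotation.

    The loss is small when the relative difference between the two
    predictions matches the rotation that was applied (assumed loss form).
    """
    target = shift_histogram(hist_a, angle_deg)
    return float(-np.sum(target * np.log(hist_b + eps)))


def one_hot(idx, num_bins=NUM_BINS):
    """Hypothetical stand-in for an estimator output peaked at one bin."""
    hist = np.zeros(num_bins)
    hist[idx] = 1.0
    return hist


# Patch B is patch A rotated by 40 degrees, i.e. 4 bins of 10 degrees each.
hist_a = one_hot(3)        # prediction for patch A: peak at bin 3
hist_b_good = one_hot(7)   # consistent with the applied rotation
hist_b_bad = one_hot(20)   # inconsistent with the applied rotation

loss_good = alignment_loss(hist_a, hist_b_good, 40.0)
loss_bad = alignment_loss(hist_a, hist_b_bad, 40.0)
print(loss_good < loss_bad)
```

Orientation is periodic, so a circular shift is a natural fit here. Scale is not periodic, so a scale histogram would need a different, non-wrapping treatment that this sketch does not cover.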

research
04/19/2022

Self-Supervised Equivariant Learning for Oriented Keypoint Detection

Detecting robust keypoints from an image is an integral part of many com...
research
11/21/2022

Deep Projective Rotation Estimation through Relative Supervision

Orientation estimation is the core to a variety of vision and robotics t...
research
03/14/2023

PATS: Patch Area Transportation with Subdivision for Local Feature Matching

Local feature matching aims at establishing sparse correspondences betwe...
research
08/23/2019

Sequential Adversarial Learning for Self-Supervised Deep Visual Odometry

We propose a self-supervised learning framework for visual odometry (VO)...
research
03/29/2022

PoseTriplet: Co-evolving 3D Human Pose Estimation, Imitation, and Hallucination under Self-supervision

Existing self-supervised 3D human pose estimation schemes have largely r...
research
12/06/2020

Temporal-Aware Self-Supervised Learning for 3D Hand Pose and Mesh Estimation in Videos

Estimating 3D hand pose directly from RGB images is challenging but has g...
research
01/15/2021

Catching Out-of-Context Misinformation with Self-supervised Learning

Despite the recent attention to DeepFakes and other forms of image manip...
