UR2KiD: Unifying Retrieval, Keypoint Detection, and Keypoint Description without Local Correspondence Supervision

01/20/2020
by   Tsun-Yi Yang, et al.
19

In this paper, we explore how three related tasks, namely keypoint detection, description, and image retrieval can be jointly tackled using a single unified framework, which is trained without the need of training data with point to point correspondences. By leveraging diverse information from sequential layers of a standard ResNet-based architecture, we are able to extract keypoints and descriptors that encode local information using generic techniques such as local activation norms, channel grouping and dropping, and self-distillation. Subsequently, global information for image retrieval is encoded in an end-to-end pipeline, based on pooling of the aforementioned local responses. In contrast to previous methods in local matching, our method does not depend on pointwise/pixelwise correspondences, and requires no such supervision at all i.e. no depth-maps from an SfM model nor manually created synthetic affine transformations. We illustrate that this simple and direct paradigm, is able to achieve very competitive results against the state-of-the-art methods in various challenging benchmark conditions such as viewpoint changes, scale changes, and day-night shifting localization.

READ FULL TEXT

page 1

page 4

page 6

page 7

research
06/14/2019

R2D2: Repeatable and Reliable Detector and Descriptor

Interest point detection and local feature description are fundamental s...
research
06/14/2019

R2D2: Reliable and Repeatable Detectors and Descriptors for Joint Sparse Keypoint Detection and Local Feature Extraction

Interest point detection and local feature description are fundamental s...
research
10/21/2020

Learning to Guide Local Feature Matches

We tackle the problem of finding accurate and robust keypoint correspond...
research
02/22/2018

End-to-end learning of keypoint detector and descriptor for pose invariant 3D matching

Finding correspondences between images or 3D scans is at the heart of ma...
research
11/07/2016

Texture and Color-based Image Retrieval Using the Local Extrema Features and Riemannian Distance

A novel efficient method for content-based image retrieval (CBIR) is dev...
research
01/11/2021

Investigating the Vision Transformer Model for Image Retrieval Tasks

This paper introduces a plug-and-play descriptor that can be effectively...
research
07/18/2014

Affine Subspace Representation for Feature Description

This paper proposes a novel Affine Subspace Representation (ASR) descrip...

Please sign up or login with your details

Forgot password? Click here to reset