STIR: Siamese Transformer for Image Retrieval Postprocessing

04/26/2023
by Aleksei Shabanov, et al.

Current metric learning approaches for image retrieval usually learn a space of informative latent representations in which simple measures such as the cosine distance work well. Recent state-of-the-art methods such as HypViT move to more complex embedding spaces that may yield better results but are harder to scale to production environments. In this work, we first construct a simpler model based on triplet loss with hard negative mining that performs at the state-of-the-art level but does not have these drawbacks. Second, we introduce a novel approach for image retrieval postprocessing called Siamese Transformer for Image Retrieval (STIR) that reranks several top outputs in a single forward pass. Unlike previously proposed Reranking Transformers, STIR does not rely on global/local feature extraction and directly compares a query image and a retrieved candidate at the pixel level using an attention mechanism. The resulting approach defines a new state of the art on standard image retrieval datasets: Stanford Online Products and DeepFashion In-shop. We also release the source code at https://github.com/OML-Team/open-metric-learning/tree/main/pipelines/postprocessing/ and an interactive demo of our approach at https://dapladoc-oml-postprocessing-demo-srcappmain-pfh2g0.streamlit.app/
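The two ideas in the abstract (triplet loss with hard mining, then reranking of the top retrieved candidates) can be illustrated with a small sketch. This is not the authors' implementation: it uses plain Python lists in place of tensors, a batch-hard mining variant chosen for illustration, and a hypothetical `pair_distance` callable standing in for the Siamese transformer that scores a (query, candidate) image pair. The names `batch_hard_triplet_loss`, `retrieve_and_rerank`, `margin`, and `top_n` are assumptions made for this example.

```python
import math
from typing import Callable, List, Sequence

Vector = Sequence[float]


def cosine_distance(a: Vector, b: Vector) -> float:
    """1 - cosine similarity; assumes both vectors are non-zero."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)


def batch_hard_triplet_loss(
    embeddings: Sequence[Vector], labels: Sequence[int], margin: float = 0.1
) -> float:
    """Illustrative batch-hard triplet loss (an assumed mining variant).

    For each anchor, take the farthest same-label sample (hardest positive)
    and the closest other-label sample (hardest negative).
    """
    losses: List[float] = []
    for i, (anchor, label) in enumerate(zip(embeddings, labels)):
        positives = [
            cosine_distance(anchor, e)
            for j, (e, l) in enumerate(zip(embeddings, labels))
            if l == label and j != i
        ]
        negatives = [
            cosine_distance(anchor, e)
            for e, l in zip(embeddings, labels)
            if l != label
        ]
        if not positives or not negatives:
            continue  # no valid triplet for this anchor
        losses.append(max(0.0, max(positives) - min(negatives) + margin))
    return sum(losses) / len(losses) if losses else 0.0


def retrieve_and_rerank(
    query_emb: Vector,
    gallery_embs: Sequence[Vector],
    pair_distance: Callable[[int], float],
    top_n: int = 3,
) -> List[int]:
    """Rank the gallery by cosine distance, then reorder only the top_n items.

    pair_distance(i) is a hypothetical stand-in for a pairwise model that
    looks at the query image and gallery image i together.
    """
    ranked = sorted(
        range(len(gallery_embs)),
        key=lambda i: cosine_distance(query_emb, gallery_embs[i]),
    )
    head, tail = ranked[:top_n], ranked[top_n:]
    return sorted(head, key=pair_distance) + tail
```

In a real system the `top_n` (query, candidate) pairs would be stacked into one batch so that the pairwise model scores them in a single forward pass, as the abstract describes; the per-index callable here only keeps the sketch short.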


