A Survey on Learning to Hash

06/01/2016
by Jingdong Wang, et al.

Nearest neighbor search is the problem of finding the data points in a database whose distances to a query point are the smallest. Learning to hash is one of the major solutions to this problem and has been widely studied recently. In this paper, we present a comprehensive survey of learning to hash algorithms, categorize them according to how they preserve similarities into pairwise similarity preserving, multiwise similarity preserving, implicit similarity preserving, and quantization, and discuss their relations. We treat quantization separately from pairwise similarity preserving because its objective function is very different, even though quantization, as we show, can be derived from preserving the pairwise similarities. In addition, we present the evaluation protocols and a general performance analysis, and point out that the quantization algorithms achieve superior performance in terms of search accuracy, search time cost, and space cost. Finally, we introduce a few emerging topics.
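
To make the problem statement concrete, the following minimal sketch contrasts exact nearest neighbor search with search over compact binary codes compared by Hamming distance. It is an illustration under stated assumptions, not a method from the survey: the hash function uses random hyperplanes rather than a learned one (and so tracks angular rather than Euclidean similarity), and the names `exact_nearest`, `make_hash`, `hamming`, and `hash_nearest`, as well as the toy data, are hypothetical.

```python
import random


def exact_nearest(database, query):
    """Return the index of the database point closest to the query
    under squared Euclidean distance (a linear scan)."""
    def sq_dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))
    return min(range(len(database)), key=lambda i: sq_dist(database[i], query))


def make_hash(dim, n_bits, seed=0):
    """Build a random-hyperplane hash function (an assumption for this
    sketch; NOT learned). Each bit is the sign of one random projection."""
    rng = random.Random(seed)
    planes = [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(n_bits)]

    def encode(x):
        code = 0
        for w in planes:
            bit = 1 if sum(wi * xi for wi, xi in zip(w, x)) >= 0 else 0
            code = (code << 1) | bit  # pack the bits into one integer
        return code

    return encode


def hamming(a, b):
    """Number of differing bits between two integer-packed codes."""
    return bin(a ^ b).count("1")


def hash_nearest(codes, query_code):
    """Return the index of the code with the smallest Hamming distance."""
    return min(range(len(codes)), key=lambda i: hamming(codes[i], query_code))


# Toy data (hypothetical): four points on the unit circle and one query.
database = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0], [0.0, -1.0]]
query = [0.9, 0.1]

encode = make_hash(dim=2, n_bits=16)
codes = [encode(p) for p in database]  # computed once, offline

exact_idx = exact_nearest(database, query)
approx_idx = hash_nearest(codes, encode(query))
print("exact:", exact_idx, "hash-based:", approx_idx)
```

The hash-based result is approximate and may differ from the exact one. Learning to hash replaces the random projections in `make_hash` with a function fitted to the data so that distances between codes better preserve the original similarities.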

research
08/13/2014

Hashing for Similarity Search: A Survey

Similarity search (nearest neighbor search) is a problem of pursuing the...
research
04/18/2019

Query-Adaptive Hash Code Ranking for Large-Scale Multi-View Visual Search

Hash based nearest neighbor search has become attractive in many applica...
research
02/10/2021

Leveraging Reinforcement Learning for evaluating Robustness of KNN Search Algorithms

The problem of finding K-nearest neighbors in the given dataset for a gi...
research
11/07/2011

Multimodal diff-hash

Many applications require comparing multimodal data with different struc...
research
02/06/2017

Learning similarity preserving representations with neural similarity encoders

Many dimensionality reduction or manifold learning algorithms optimize f...
research
09/18/2021

When Similarity Digest Meets Vector Management System: A Survey on Similarity Hash Function

The booming vector manage system calls for feasible similarity hash func...
research
11/02/2019

ProbMinHash – A Class of Locality-Sensitive Hash Algorithms for the (Probability) Jaccard Similarity

The probability Jaccard similarity was recently proposed as a natural ge...
