Candidate Generation with Binary Codes for Large-Scale Top-N Recommendation

09/12/2019
by   Wang-Cheng Kang, et al.
0

Generating the Top-N recommendations from a large corpus is computationally expensive to perform at scale. Candidate generation and re-ranking based approaches are often adopted in industrial settings to alleviate efficiency problems. However it remains to be fully studied how well such schemes approximate complete rankings (or how many candidates are required to achieve a good approximation), or to develop systematic approaches to generate high-quality candidates efficiently. In this paper, we seek to investigate these questions via proposing a candidate generation and re-ranking based framework (CIGAR), which first learns a preference-preserving binary embedding for building a hash table to retrieve candidates, and then learns to re-rank the candidates using real-valued ranking models with a candidate-oriented objective. We perform a comprehensive study on several large-scale real-world datasets consisting of millions of users/items and hundreds of millions of interactions. Our results show that CIGAR significantly boosts the Top-N accuracy against state-of-the-art recommendation models, while reducing the query time by orders of magnitude. We hope that this work could draw more attention to the candidate generation problem in recommender systems.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/13/2021

Lessons Learned Addressing Dataset Bias in Model-Based Candidate Generation at Twitter

Traditionally, heuristic methods are used to generate candidates for lar...
research
05/12/2022

kNN-Embed: Locally Smoothed Embedding Mixtures For Multi-interest Candidate Retrieval

Candidate generation is the first stage in recommendation systems, where...
research
02/27/2023

TwERC: High Performance Ensembled Candidate Generation for Ads Recommendation at Twitter

Recommendation systems are a core feature of social media companies with...
research
06/24/2019

Query-based Interactive Recommendation by Meta-Path and Adapted Attention-GRU

Recently, interactive recommender systems are becoming increasingly popu...
research
02/19/2019

Joint Optimization of Tree-based Index and Deep Model for Recommender Systems

Large-scale industrial recommender systems are usually confronted with c...
research
07/20/2021

Paraphrasing via Ranking Many Candidates

We present a simple and effective way to generate a variety of paraphras...
research
07/12/2020

Deep Retrieval: An End-to-End Learnable Structure Model for Large-Scale Recommendations

One of the core problems in large-scale recommendations is to retrieve t...

Please sign up or login with your details

Forgot password? Click here to reset