i-Razor: A Neural Input Razor for Feature Selection and Dimension Search in Large-Scale Recommender Systems

04/01/2022
by   Yao Yao, et al.
0

Input features play a crucial role in the predictive performance of DNN-based industrial recommender systems with thousands of categorical and continuous fields from users, items, contexts, and their interactions. Noisy features and inappropriate embedding dimension assignments can impair the performance of recommender systems and introduce unnecessary complexity in model training and online serving. Optimizing the input configuration of DNN models, including feature selection and embedding dimension assignment, has become one of the essential topics in feature engineering. Typically, feature selection and embedding dimension search are optimized sequentially, i.e., feature selection is performed first, followed by embedding dimension search to determine the optimal dimension size for each selected feature. In contrast, this paper studies the joint optimization of feature selection and embedding dimension search. To this end, we propose a differentiable neural input razor, namely i-Razor. Specifically, inspired by recent advances in neural architecture search, we introduce an end-to-end differentiable model to learn the relative importance between different embedding regions of each feature. Furthermore, a flexible pruning algorithm is proposed to simultaneously achieve feature filtering and dimension size derivation. Extensive experiments on two large-scale public datasets in the Click-Through-Rate (CTR) prediction task demonstrate the efficacy and superiority of i-Razor in balancing model complexity and performance.

READ FULL TEXT
research
06/26/2020

Memory-efficient Embedding for Recommendations

Practical large-scale recommender systems usually contain thousands of f...
research
04/19/2022

AutoField: Automating Feature Selection in Deep Recommender Systems

Feature quality has an impactful effect on recommendation performance. T...
research
12/16/2020

AutoDis: Automatic Discretization for Embedding Numerical Features in CTR Prediction

Learning sophisticated feature interactions is crucial for Click-Through...
research
09/14/2023

iHAS: Instance-wise Hierarchical Architecture Search for Deep Learning Recommendation Models

Current recommender systems employ large-sized embedding tables with uni...
research
02/26/2023

Data-Centric AI: Deep Generative Differentiable Feature Selection via Discrete Subsetting as Continuous Embedding Space Optimization

Feature Selection (FS), such as filter, wrapper, and embedded methods, a...
research
04/07/2022

Single-shot Embedding Dimension Search in Recommender System

As a crucial component of most modern deep recommender systems, feature ...
research
06/28/2022

Meta-Wrapper: Differentiable Wrapping Operator for User Interest Selection in CTR Prediction

Click-through rate (CTR) prediction, whose goal is to predict the probab...

Please sign up or login with your details

Forgot password? Click here to reset