Exploring the Upper Limits of Text-Based Collaborative Filtering Using Large Language Models: Discoveries and Insights

05/19/2023
by   Ruyu Li, et al.
0

Text-based collaborative filtering (TCF) has become the mainstream approach for text and news recommendation, utilizing text encoders, also known as language models (LMs), to represent items. However, existing TCF models primarily focus on using small or medium-sized LMs. It remains uncertain what impact replacing the item encoder with one of the largest and most powerful LMs, such as the 175-billion parameter GPT-3 model, would have on recommendation performance. Can we expect unprecedented results? To this end, we conduct an extensive series of experiments aimed at exploring the performance limits of the TCF paradigm. Specifically, we increase the size of item encoders from one hundred million to one hundred billion to reveal the scaling limits of the TCF paradigm. We then examine whether these extremely large LMs could enable a universal item representation for the recommendation task. Furthermore, we compare the performance of the TCF paradigm utilizing the most powerful LMs to the currently dominant ID embedding-based paradigm and investigate the transferability of this TCF paradigm. Finally, we compare TCF with the recently popularized prompt-based recommendation using ChatGPT. Our research findings have not only yielded positive results but also uncovered some surprising and previously unknown negative outcomes, which can inspire deeper reflection and innovative thinking regarding text-based recommender systems. Codes and datasets will be released for further research.

READ FULL TEXT

page 6

page 18

page 19

page 20

page 21

research
09/24/2019

Quantitative analysis of Matthew effect and sparsity problem of recommender systems

Recommender systems have received great commercial success. Recommendati...
research
07/26/2021

Hierarchical Latent Relation Modeling for Collaborative Metric Learning

Collaborative Metric Learning (CML) recently emerged as a powerful parad...
research
03/24/2023

Where to Go Next for Recommender Systems? ID- vs. Modality-based Recommender Models Revisited

Recommendation models that utilize unique identities (IDs) to represent ...
research
04/26/2022

Hypergraph Contrastive Collaborative Filtering

Collaborative Filtering (CF) has emerged as fundamental paradigms for pa...
research
06/26/2022

Towards Representation Alignment and Uniformity in Collaborative Filtering

Collaborative filtering (CF) plays a critical role in the development of...
research
08/27/2023

Only Encode Once: Making Content-based News Recommender Greener

Large pretrained language models (PLM) have become de facto news encoder...
research
07/19/2022

HICF: Hyperbolic Informative Collaborative Filtering

Considering the prevalence of the power-law distribution in user-item ne...

Please sign up or login with your details

Forgot password? Click here to reset