PatternRank: Leveraging Pretrained Language Models and Part of Speech for Unsupervised Keyphrase Extraction

10/11/2022
by Tim Schopf, et al.

Keyphrase extraction is the process of automatically selecting a small set of the most relevant phrases from a given text. Supervised keyphrase extraction approaches need large amounts of labeled training data and perform poorly outside the domain of that training data. In this paper, we present PatternRank, which leverages pretrained language models and part-of-speech information for unsupervised keyphrase extraction from single documents. Our experiments show that PatternRank achieves higher precision, recall, and F1-scores than previous state-of-the-art approaches. In addition, we present the KeyphraseVectorizers package, which allows easy modification of part-of-speech patterns for candidate keyphrase selection, and hence adaptation of our approach to any domain.
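The two ideas named in the abstract, selecting candidates with a part-of-speech pattern and then ranking them against the document, can be illustrated with a minimal, self-contained sketch. It is not the authors' implementation and does not use the KeyphraseVectorizers API. Everything in it is an assumption made for illustration: the toy document is tagged by hand, the pattern "zero or more adjectives followed by one or more nouns" is only an example pattern, and a bag-of-words cosine similarity stands in for the pretrained language model embeddings.

```python
import math
import re
from collections import Counter

# Hypothetical toy document, tagged by hand with coarse POS tags.
# A real pipeline would obtain these tags from a POS tagger.
TAGGED_DOC = [
    ("Supervised", "ADJ"), ("keyphrase", "NOUN"), ("extraction", "NOUN"),
    ("needs", "VERB"), ("large", "ADJ"), ("amounts", "NOUN"), ("of", "ADP"),
    ("labeled", "ADJ"), ("training", "NOUN"), ("data", "NOUN"), (".", "PUNCT"),
    ("Unsupervised", "ADJ"), ("keyphrase", "NOUN"), ("extraction", "NOUN"),
    ("avoids", "VERB"), ("labeled", "ADJ"), ("data", "NOUN"), (".", "PUNCT"),
]

# Assumed example pattern: zero or more adjectives, then one or more nouns.
# Each tag is encoded as one character so regex offsets equal token offsets.
TAG_CODES = {"ADJ": "J", "NOUN": "N"}
CANDIDATE_PATTERN = re.compile(r"J*N+")


def extract_candidates(tagged_tokens):
    """Return unique lowercase phrases whose tag sequence matches the pattern."""
    encoded = "".join(TAG_CODES.get(tag, "O") for _, tag in tagged_tokens)
    candidates = []
    for match in CANDIDATE_PATTERN.finditer(encoded):
        words = [tok.lower() for tok, _ in tagged_tokens[match.start():match.end()]]
        phrase = " ".join(words)
        if phrase not in candidates:
            candidates.append(phrase)
    return candidates


def cosine(counts_a, counts_b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(counts_a[word] * counts_b[word] for word in counts_a)
    norm_a = math.sqrt(sum(v * v for v in counts_a.values()))
    norm_b = math.sqrt(sum(v * v for v in counts_b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def rank_candidates(candidates, tagged_tokens):
    """Rank candidates by similarity to the whole document, highest first.

    Bag-of-words vectors are a stand-in here; the paper's approach relies on
    pretrained language model embeddings instead.
    """
    doc_vector = Counter(tok.lower() for tok, _ in tagged_tokens)
    scored = [(c, cosine(Counter(c.split()), doc_vector)) for c in candidates]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    for phrase, score in rank_candidates(extract_candidates(TAGGED_DOC), TAGGED_DOC):
        print(f"{score:.3f}  {phrase}")
```

Swapping `CANDIDATE_PATTERN` for a different tag pattern changes which phrases become candidates without touching the ranking step, which is the kind of domain adaptation the abstract attributes to configurable part-of-speech patterns.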

research
01/13/2018

EmbedRank: Unsupervised Keyphrase Extraction using Sentence Embeddings

Keyphrase extraction is the task of automatically selecting a small set ...
research
10/14/2020

Unsupervised Relation Extraction from Language Models using Constrained Cloze Completion

We show that state-of-the-art self-supervised language models can be rea...
research
05/22/2023

Exploring Energy-based Language Models with Different Architectures and Training Methods for Speech Recognition

Energy-based language models (ELMs) parameterize an unnormalized distrib...
research
09/18/2023

LLM4Jobs: Unsupervised occupation extraction and standardization leveraging Large Language Models

Automated occupation extraction and standardization from free-text job p...
research
04/19/2022

Unsupervised Numerical Reasoning to Extract Phenotypes from Clinical Text by Leveraging External Knowledge

Extracting phenotypes from clinical text has been shown to be useful for...
research
04/19/2023

Language Models Enable Simple Systems for Generating Structured Views of Heterogeneous Data Lakes

A long standing goal of the data management community is to develop gene...
research
07/26/2019

Supervised and unsupervised neural approaches to text readability

We present a set of novel neural supervised and unsupervised approaches ...