Distributional Term Set Expansion

02/14/2018
by   Amaru Cuba Gyllensten, et al.
0

This paper is a short empirical study of the performance of centrality and classification based iterative term set expansion methods for distributional semantic models. Iterative term set expansion is an interactive process using distributional semantics models where a user labels terms as belonging to some sought after term set, and a system uses this labeling to supply the user with new, candidate, terms to label, trying to maximize the number of positive examples found. While centrality based methods have a long history in term set expansion, we compare them to classification methods based on the the Simple Margin method, an Active Learning approach to classification using Support Vector Machines. Examining the performance of various centrality and classification based methods for a variety of distributional models over five different term sets, we can show that active learning based methods consistently outperform centrality based methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/08/2018

Hearst Patterns Revisited: Automatic Hypernym Detection from Large Text Corpora

Methods for unsupervised hypernym detection may broadly be categorized a...
research
03/30/2021

Active Learning for Deep Object Detection via Probabilistic Modeling

Active learning aims to reduce labeling costs by selecting only the most...
research
05/03/2020

A Two-Stage Masked LM Method for Term Set Expansion

We tackle the task of Term Set Expansion (TSE): given a small seed set o...
research
10/11/2018

Predicting the Expansion of Concrete Exposed to Sulfate Attack with a Regression Model Based on a Performance Classification

This paper mainly describes the development of a new type of regression ...
research
10/11/2018

Regression Model for Predicting Expansion of Concrete Exposed to Sulfate Attack Based on Performance-based Classification

This paper mainly described development of a new kind of regression mode...
research
08/25/2017

Active Expansion Sampling for Learning Feasible Domains in an Unbounded Input Space

Many engineering problems require identifying feasible domains under imp...
research
08/06/2021

Analysis of Driving Scenario Trajectories with Active Learning

Annotating the driving scenario trajectories based only on explicit rule...

Please sign up or login with your details

Forgot password? Click here to reset