User-Guided Aspect Classification for Domain-Specific Texts

04/30/2020
by   Peiran Li, et al.
0

Aspect classification, identifying aspects of text segments, facilitates numerous applications, such as sentiment analysis and review summarization. To alleviate the human effort on annotating massive texts, in this paper, we study the problem of classifying aspects based on only a few user-provided seed words for pre-defined aspects. The major challenge lies in how to handle the noisy misc aspect, which is designed for texts without any pre-defined aspects. Even domain experts have difficulties to nominate seed words for the misc aspect, making existing seed-driven text classification methods not applicable. We propose a novel framework, ARYA, which enables mutual enhancements between pre-defined aspects and the misc aspect via iterative classifier training and seed updating. Specifically, it trains a classifier for pre-defined aspects and then leverages it to induce the supervision for the misc aspect. The prediction results of the misc aspect are later utilized to filter out noisy seed words for pre-defined aspects. Experiments in two domains demonstrate the superior performance of our proposed framework, as well as the necessity and importance of properly modeling the misc aspect.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/01/2019

Leveraging Just a Few Keywords for Fine-Grained Aspect Detection Through Weakly Supervised Co-Training

User-generated reviews can be decomposed into fine-grained segments (e.g...
research
05/22/2023

A Benchmark on Extremely Weakly Supervised Text Classification: Reconcile Seed Matching and Prompting Approaches

Etremely Weakly Supervised Text Classification (XWS-TC) refers to text c...
research
10/14/2020

Summarizing Text on Any Aspects: A Knowledge-Informed Weakly-Supervised Approach

Given a document and a target aspect (e.g., a topic of interest), aspect...
research
12/16/2021

Hyperbolic Disentangled Representation for Fine-Grained Aspect Extraction

Automatic identification of salient aspects from user reviews is especia...
research
09/13/2017

Method for Aspect-Based Sentiment Annotation Using Rhetorical Analysis

This paper fills a gap in aspect-based sentiment analysis and aims to pr...
research
05/24/2023

Debiasing Made State-of-the-art: Revisiting the Simple Seed-based Weak Supervision for Text Classification

Recent advances in weakly supervised text classification mostly focus on...
research
06/03/2017

Task-specific Word Identification from Short Texts Using a Convolutional Neural Network

Task-specific word identification aims to choose the task-related words ...

Please sign up or login with your details

Forgot password? Click here to reset