Dataset Construction via Attention for Aspect Term Extraction with Distant Supervision

Aspect Term Extraction (ATE) detects opinionated aspect terms in sentences or text spans, with the end goal of performing aspect-based sentiment analysis. The small amount of available datasets for supervised ATE and the fact that they cover only a few domains raise the need for exploiting other data sources in new and creative ways. Publicly available review corpora contain a plethora of opinionated aspect terms and cover a larger domain spectrum. In this paper, we first propose a method for using such review corpora for creating a new dataset for ATE. Our method relies on an attention mechanism to select sentences that have a high likelihood of containing actual opinionated aspects. We thus improve the quality of the extracted aspects. We then use the constructed dataset to train a model and perform ATE with distant supervision. By evaluating on human annotated datasets, we prove that our method achieves a significantly improved performance over various unsupervised and supervised baselines. Finally, we prove that sentence selection matters when it comes to creating new datasets for ATE. Specifically, we show that, using a set of selected sentences leads to higher ATE performance compared to using the whole sentence set.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/01/2020

Transformer-based Multi-Aspect Modeling for Multi-Aspect Multi-Sentiment Analysis

Aspect-based sentiment analysis (ABSA) aims at analyzing the sentiment o...
research
06/17/2020

Improving unsupervised neural aspect extraction for online discussions using out-of-domain classification

Deep learning architectures based on self-attention have recently achiev...
research
12/23/2019

Hunt Protagonist of Sentiment: Sentiment Analysis via Capsule Network with Sentiment-Aspect Reconstruction

Aspect-term level sentiment analysis (ATSA) is a fine-grained task in se...
research
09/15/2017

Unsupervised Aspect Term Extraction with B-LSTM & CRF using Automatically Labelled Datasets

Aspect Term Extraction (ATE) identifies opinionated aspect terms in text...
research
02/20/2020

Aspect Term Extraction using Graph-based Semi-Supervised Learning

Aspect based Sentiment Analysis is a major subarea of sentiment analysis...
research
05/06/2016

Detecting Context Dependence in Exercise Item Candidates Selected from Corpora

We explore the factors influencing the dependence of single sentences on...
research
10/30/2018

Improving Distant Supervision with Maxpooled Attention and Sentence-Level Supervision

We propose an effective multitask learning setup for reducing distant su...

Please sign up or login with your details

Forgot password? Click here to reset