Neural Based Statement Classification for Biased Language

by   Christoph Hube, et al.

Biased language commonly occurs around topics which are of controversial nature, thus, stirring disagreement between the different involved parties of a discussion. This is due to the fact that for language and its use, specifically, the understanding and use of phrases, the stances are cohesive within the particular groups. However, such cohesiveness does not hold across groups. In collaborative environments or environments where impartial language is desired (e.g. Wikipedia, news media), statements and the language therein should represent equally the involved parties and be neutrally phrased. Biased language is introduced through the presence of inflammatory words or phrases, or statements that may be incorrect or one-sided, thus violating such consensus. In this work, we focus on the specific case of phrasing bias, which may be introduced through specific inflammatory words or phrases in a statement. For this purpose, we propose an approach that relies on a recurrent neural networks in order to capture the inter-dependencies between words in a phrase that introduced bias. We perform a thorough experimental evaluation, where we show the advantages of a neural based approach over competitors that rely on word lexicons and other hand-crafted features in detecting biased language. We are able to distinguish biased statements with a precision of P=0.92, thus significantly outperforming baseline models with an improvement of over 30 release the largest corpus of statements annotated for biased language.


page 1

page 2

page 3

page 4


Identification of Biased Terms in News Articles by Comparison of Outlet-specific Word Embeddings

Slanted news coverage, also called media bias, can heavily influence how...

Can We Derive Explicit and Implicit Bias from Corpus?

Language is a popular resource to mine speakers' attitude bias, supposin...

Detecting Media Bias in News Articles using Gaussian Bias Distributions

Media plays an important role in shaping public opinion. Biased media ca...

Euphemistic Phrase Detection by Masked Language Model

It is a well-known approach for fringe groups and organizations to use e...

It's All Relative: Interpretable Models for Scoring Bias in Documents

We propose an interpretable model to score the bias present in web docum...

Modeling the Unigram Distribution

The unigram distribution is the non-contextual probability of finding a ...

On Guiding Visual Attention with Language Specification

While real world challenges typically define visual categories with lang...

Please sign up or login with your details

Forgot password? Click here to reset