GET-AID: Visual Recognition of Human Rights Abuses via Global Emotional Traits

by   Grigorios Kalliatakis, et al.

In the era of social media and big data, the use of visual evidence to document conflict and human rights abuse has become an important element for human rights organizations and advocates. In this paper, we address the task of detecting two types of human rights abuses in challenging, everyday photos: (1) child labour, and (2) displaced populations. We propose a novel model that is driven by a human-centric approach. Our hypothesis is that the emotional state of a person -- how positive or pleasant an emotion is, and the control level of the situation by the person -- are powerful cues for perceiving potential human rights violations. To exploit these cues, our model learns to predict global emotional traits over a given image based on the joint analysis of every detected person and the whole scene. By integrating these predictions with a data-driven convolutional neural network (CNN) classifier, our system efficiently infers potential human rights abuses in a clean, end-to-end system we call GET-AID (from Global Emotional Traits for Abuse IDentification). Extensive experiments are performed to verify our method on the recently introduced subset of Human Rights Archive (HRA) dataset (2 violation categories with the same number of positive and negative samples), where we show quantitatively compelling results. Compared with previous works and the sole use of a CNN classifier, this paper improves the coverage up to 23.73 child labour and 57.21 trained models are available online at


page 1

page 2

page 3

page 4

page 6

page 7

page 9

page 10


DisplaceNet: Recognising Displaced People from Images by Exploiting Dominance Level

Every year millions of men, women and children are forced to leave their...

Exploring object-centric and scene-centric CNN features and their complementarity for human rights violations recognition in images

Identifying potential abuses of human rights through imagery is a novel ...

Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech

Prosodic phrasing is crucial to the naturalness and intelligibility of e...

Using machine learning algorithms to determine the emotional disadaptation of a person by his rhythmogram

In this study we applyed machine-learning algorithms to determine the em...

Bi-modal First Impressions Recognition using Temporally Ordered Deep Audio and Stochastic Visual Features

We propose a novel approach for First Impressions Recognition in terms o...

Beyond Human Parts: Dual Part-Aligned Representations for Person Re-Identification

Person re-identification is a challenging task due to various complex fa...

Learning Graph Representation of Person-specific Cognitive Processes from Audio-visual Behaviours for Automatic Personality Recognition

This approach builds on two following findings in cognitive science: (i)...

Please sign up or login with your details

Forgot password? Click here to reset