Describing Human Aesthetic Perception by Deeply-learned Attributes from Flickr

05/25/2016
by L. Zhang, et al.

Many aesthetic models in computer vision suffer from two shortcomings: 1) the low descriptiveness and interpretability of hand-crafted aesthetic criteria (i.e., they do not indicate region-level aesthetics), and 2) the difficulty of engineering aesthetic features adaptively and automatically for different image sets. To remedy these problems, we develop a deep architecture to learn aesthetically-relevant visual attributes from Flickr, which are localized by multiple textual attributes in a weakly-supervised setting. More specifically, using a bag-of-words (BoW) representation of the frequent Flickr image tags, a sparsity-constrained subspace algorithm discovers a compact set of textual attributes (e.g., landscape and sunset) for each image. Then, a weakly-supervised learning algorithm projects the image-level textual attributes onto the highly-responsive image patches at pixel level. These patches indicate where humans look within appealing regions with respect to each textual attribute, and they are employed to learn the visual attributes. Psychological and anatomical studies have shown that humans perceive visual concepts hierarchically. Hence, we normalize these patches and feed them into a five-layer convolutional neural network (CNN) to mimic the hierarchy in which humans perceive the visual attributes. We apply the learned deep features to image retargeting, aesthetics ranking, and retrieval. Both subjective and objective experimental results demonstrate the competitiveness of our approach.
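
The bag-of-words step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the tag lists, the `min_count` threshold, and the helper names `build_vocabulary` and `bow_vector` are all hypothetical, and the abstract does not say how "frequent" tags are selected. The sketch only shows how per-image tag lists could become fixed-length count vectors, the kind of input a subspace algorithm might then compress into a few textual attributes.

```python
from collections import Counter


def build_vocabulary(tag_lists, min_count=2):
    """Keep tags that appear in at least `min_count` images (assumed rule).

    The result is sorted so that vector positions are stable across runs.
    """
    # Count each tag once per image (document frequency).
    doc_freq = Counter(tag for tags in tag_lists for tag in set(tags))
    return sorted(tag for tag, n in doc_freq.items() if n >= min_count)


def bow_vector(tags, vocabulary):
    """Map one image's tag list to a count vector over the vocabulary."""
    counts = Counter(tags)
    return [counts[tag] for tag in vocabulary]


# Hypothetical tag lists standing in for Flickr image tags.
images = [
    ["sunset", "landscape", "sky"],
    ["sunset", "beach"],
    ["landscape", "mountain", "sky"],
]

vocab = build_vocabulary(images, min_count=2)
vectors = [bow_vector(tags, vocab) for tags in images]

print(vocab)
for vec in vectors:
    print(vec)
```

Rare tags such as `beach` and `mountain` fall out of the vocabulary here, which mirrors the idea of working only with frequent tags; the threshold itself is an illustrative choice.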

research
05/16/2019

Harvesting Information from Captions for Weakly Supervised Semantic Segmentation

Since acquiring pixel-wise annotations for training convolutional neural...
research
03/31/2015

Weakly Supervised Learning of Objects, Attributes and their Associations

When humans describe images they tend to use combinations of nouns and a...
research
12/16/2014

Discovering beautiful attributes for aesthetic image analysis

Aesthetic image analysis is the study and assessment of the aesthetic pr...
research
08/09/2016

End-to-End Localization and Ranking for Relative Attributes

We propose an end-to-end deep convolutional network to simultaneously lo...
research
12/13/2015

Deep Relative Attributes

Visual attributes are great means of describing images or scenes, in a w...
research
04/19/2015

DEEP-CARVING: Discovering Visual Attributes by Carving Deep Neural Nets

Most of the approaches for discovering visual attributes in images deman...
research
07/25/2016

Automatic Attribute Discovery with Neural Activations

How can a machine learn to recognize visual attributes emerging out of o...
