In-sample Contrastive Learning and Consistent Attention for Weakly Supervised Object Localization

09/25/2020
by   Minsong Ki, et al.
0

Weakly supervised object localization (WSOL) aims to localize the target object using only the image-level supervision. Recent methods encourage the model to activate feature maps over the entire object by dropping the most discriminative parts. However, they are likely to induce excessive extension to the backgrounds which leads to over-estimated localization. In this paper, we consider the background as an important cue that guides the feature activation to cover the sophisticated object region and propose contrastive attention loss. The loss promotes similarity between foreground and its dropped version, and, dissimilarity between the dropped version and background. Furthermore, we propose foreground consistency loss that penalizes earlier layers producing noisy attention regarding the later layer as a reference to provide them with a sense of backgroundness. It guides the early layers to activate on objects rather than locally distinctive backgrounds so that their attentions to be similar to the later layer. For better optimizing the above losses, we use the non-local attention blocks to replace channel-pooled attention leading to enhanced attention maps considering the spatial similarity. Last but not least, we propose to drop background regions in addition to the most discriminative region. Our method achieves state-of-theart performance on CUB-200-2011 and ImageNet benchmark datasets regarding top-1 localization accuracy and MaxBoxAccV2, and we provide detailed analysis on our individual components. The code will be publicly available online for reproducibility.

READ FULL TEXT

page 2

page 14

research
04/01/2022

Bridging the Gap between Classification and Localization for Weakly Supervised Object Localization

Weakly supervised object localization aims to find a target object regio...
research
03/09/2020

Dual-attention Guided Dropblock Module for Weakly Supervised Object Localization

In this paper, we propose a dual-attention guided dropblock module, and ...
research
12/01/2021

Background Activation Suppression for Weakly Supervised Object Localization

Weakly supervised object localization (WSOL) aims to localize the object...
research
09/09/2019

Weakly Supervised Localization Using Background Images

Weakly Supervised Object Localization (WSOL) methodsusually rely on full...
research
10/12/2019

Combinational Class Activation Maps for Weakly Supervised Object Localization

Weakly supervised object localization has recently attracted attention s...
research
08/01/2016

Top-down Neural Attention by Excitation Backprop

We aim to model the top-down attention of a Convolutional Neural Network...
research
09/11/2019

Dual-attention Focused Module for Weakly Supervised Object Localization

The research on recognizing the most discriminative regions provides ref...

Please sign up or login with your details

Forgot password? Click here to reset