Fine-Grained Visual Classification via Simultaneously Learning of Multi-regional Multi-grained Features

01/31/2021
by   Dongliang Chang, et al.
8

Fine-grained visual classification is a challenging task that recognizes the sub-classes belonging to the same meta-class. Large inter-class similarity and intra-class variance is the main challenge of this task. Most exiting methods try to solve this problem by designing complex model structures to explore more minute and discriminative regions. In this paper, we argue that mining multi-regional multi-grained features is precisely the key to this task. Specifically, we introduce a new loss function, termed top-down spatial attention loss (TDSA-Loss), which contains a multi-stage channel constrained module and a top-down spatial attention module. The multi-stage channel constrained module aims to make the feature channels in different stages category-aligned. Meanwhile, the top-down spatial attention module uses the attention map generated by high-level aligned feature channels to make middle-level aligned feature channels to focus on particular regions. Finally, we can obtain multiple discriminative regions on high-level feature channels and obtain multiple more minute regions within these discriminative regions on middle-level feature channels. In summary, we obtain multi-regional multi-grained features. Experimental results over four widely used fine-grained image classification datasets demonstrate the effectiveness of the proposed method. Ablative studies further show the superiority of two modules in the proposed method. Codes are available at: https://github.com/dongliangchang/Top-Down-Spatial-Attention-Loss.

READ FULL TEXT

page 1

page 2

page 5

page 8

research
02/11/2020

The Devil is in the Channels: Mutual-Channel Loss for Fine-Grained Image Classification

Key for solving fine-grained image categorization is finding discriminat...
research
01/24/2021

Grad-CAM guided channel-spatial attention module for fine-grained visual classification

Fine-grained visual classification (FGVC) is becoming an important resea...
research
12/21/2020

Knowledge Transfer Based Fine-grained Visual Classification

Fine-grained visual classification (FGVC) aims to distinguish the sub-cl...
research
12/28/2022

Part-guided Relational Transformers for Fine-grained Visual Recognition

Fine-grained visual recognition is to classify objects with visually sim...
research
11/26/2019

Multi-Task Driven Feature Models for Thermal Infrared Tracking

Existing deep Thermal InfraRed (TIR) trackers usually use the feature mo...
research
07/28/2023

Task-Oriented Channel Attention for Fine-Grained Few-Shot Classification

The difficulty of the fine-grained image classification mainly comes fro...
research
01/05/2020

Spatial-Scale Aligned Network for Fine-Grained Recognition

Existing approaches for fine-grained visual recognition focus on learnin...

Please sign up or login with your details

Forgot password? Click here to reset