DeepAI AI Chat
Log In Sign Up

Counting with Adaptive Auxiliary Learning

by   Yanda Meng, et al.

This paper proposes an adaptive auxiliary task learning based approach for object counting problems. Unlike existing auxiliary task learning based methods, we develop an attention-enhanced adaptively shared backbone network to enable both task-shared and task-tailored features learning in an end-to-end manner. The network seamlessly combines standard Convolution Neural Network (CNN) and Graph Convolution Network (GCN) for feature extraction and feature reasoning among different domains of tasks. Our approach gains enriched contextual information by iteratively and hierarchically fusing the features across different task branches of the adaptive CNN backbone. The whole framework pays special attention to the objects' spatial locations and varied density levels, informed by object (or crowd) segmentation and density level segmentation auxiliary tasks. In particular, thanks to the proposed dilated contrastive density loss function, our network benefits from individual and regional context supervision in terms of pixel-independent and pixel-dependent feature learning mechanisms, along with strengthened robustness. Experiments on seven challenging multi-domain datasets demonstrate that our method achieves superior performance to the state-of-the-art auxiliary task learning based counting methods. Our code is made publicly available at:


page 1

page 4

page 8

page 9

page 11


Semi-supervised Crowd Counting via Density Agency

In this paper, we propose a new agency-guided semi-supervised counting a...

Spatial Uncertainty-Aware Semi-Supervised Crowd Counting

Semi-supervised approaches for crowd counting attract attention, as the ...

MTCNET: Multi-task Learning Paradigm for Crowd Count Estimation

We propose a Multi-Task Learning (MTL) paradigm based deep neural networ...

Multiscale Crowd Counting and Localization By Multitask Point Supervision

We propose a multitask approach for crowd counting and person localizati...

MTLSegFormer: Multi-task Learning with Transformers for Semantic Segmentation in Precision Agriculture

Multi-task learning has proven to be effective in improving the performa...

Words aren't enough, their order matters: On the Robustness of Grounding Visual Referring Expressions

Visual referring expression recognition is a challenging task that requi...

Medusa: Universal Feature Learning via Attentional Multitasking

Recent approaches to multi-task learning (MTL) have focused on modelling...