Triply Supervised Decoder Networks for Joint Detection and Segmentation

by   Jiale Cao, et al.

Joint object detection and semantic segmentation can be applied to many fields, such as self-driving cars and unmanned surface vessels. An initial and important progress towards this goal has been achieved by simply sharing the deep convolutional features for the two tasks. However, this simple scheme is unable to make full use of the fact that detection and segmentation are mutually beneficial. To overcome this drawback, we propose a framework called TripleNet where triple supervisions including detection-oriented supervision, class-aware segmentation supervision, and class-agnostic segmentation supervision are imposed on each layer of the decoder network. Class-agnostic segmentation supervision provides an objectness prior knowledge for both semantic segmentation and object detection. Besides the three types of supervisions, two light-weight modules (i.e., inner-connected module and attention skip-layer fusion) are also incorporated into each layer of the decoder. In the proposed framework, detection and segmentation can sufficiently boost each other. Moreover, class-agnostic and class-aware segmentation on each decoder layer are not performed at the test stage. Therefore, no extra computational costs are introduced at the test stage. Experimental results on the VOC2007 and VOC2012 datasets demonstrate that the proposed TripleNet is able to improve both the detection and segmentation accuracies without adding extra computational costs.


page 3

page 4

page 6


Single-Shot Object Detection with Enriched Semantics

We propose a novel single shot object detection network named Detection ...

Real-time Joint Object Detection and Semantic Segmentation Network for Automated Driving

Convolutional Neural Networks (CNN) are successfully used for various vi...

Investigating the feature collection for semantic segmentation via single skip connection

Since the study of deep convolutional neural network became prevalent, o...

Semantic Edge Detection with Diverse Deep Supervision

Semantic edge detection (SED), which aims at jointly extracting edges as...

Edge-aware Plug-and-play Scheme for Semantic Segmentation

Semantic segmentation is a classic and fundamental computer vision probl...

Semi-Supervised Segmentation of Concrete Aggregate Using Consensus Regularisation and Prior Guidance

In order to leverage and profit from unlabelled data, semi-supervised fr...

Hadamard Layer to Improve Semantic Segmentation

The Hadamard Layer, a simple and computationally efficient way to improv...

Please sign up or login with your details

Forgot password? Click here to reset