Multi-Semantic Interactive Learning for Object Detection

03/18/2023
by   Shuxin Wang, et al.
0

Single-branch object detection methods use shared features for localization and classification, yet the shared features are not fit for the two different tasks simultaneously. Multi-branch object detection methods usually use different features for localization and classification separately, ignoring the relevance between different tasks. Therefore, we propose multi-semantic interactive learning (MSIL) to mine the semantic relevance between different branches and extract multi-semantic enhanced features of objects. MSIL first performs semantic alignment of regression and classification branches, then merges the features of different branches by semantic fusion, finally extracts relevant information by semantic separation and passes it back to the regression and classification branches respectively. More importantly, MSIL can be integrated into existing object detection nets as a plug-and-play component. Experiments on the MS COCO, and Pascal VOC datasets show that the integration of MSIL with existing algorithms can utilize the relevant information between semantics of different tasks and achieve better performance.

READ FULL TEXT

page 3

page 7

page 8

research
04/03/2018

Exploring Multi-Branch and High-Level Semantic Networks for Improving Pedestrian Detection

To better detect pedestrians of various scales, deep multi-scale methods...
research
07/11/2019

A Survey of Deep Learning-based Object Detection

Object detection is one of the most important and challenging branches o...
research
07/05/2020

Attention-based Joint Detection of Object and Semantic Part

In this paper, we address the problem of joint detection of objects like...
research
08/24/2021

Reconcile Prediction Consistency for Balanced Object Detection

Classification and regression are two pillars of object detectors. In mo...
research
10/15/2021

Receptive Field Broadening and Boosting for Salient Object Detection

Salient object detection requires a comprehensive and scalable receptive...
research
07/06/2018

VLASE: Vehicle Localization by Aggregating Semantic Edges

In this paper, we propose VLASE, a framework to use semantic edge featur...
research
03/18/2019

IvaNet: Learning to jointly detect and segment objets with the help of Local Top-Down Modules

Driven by Convolutional Neural Networks, object detection and semantic s...

Please sign up or login with your details

Forgot password? Click here to reset