IA-MOT: Instance-Aware Multi-Object Tracking with Motion Consistency

06/24/2020
by   Jiarui Cai, et al.
0

Multiple object tracking (MOT) is a crucial task in computer vision society. However, most tracking-by-detection MOT methods, with available detected bounding boxes, cannot effectively handle static, slow-moving and fast-moving camera scenarios simultaneously due to ego-motion and frequent occlusion. In this work, we propose a novel tracking framework, called "instance-aware MOT" (IA-MOT), that can track multiple objects in either static or moving cameras by jointly considering the instance-level features and object motions. First, robust appearance features are extracted from a variant of Mask R-CNN detector with an additional embedding head, by sending the given detections as the region proposals. Meanwhile, the spatial attention, which focuses on the foreground within the bounding boxes, is generated from the given instance masks and applied to the extracted embedding features. In the tracking stage, object instance masks are aligned by feature similarity and motion consistency using the Hungarian association algorithm. Moreover, object re-identification (ReID) is incorporated to recover ID switches caused by long-term occlusion or missing detection. Overall, when evaluated on the MOTS20 and KITTI-MOTS dataset, our proposed method won the first place in Track 3 of the BMTT Challenge in CVPR2020 workshops.

READ FULL TEXT
research
08/30/2023

Occlusion-Aware Detection and Re-ID Calibrated Network for Multi-Object Tracking

Multi-Object Tracking (MOT) is a crucial computer vision task that aims ...
research
09/30/2019

Track to Reconstruct and Reconstruct to Track

Object tracking and reconstruction are often performed together, with tr...
research
02/05/2018

Tracking Multiple Moving Objects Using Unscented Kalman Filtering Techniques

It is an important task to reliably detect and track multiple moving obj...
research
06/15/2020

3D-ZeF: A 3D Zebrafish Tracking Benchmark Dataset

In this work we present a novel publicly available stereo based 3D RGB d...
research
05/01/2023

Event Camera as Region Proposal Network

The human eye consists of two types of photoreceptors, rods and cones. R...
research
05/22/2023

Type-to-Track: Retrieve Any Object via Prompt-based Tracking

One of the recent trends in vision problems is to use natural language c...
research
08/15/2022

Automatic Controlling Fish Feeding Machine using Feature Extraction of Nutriment and Ripple Behavior

Controlling fish feeding machine is challenging problem because experien...

Please sign up or login with your details

Forgot password? Click here to reset