Video Object Segmentation with Joint Re-identification and Attention-Aware Mask Propagation

03/12/2018
by Xiaoxiao Li, et al.

Video object segmentation becomes extremely challenging when multiple instances co-exist. Each instance may exhibit large scale and pose variations, and the problem is compounded when instances occlude each other, causing tracking failures. In this study, we formulate a deep recurrent network that segments and tracks objects in video simultaneously by exploiting their temporal continuity, yet can re-identify them when they re-appear after a prolonged occlusion. We combine temporal propagation and re-identification into a single framework that can be trained end-to-end. In particular, we present a re-identification module with template expansion to retrieve missing objects despite large appearance changes. In addition, we contribute a new attention-based recurrent mask propagation approach that is robust to distractors not belonging to the target segment. Our approach achieves a new state-of-the-art global mean (of Region Jaccard and Boundary F measure) of 68.2 on the challenging DAVIS 2017 benchmark (test-dev set), outperforming the winning solution, which achieves a global mean of 66.1 on the same partition.
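
The global mean quoted in the abstract averages a region score (Jaccard, i.e. intersection-over-union of masks) with a boundary score (F measure). The sketch below illustrates only that arithmetic on toy data; it is not the official DAVIS evaluation code. The function names `region_jaccard` and `global_mean` are hypothetical, the masks are made up for illustration, the boundary F measure is not implemented and is passed in as a plain number, and treating two empty masks as a perfect match is an assumed convention.

```python
from typing import Sequence

# Binary mask as rows of 0/1 values, where 1 marks an object pixel.
Mask = Sequence[Sequence[int]]


def region_jaccard(pred: Mask, gt: Mask) -> float:
    """Intersection-over-union between two binary masks of equal shape."""
    inter = 0
    union = 0
    for pred_row, gt_row in zip(pred, gt):
        for p, g in zip(pred_row, gt_row):
            inter += 1 if (p and g) else 0
            union += 1 if (p or g) else 0
    # Assumed convention: two empty masks count as a perfect match.
    return 1.0 if union == 0 else inter / union


def global_mean(j_mean: float, f_mean: float) -> float:
    """Average of a region (J) mean score and a boundary (F) mean score."""
    return (j_mean + f_mean) / 2.0


# Toy masks (illustrative only, not benchmark data).
pred = [[1, 1, 0],
        [0, 1, 0]]
gt = [[1, 0, 0],
      [0, 1, 1]]

j = region_jaccard(pred, gt)  # 2 shared pixels out of 4 in the union
print(j)
```

In a full evaluation, the region score would be averaged over objects and frames before being combined with the boundary score; the abstract reports only the combined figure, so no per-measure values are assumed here.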

research
08/01/2017

Video Object Segmentation with Re-identification

Conventional video segmentation methods often rely on temporal continuit...
research
05/24/2019

OVSNet : Towards One-Pass Real-Time Video Object Segmentation

Video object segmentation aims at accurately segmenting the target objec...
research
09/30/2019

LIP: Learning Instance Propagation for Video Object Segmentation

In recent years, the task of segmenting foreground objects from backgrou...
research
06/22/2021

Prototypical Cross-Attention Networks for Multiple Object Tracking and Segmentation

Multiple object tracking and segmentation requires detecting, tracking, ...
research
12/16/2021

Slot-VPS: Object-centric Representation Learning for Video Panoptic Segmentation

Video Panoptic Segmentation (VPS) aims at assigning a class label to eac...
research
07/05/2021

A topological solution to object segmentation and tracking

The world is composed of objects, the ground, and the sky. Visual percep...
research
04/17/2019

MHP-VOS: Multiple Hypotheses Propagation for Video Object Segmentation

We address the problem of semi-supervised video object segmentation (VOS...
