Consistent Video Instance Segmentation with Inter-Frame Recurrent Attention

06/14/2022
by   Quanzeng You, et al.
2

Video instance segmentation aims at predicting object segmentation masks for each frame, as well as associating the instances across multiple frames. Recent end-to-end video instance segmentation methods are capable of performing object segmentation and instance association together in a direct parallel sequence decoding/prediction framework. Although these methods generally predict higher quality object segmentation masks, they can fail to associate instances in challenging cases because they do not explicitly model the temporal instance consistency for adjacent frames. We propose a consistent end-to-end video instance segmentation framework with Inter-Frame Recurrent Attention to model both the temporal instance consistency for adjacent frames and the global temporal context. Our extensive experiments demonstrate that the Inter-Frame Recurrent Attention significantly improves temporal instance consistency while maintaining the quality of the object segmentation masks. Our model achieves state-of-the-art accuracy on both YouTubeVIS-2019 (62.1%) and YouTubeVIS-2021 (54.7%) datasets. In addition, quantitative and qualitative results show that the proposed methods predict more temporally consistent instance segmentation masks.

READ FULL TEXT

page 1

page 4

page 9

research
06/07/2023

RefineVIS: Video Instance Segmentation with Temporal Attention Refinement

We introduce a novel framework called RefineVIS for Video Instance Segme...
research
11/30/2020

End-to-End Video Instance Segmentation with Transformers

Video instance segmentation (VIS) is the task that requires simultaneous...
research
06/12/2021

1st Place Solution for YouTubeVOS Challenge 2021:Video Instance Segmentation

Video Instance Segmentation (VIS) is a multi-task problem performing det...
research
12/19/2018

Unsupervised Video Object Segmentation with Distractor-Aware Online Adaptation

Unsupervised video object segmentation is a crucial application in video...
research
12/05/2019

PolyTransform: Deep Polygon Transformer for Instance Segmentation

In this paper, we propose PolyTransform, a novel instance segmentation a...
research
02/01/2021

Consistent Recurrent Neural Networks for 3D Neuron Segmentation

We present a recurrent network for the 3D reconstruction of neurons that...
research
03/02/2022

Hybrid Tracker with Pixel and Instance for Video Panoptic Segmentation

Video Panoptic Segmentation (VPS) requires generating consistent panopti...

Please sign up or login with your details

Forgot password? Click here to reset