InstMove: Instance Motion for Object-centric Video Segmentation

by   Qihao Liu, et al.

Despite significant efforts, cutting-edge video segmentation methods still remain sensitive to occlusion and rapid movement, due to their reliance on the appearance of objects in the form of object embeddings, which are vulnerable to these disturbances. A common solution is to use optical flow to provide motion information, but essentially it only considers pixel-level motion, which still relies on appearance similarity and hence is often inaccurate under occlusion and fast movement. In this work, we study the instance-level motion and present InstMove, which stands for Instance Motion for Object-centric Video Segmentation. In comparison to pixel-wise motion, InstMove mainly relies on instance-level motion information that is free from image feature embeddings, and features physical interpretations, making it more accurate and robust toward occlusion and fast-moving objects. To better fit in with the video segmentation tasks, InstMove uses instance masks to model the physical presence of an object and learns the dynamic model through a memory network to predict its position and shape in the next frame. With only a few lines of code, InstMove can be integrated into current SOTA methods for three different video segmentation tasks and boost their performance. Specifically, we improve the previous arts by 1.5 AP on OVIS dataset, which features heavy occlusions, and 4.9 AP on YouTubeVIS-Long dataset, which mainly contains fast-moving objects. These results suggest that instance-level motion is robust and accurate, and hence serving as a powerful solution in complex scenarios for object-centric video segmentation.


page 1

page 4

page 7


Region Aware Video Object Segmentation with Deep Motion Modeling

Current semi-supervised video object segmentation (VOS) methods usually ...

InstanceMotSeg: Real-time Instance Motion Segmentation for Autonomous Driving

Moving object segmentation is a crucial task for autonomous vehicles as ...

Instance Embedding Transfer to Unsupervised Video Object Segmentation

We propose a method for unsupervised video object segmentation by transf...

Segmenting Moving Objects via an Object-Centric Layered Representation

The objective of this paper is a model that is able to discover, track a...

Spacetime Graph Optimization for Video Object Segmentation

In this paper we address the challenging task of object discovery and se...

Object Instance Identification in Dynamic Environments

We study the problem of identifying object instances in a dynamic enviro...

3D Trajectory Reconstruction of Dynamic Objects Using Planarity Constraints

We present a method to reconstruct the three-dimensional trajectory of a...

Please sign up or login with your details

Forgot password? Click here to reset