XMem++: Production-level Video Segmentation From Few Annotated Frames

07/29/2023
by Maksym Bekuzarov, et al.

Despite advances in user-guided video segmentation, consistently extracting complex objects from highly complex scenes remains a labor-intensive task, especially in production. It is not uncommon for a majority of frames to require annotation. We introduce XMem++, a novel semi-supervised video object segmentation (SSVOS) model that improves existing memory-based models with a permanent memory module. Most existing methods focus on single-frame annotations, whereas our approach effectively handles multiple user-selected frames showing varying appearances of the same object or region. Our method produces highly consistent results while keeping the number of required frame annotations low. We further introduce an iterative, attention-based frame suggestion mechanism that computes the next best frame for annotation. Our method runs in real time and does not require retraining after each user input. We also introduce a new dataset, PUMaVOS, which covers challenging use cases not found in previous benchmarks. We demonstrate state-of-the-art (SOTA) performance on challenging (partial and multi-class) segmentation scenarios as well as on long videos, while requiring significantly fewer frame annotations than any existing method. Project page: https://max810.github.io/xmem2-project-page/
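To make the idea of iteratively suggesting the next frame to annotate more concrete, here is a minimal sketch. It is not the attention-based mechanism of XMem++, whose details the abstract does not give. It swaps in a plain greedy farthest-point heuristic over per-frame feature vectors: each round picks the unannotated frame that is least similar to every frame already chosen. The function names (`cosine`, `suggest_next_frame`, `suggest_frames`) and the toy feature vectors are hypothetical and used only for illustration.

```python
import math


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def suggest_next_frame(features, annotated):
    """Return the index of the unannotated frame least covered by annotated ones.

    Assumption: `features` is a list of per-frame feature vectors and
    `annotated` is a non-empty set of already annotated frame indices.
    Returns None when every frame is already annotated.
    """
    if not annotated:
        raise ValueError("at least one annotated frame is required")
    best_index, best_coverage = None, float("inf")
    for i, feature in enumerate(features):
        if i in annotated:
            continue
        # Coverage = similarity to the closest annotated frame.
        coverage = max(cosine(feature, features[j]) for j in annotated)
        if coverage < best_coverage:
            best_index, best_coverage = i, coverage
    return best_index


def suggest_frames(features, first, k):
    """Greedily build a list of up to k frames to annotate, starting from `first`."""
    chosen = [first]
    while len(chosen) < k:
        nxt = suggest_next_frame(features, set(chosen))
        if nxt is None:  # no unannotated frames left
            break
        chosen.append(nxt)
    return chosen


if __name__ == "__main__":
    # Toy features: frames 0 and 1 look alike, frame 2 is very different.
    toy = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.7, 0.7]]
    print(suggest_frames(toy, first=0, k=3))
```

In this toy run the near-duplicate frame 1 is suggested last, which reflects the general goal of keeping the number of annotations low by avoiding frames that add little new appearance information.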

research
03/28/2019

BubbleNets: Learning to Select the Guidance Frame in Video Object Segmentation by Deep Sorting Frames

Semi-supervised video object segmentation has made significant progress ...
research
12/16/2021

HODOR: High-level Object Descriptors for Object Re-segmentation in Video Learned from Static Images

Existing state-of-the-art methods for Video Object Segmentation (VOS) le...
research
12/08/2016

Learning Video Object Segmentation from Static Images

Inspired by recent advances of deep learning in instance segmentation an...
research
04/21/2021

Guided Interactive Video Object Segmentation Using Reliability-Based Attention Maps

We propose a novel guided interactive segmentation (GIS) algorithm for v...
research
07/16/2020

Interactive Video Object Segmentation Using Global and Local Transfer Modules

An interactive video object segmentation algorithm, which takes scribble...
research
04/28/2022

Streaming Multiscale Deep Equilibrium Models

We present StreamDEQ, a method that infers frame-wise representations on...
research
03/25/2023

Reliability-Hierarchical Memory Network for Scribble-Supervised Video Object Segmentation

This paper aims to solve the video object segmentation (VOS) task in a s...
