WeClick: Weakly-Supervised Video Semantic Segmentation with Click Annotations

07/07/2021
by   Peidong Liu, et al.
0

Compared with tedious per-pixel mask annotating, it is much easier to annotate data by clicks, which costs only several seconds for an image. However, applying clicks to learn video semantic segmentation model has not been explored before. In this work, we propose an effective weakly-supervised video semantic segmentation pipeline with click annotations, called WeClick, for saving laborious annotating effort by segmenting an instance of the semantic class with only a single click. Since detailed semantic information is not captured by clicks, directly training with click labels leads to poor segmentation predictions. To mitigate this problem, we design a novel memory flow knowledge distillation strategy to exploit temporal information (named memory flow) in abundant unlabeled video frames, by distilling the neighboring predictions to the target frame via estimated motion. Moreover, we adopt vanilla knowledge distillation for model compression. In this case, WeClick learns compact video semantic segmentation models with the low-cost click annotations during the training phase yet achieves real-time and accurate models during the inference period. Experimental results on Cityscapes and Camvid show that WeClick outperforms the state-of-the-art methods, increases performance by 10.24

READ FULL TEXT

page 2

page 4

page 5

research
04/13/2018

Learning to Exploit the Prior Network Knowledge for Weakly-Supervised Semantic Segmentation

Training a Convolutional Neural Network (CNN) for semantic segmentation ...
research
06/09/2019

Movable-Object-Aware Visual SLAM via Weakly Supervised Semantic Segmentation

Moving objects can greatly jeopardize the performance of a visual simult...
research
03/23/2021

Weakly Supervised Instance Segmentation for Videos with Temporal Mask Consistency

Weakly supervised instance segmentation reduces the cost of annotations ...
research
09/20/2019

Weakly Supervised Semantic Segmentation Using Constrained Dominant Sets

The availability of large-scale data sets is an essential pre-requisite ...
research
02/26/2020

Efficient Semantic Video Segmentation with Per-frame Inference

In semantic segmentation, most existing real-time deep models trained wi...
research
09/14/2023

Temporal-aware Hierarchical Mask Classification for Video Semantic Segmentation

Modern approaches have proved the huge potential of addressing semantic ...
research
12/04/2018

Improving Semantic Segmentation via Video Propagation and Label Relaxation

Semantic segmentation requires large amounts of pixel-wise annotations t...

Please sign up or login with your details

Forgot password? Click here to reset