One-Shot Video Inpainting

02/28/2023
by Sangjin Lee, et al.

Removing objects from videos and filling in the erased regions with deep video inpainting (VI) algorithms has recently attracted considerable attention. This task usually requires a video sequence together with object segmentation masks for all frames as input. In real-world applications, however, providing segmentation masks for every frame is difficult and inefficient. We therefore address VI in a one-shot manner, taking only the initial frame's object mask as input. Although this can be achieved by naively combining video object segmentation (VOS) and VI methods, such combinations are sub-optimal and generally cause critical errors. To address this, we propose a unified pipeline for one-shot video inpainting (OSVI). By jointly learning mask prediction and video completion in an end-to-end manner, the results can be optimal for the entire task rather than for each separate module. Additionally, unlike two-stage methods that use the predicted masks as ground-truth cues, our method is more reliable because the predicted masks serve as the network's internal guidance. On the synthesized datasets for OSVI, our proposed method outperforms all others both quantitatively and qualitatively.
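To make the one-shot setting concrete, the following is a minimal toy sketch of the naive two-stage baseline the abstract argues against: a mask-propagation stage followed by an inpainting stage. It is not the authors' OSVI network. The names `propagate_masks`, `inpaint_frame`, and `two_stage_one_shot_inpainting` are hypothetical, and both stages are deliberately trivial stand-ins (the "VOS" stage simply reuses the first-frame mask, and the "VI" stage fills masked pixels with the mean of the unmasked ones), assuming grayscale frames stored as nested lists.

```python
from typing import List

Frame = List[List[float]]  # grayscale frame, H x W
Mask = List[List[int]]     # 1 marks a pixel of the object to remove


def propagate_masks(frames: List[Frame], first_mask: Mask) -> List[Mask]:
    """Toy stand-in for a VOS model (assumption): reuse the first-frame
    mask for every frame. A real VOS network would track the object."""
    return [[row[:] for row in first_mask] for _ in frames]


def inpaint_frame(frame: Frame, mask: Mask) -> Frame:
    """Toy stand-in for a VI model (assumption): replace masked pixels
    with the mean of the unmasked pixels in the same frame."""
    known = [v for f_row, m_row in zip(frame, mask)
             for v, m in zip(f_row, m_row) if m == 0]
    fill = sum(known) / len(known) if known else 0.0
    return [[fill if m else v for v, m in zip(f_row, m_row)]
            for f_row, m_row in zip(frame, mask)]


def two_stage_one_shot_inpainting(frames: List[Frame],
                                  first_mask: Mask) -> List[Frame]:
    """One-shot input: the video plus the object mask of frame 0 only."""
    masks = propagate_masks(frames, first_mask)   # stage 1: predict masks
    # Stage 2 treats the predicted masks as if they were ground truth.
    return [inpaint_frame(f, m) for f, m in zip(frames, masks)]


# Two 1x3 frames; the object (value 9.0) moves one pixel to the right.
frames = [[[0.0, 9.0, 0.0]], [[0.0, 0.0, 9.0]]]
first_mask = [[0, 1, 0]]
result = two_stage_one_shot_inpainting(frames, first_mask)
print(result[0])  # [[0.0, 0.0, 0.0]]  object removed in frame 0
print(result[1])  # [[0.0, 4.5, 9.0]]  stale mask: object survives and leaks into the fill
```

In this toy run the second frame shows the failure mode the abstract describes: once the predicted mask is wrong, the inpainting stage has no way to recover, because it trusts the mask as a ground-truth cue. The joint, end-to-end training proposed in the paper is motivated by exactly this coupling.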

research
08/14/2022

Semi-Supervised Video Inpainting with Cycle Consistency Constraints

Deep learning-based video inpainting has yielded promising results and g...
research
05/13/2023

AURA : Automatic Mask Generator using Randomized Input Sampling for Object Removal

The objective of the image inpainting task is to fill missing regions of...
research
08/15/2021

Occlusion-Aware Video Object Inpainting

Conventional video inpainting is neither object-oriented nor occlusion-a...
research
08/15/2020

Curriculum Learning for Recurrent Video Object Segmentation

Video object segmentation can be understood as a sequence-to-sequence ta...
research
06/01/2022

Differentiable Soft-Masked Attention

Transformers have become prevalent in computer vision due to their perfo...
research
06/22/2018

Video Inpainting by Jointly Learning Temporal Structure and Spatial Details

We present a new data-driven video inpainting method for recovering miss...
research
01/25/2023

Efficient Flow-Guided Multi-frame De-fencing

Taking photographs "in-the-wild" is often hindered by fence obstructions...
