Aggregating Long-term Sharp Features via Hybrid Transformers for Video Deblurring

09/13/2023
by   Dongwei Ren, et al.
0

Video deblurring methods, aiming at recovering consecutive sharp frames from a given blurry video, usually assume that the input video suffers from consecutively blurry frames. However, in real-world blurry videos taken by modern imaging devices, sharp frames usually appear in the given video, thus making temporal long-term sharp features available for facilitating the restoration of a blurry frame. In this work, we propose a video deblurring method that leverages both neighboring frames and present sharp frames using hybrid Transformers for feature aggregation. Specifically, we first train a blur-aware detector to distinguish between sharp and blurry frames. Then, a window-based local Transformer is employed for exploiting features from neighboring frames, where cross attention is beneficial for aggregating features from neighboring frames without explicit spatial alignment. To aggregate long-term sharp features from detected sharp frames, we utilize a global Transformer with multi-scale matching capability. Moreover, our method can easily be extended to event-driven video deblurring by incorporating an event fusion module into the global Transformer. Extensive experiments on benchmark datasets demonstrate that our proposed method outperforms state-of-the-art video deblurring methods as well as event-driven video deblurring methods in terms of quantitative metrics and visual quality. The source code and trained models are available at https://github.com/shangwei5/STGTN.

READ FULL TEXT

page 1

page 9

page 10

page 11

page 12

research
01/28/2022

VRT: A Video Restoration Transformer

Video restoration (e.g., video super-resolution) aims to restore high-qu...
research
03/29/2022

VPTR: Efficient Transformers for Video Prediction

In this paper, we propose a new Transformer block for video future frame...
research
06/05/2022

Recurrent Video Restoration Transformer with Guided Deformable Attention

Video restoration aims at restoring multiple high-quality frames from mu...
research
03/07/2021

ARVo: Learning All-Range Volumetric Correspondence for Video Deblurring

Video deblurring models exploit consecutive frames to remove blurs from ...
research
02/06/2021

CMS-LSTM: Context-Embedding and Multi-Scale Spatiotemporal-Expression LSTM for Video Prediction

Extracting variation and spatiotemporal features via limited frames rema...
research
08/04/2021

Recursive Fusion and Deformable Spatiotemporal Attention for Video Compression Artifact Reduction

A number of deep learning based algorithms have been proposed to recover...
research
03/03/2022

E-CIR: Event-Enhanced Continuous Intensity Recovery

A camera begins to sense light the moment we press the shutter button. D...

Please sign up or login with your details

Forgot password? Click here to reset