Spatiotemporal Knowledge Distillation for Efficient Estimation of Aerial Video Saliency

04/10/2019
by Jia Li, et al.

Video saliency estimation has advanced significantly with the rapid development of Convolutional Neural Networks (CNNs). However, devices such as cameras and drones often have limited computational capability and storage space, which makes direct deployment of complex deep saliency models infeasible. To address this problem, this paper proposes a dynamic saliency estimation approach for aerial videos based on spatiotemporal knowledge distillation. The approach involves five components: two teachers, two students, and the desired spatiotemporal model. Spatial and temporal saliency knowledge is first transferred separately from the two complex and redundant teachers to their simple and compact students, while the input scenes are degraded from high resolution to low resolution to remove probable data redundancy and greatly speed up feature extraction. The desired spatiotemporal model is then trained by distilling and encoding the spatial and temporal saliency knowledge of the two students into a unified network. In this manner, inter-model redundancy is further removed, enabling effective estimation of dynamic saliency on aerial videos. Experimental results show that the proposed approach outperforms ten state-of-the-art models in estimating visual saliency on aerial videos, while its speed reaches up to 28,738 FPS on the GPU platform.
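The two-stage distillation described above can be illustrated with a minimal NumPy sketch. It is not the paper's implementation: the abstract does not state the loss function, the degradation method, or how the two students are combined, so the mean-squared-error loss, the average-pooling downsampler, the weight `alpha`, and all function names here are assumptions chosen only for illustration.

```python
import numpy as np

def downsample(frame, factor):
    """Average-pool a 2D array by an integer factor.

    Stands in for the high-to-low resolution degradation of input
    scenes; the actual degradation method is an assumption.
    """
    h, w = frame.shape
    h2, w2 = h // factor, w // factor
    cropped = frame[:h2 * factor, :w2 * factor]
    return cropped.reshape(h2, factor, w2, factor).mean(axis=(1, 3))

def distill_loss(student_map, teacher_map):
    """Mean squared error between two saliency maps (assumed loss)."""
    return float(np.mean((student_map - teacher_map) ** 2))

def unified_loss(unified_map, spatial_student_map, temporal_student_map, alpha=0.5):
    """Stage 2 (assumed form): pull the unified model's output toward
    both students' maps, weighted by a hypothetical factor alpha."""
    spatial_term = distill_loss(unified_map, spatial_student_map)
    temporal_term = distill_loss(unified_map, temporal_student_map)
    return alpha * spatial_term + (1.0 - alpha) * temporal_term

# Stage 1: a student sees the low-resolution frame and is trained to
# match the teacher's saliency map, resized to the same resolution.
rng = np.random.default_rng(0)
teacher_map = rng.random((8, 8))           # placeholder teacher output
target = downsample(teacher_map, 2)        # 4x4 target for the student
student_map = np.zeros_like(target)        # placeholder student output
stage1 = distill_loss(student_map, target)
```

In an actual training loop these losses would be minimized with respect to the student and unified network parameters; the placeholders above only show how the quantities relate.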


research
11/14/2018

How Drones Look: Crowdsourced Knowledge Transfer for Aerial Video Saliency Prediction

In ground-level platforms, many saliency models have been developed to p...
research
04/09/2019

Ultrafast Video Attention Prediction with Coupled Knowledge Distillation

Large convolutional neural network models have recently demonstrated imp...
research
02/02/2017

Video Salient Object Detection via Fully Convolutional Networks

This paper proposes a deep learning model to efficiently detect salient ...
research
10/20/2020

Fast Video Salient Object Detection via Spatiotemporal Knowledge Distillation

Since the wide employment of deep learning frameworks in video salient o...
research
01/06/2019

Unsupervised uncertainty estimation using spatiotemporal cues in video saliency detection

In this paper, we address the problem of quantifying reliability of comp...
research
02/15/2021

A Gated Fusion Network for Dynamic Saliency Prediction

Predicting saliency in videos is a challenging problem due to complex mo...
research
07/25/2017

Graph-Theoretic Spatiotemporal Context Modeling for Video Saliency Detection

As an important and challenging problem in computer vision, video salien...