Task Oriented Video Coding: A Survey

08/15/2022
by   Daniel Wood, et al.
0

Video coding technology has been continuously improved for higher compression ratio with higher resolution. However, the state-of-the-art video coding standards, such as H.265/HEVC and Versatile Video Coding, are still designed with the assumption the compressed video will be watched by humans. With the tremendous advance and maturation of deep neural networks in solving computer vision tasks, more and more videos are directly analyzed by deep neural networks without humans' involvement. Such a conventional design for video coding standard is not optimal when the compressed video is used by computer vision applications. While the human visual system is consistently sensitive to the content with high contrast, the impact of pixels on computer vision algorithms is driven by specific computer vision tasks. In this paper, we explore and summarize recent progress on computer vision task oriented video coding and emerging video coding standard, Video Coding for Machines.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/04/2022

Scalable Video Coding for Humans and Machines

Video content is watched not only by humans, but increasingly also by ma...
research
03/11/2022

Video Coding for Machines with Feature-Based Rate-Distortion Optimization

Common state-of-the-art video codecs are optimized to deliver a low bitr...
research
12/05/2017

AI Oriented Large-Scale Video Management for Smart City: Technologies, Standards and Beyond

Deep learning has achieved substantial success in a series of tasks in c...
research
01/07/2022

Video Coding for Machines: Partial transmission of SIFT features

The paper deals with Video Coding for Machines that is a new paradigm in...
research
09/27/2017

Fast Convolutional Sparse Coding in the Dual Domain

Convolutional sparse coding (CSC) is an important building block of many...
research
01/07/2021

Learning Grammar of Complex Activities via Deep Neural Networks

Motivated by the growing amount of publicly available video data on onli...
research
11/23/2022

Pruned Lightweight Encoders for Computer Vision

Latency-critical computer vision systems, such as autonomous driving or ...

Please sign up or login with your details

Forgot password? Click here to reset