Fast 2D Convolutions and Cross-Correlations Using Scalable Architectures

12/24/2021
by Cesar Carranza, et al.

The manuscript describes fast and scalable architectures and associated algorithms for computing convolutions and cross-correlations. The basic idea is to map 2D convolutions and cross-correlations to a collection of 1D convolutions and cross-correlations in the transform domain. This is accomplished using the Discrete Periodic Radon Transform (DPRT) for general kernels and SVD-LU decompositions for low-rank kernels. The approach uses scalable architectures that can be fitted into modern FPGA and Zynq-SOC devices. Depending on the types of available resources, 2D convolutions and cross-correlations of P × P blocks can be computed in anywhere from O(P) to O(P^2) clock cycles. Thus, there is a trade-off between performance and the number and types of required resources. We provide implementations of the proposed architectures using modern programmable devices (Virtex-7 and Zynq-SOC). Based on the amounts and types of required resources, we show that the proposed approaches significantly outperform current methods.
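As a rough software illustration of the idea of mapping a 2D convolution to a collection of 1D convolutions in the transform domain, the sketch below uses one common definition of the DPRT for a prime block size P. It is not the hardware architecture from the manuscript. The function names (`dprt`, `idprt`, `circ_conv1d`, `circ_conv2d`, `circ_conv2d_dprt`) are hypothetical, and the sketch computes circular convolution; a linear convolution would additionally require zero-padding both inputs to a sufficiently large prime size.

```python
import numpy as np

def dprt(f):
    """Forward DPRT of a P x P array (P assumed prime): P + 1 projections of length P."""
    P = f.shape[0]
    R = np.zeros((P + 1, P))
    i = np.arange(P)
    for m in range(P):
        for d in range(P):
            # Sum along the periodic line j = (d + m*i) mod P
            R[m, d] = f[i, (d + m * i) % P].sum()
    R[P, :] = f.sum(axis=1)  # extra projection: row sums
    return R

def idprt(R):
    """Inverse DPRT for the definition used in dprt() above."""
    P = R.shape[1]
    S = R[P, :].sum()  # total sum of the original array
    f = np.zeros((P, P))
    m = np.arange(P)
    for i in range(P):
        for j in range(P):
            f[i, j] = (R[m, (j - m * i) % P].sum() - S + R[P, i]) / P
    return f

def circ_conv1d(a, b):
    """Direct 1D circular convolution of two length-P vectors."""
    P = len(a)
    return np.array([sum(a[k] * b[(d - k) % P] for k in range(P)) for d in range(P)])

def circ_conv2d(f, g):
    """Direct 2D circular convolution, used only as a reference."""
    P = f.shape[0]
    out = np.zeros((P, P))
    for a in range(P):
        for b in range(P):
            out += f[a, b] * np.roll(g, shift=(a, b), axis=(0, 1))
    return out

def circ_conv2d_dprt(f, g):
    """2D circular convolution computed as P + 1 independent 1D circular convolutions."""
    Rf, Rg = dprt(f), dprt(g)
    Rh = np.array([circ_conv1d(Rf[m], Rg[m]) for m in range(Rf.shape[0])])
    return idprt(Rh)

# Small demonstration with P = 5 (prime)
rng = np.random.default_rng(0)
f = rng.integers(0, 10, (5, 5)).astype(float)
g = rng.integers(0, 10, (5, 5)).astype(float)
print(np.allclose(circ_conv2d_dprt(f, g), circ_conv2d(f, g)))
```

Each of the P + 1 projections is convolved independently, which is what makes this kind of decomposition a natural candidate for parallel hardware: the 1D convolutions do not depend on one another.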

research
12/24/2021

Fast and Scalable Computation of the Forward and Inverse Discrete Periodic Radon Transform

The Discrete Periodic Radon Transform (DPRT) has been extensively used i...
research
03/30/2020

Rethinking Depthwise Separable Convolutions: How Intra-Kernel Correlations Lead to Improved MobileNets

We introduce blueprint separable convolutions (BSConv) as highly efficie...
research
06/17/2020

Optimizing Grouped Convolutions on Edge Devices

When deploying a deep neural network on constrained hardware, it is poss...
research
05/28/2019

Deep Scale-spaces: Equivariance Over Scale

We introduce deep scale-spaces (DSS), a generalization of convolutional ...
research
04/10/2019

Cross-Platform Performance Portability Using Highly Parametrized SYCL Kernels

Over recent years heterogeneous systems have become more prevalent acros...
research
04/17/2018

IGCV2: Interleaved Structured Sparse Convolutional Neural Networks

In this paper, we study the problem of designing efficient convolutional...
research
03/30/2023

Hybrid Dealiasing of Complex Convolutions

Efficient algorithms for computing linear convolutions based on the fast...
