Accelerating Fully Spectral CNNs with Adaptive Activation Functions on FPGA

Shuanglong Liu1, Hongxiang Fan2,a and Wayne Luk2,b
1Key Laboratory of Low-Dimensional Quantum Structures and Quantum Control, School of Physics and Electronics Hunan Normal University, Changsha, China
liu.shuanglong@hunnu.edu.cn
2Dept. of Computing, Imperial College London, UK
ah.fan17@imperial.ac.uk
bw.luk@imperial.ac.uk

ABSTRACT


Computing convolutional layers in frequency domain can largely reduce the computation overhead for training and inference of convolutional neural networks (CNNs). However, existing designs with such an idea require repeated spatial- and frequency-domain transforms due to the absence of nonlinear functions in the frequency domain, as such it makes the benefit less attractive for low-latency inference. This paper presents a fully spectral CNN approach by proposing a novel adaptive Rectified Linear Unit (ReLU) activation in spectral domain. The proposed design maintains the non-linearity in the network while taking into account the hardware efficiency in algorithm level. The spectral model size is further optimized by merging and fusing layers. Then, a customized hardware architecture is proposed to implement the designed spectral network on FPGA device with DSP optimizations for 8-bit fixed point multipliers. Our hardware accelerator is implemented on Intel’s Arria 10 device and applied to the MNIST, SVHN, AT&T and CIFAR-10 datasets. Experimental results show a speed improvement of 6⨯ ∼ 10⨯ and 4⨯ ∼ 5.7⨯ compared to state-of-the-art spatial or FFT-based designs respectively, while achieving similar accuracy across the benchmark datasets.



Full Text (PDF)