DATE 2019

Guaranteed Compression Rate for Activations in CNNs using a Frequency Pruning Approach

Sebastian Vogel^1,a, Christoph Schorn^1,b, Andre Guntoro^1,c and Gerd Ascheid²
¹Robert Bosch GmbH, Renningen, Germany
^asebastian.vogel@de.bosch.com
^bchristoph.schorn@de.bosch.com
^candre.guntoro@de.bosch.com
²RWTH Aachen University, Aachen, Germany
gerd.ascheid@ice.rwth-aachen.de

ABSTRACT

Convolutional Neural Networks have become state of the art for many computer vision tasks. However, the size of Neural Networks prevents their application in resource constrained systems. In this work, we present a lossy compression technique for intermediate results of Convolutional Neural Networks. The proposed method offers guaranteed compression rates and additionally adapts to performance requirements. Our experiments with networks for classification and semantic segmentation show, that our method outperforms state-of-the-art compression techniques used in CNN accelerators.

Keywords: Deep Neural Networks, Convolutional Neural Networks, Compression, Embedded Systems.

Full Text (PDF)