A Pulse-width Modulation Neuron with Continuous Activation for Processing-In-Memory Engines

Shuhang Zhang1,2,a, Bing Li1,b, Hai (Helen) Li2,3,c and Ulf Schlichtmann1,d

1Chair of Electronic Design Automation
2Institute for Advanced Study, Technical University of Munich (TUM), Munich, Germany
3Department of Electrical and Computer Engineering, Duke University, Durham, NC, United States
ashuhang.zhang@tum.de
bb.li@tum.de
culf.schlichtmann@tum.de
dhai.li@duke.edu

ABSTRACT

Processing-in-memory engines have successfully been applied to accelerate deep neural networks. For improving computing efficiency, spiking-based designs are widely explored. However, spiking-based designs quantize inter-layer signals naturally, leading to performance loss. In addition, the spike mismatch effect makes digital processing necessary, impeding direct signal transfer between layers and thus resulting in longer latency. In this paper, we propose a novel neuron design based on pulse width modulation, avoiding the quantization step and bypassing spike mismatch via the continuous activation. The computation latency and circuit complexity can significantly be reduced due to the absence of quantization and digital processing steps, while keeping a competitive performance. Simulation results show that the proposed neuron design can achieve > 100× speedup compared with spiking-based designs. The area and power consumption can be reduced up to 74:87% and 25:63%.



Full Text (PDF)