WRAP: Weight RemApping and Processing in RRAM-based Neural Network Accelerators Considering Thermal Effect

Po-Yuan Chen1, Fang-Yi Gu2,a, Yu-Hong Huang2,b and Ing-Chao Lin2,c
1Dept. of Electrical Engineering, National Cheng Kung University, Tainan, R.O.C.
pychen121@gmail.com
2Dept. of Computer Science and Information Engineering, National Cheng Kung University, Tainan, R.O.C.
a p78101556@gs.ncku.edu.tw
b p76801360@gs.ncku.edu.tw
c iclin@gs.ncku.edu.tw

ABSTRACT


Resistive random-access memory (RRAM) has shown great potential for computing-in-memory (CIM), which addresses the high memory bandwidth and low power requirements of neuromorphic computing systems. However, the accuracy of RRAM-based neural network (NN) accelerators can degrade significantly because of the intrinsic statistical variations in the resistance of RRAM cells and the negative effects of high temperatures. In this paper, we propose WRAP, a subarray-based, thermal-aware weight remapping and processing framework that maps the weights of a neural network model into RRAM subarrays. Instead of handling each weight individually, the framework maps weights into subarrays and applies subarray-based algorithms, which reduces computational complexity while maintaining accuracy under thermal impact. Experimental results demonstrate that, with our framework, the inference accuracy losses of four DNN models are less than 2% relative to the ideal results, and less than 1% with compensation applied, even when the surrounding temperature is around 360 K.
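To make the idea of subarray-level remapping concrete, the following is a minimal sketch and not the algorithm of the paper, whose details the abstract does not give. It assumes a hypothetical importance score (the sum of absolute weights in a tile) and a hypothetical per-subarray temperature list, and it places the most important tiles on the coolest subarrays. All function and parameter names are illustrative.

```python
from typing import Dict, List, Tuple

Matrix = List[List[float]]


def split_into_tiles(weights: Matrix, tile: int) -> List[Tuple[int, int, Matrix]]:
    """Cut a weight matrix into tile x tile blocks (edge blocks may be smaller)."""
    rows, cols = len(weights), len(weights[0])
    tiles = []
    for r in range(0, rows, tile):
        for c in range(0, cols, tile):
            block = [row[c:c + tile] for row in weights[r:r + tile]]
            tiles.append((r, c, block))
    return tiles


def tile_significance(block: Matrix) -> float:
    """Assumed importance score: sum of absolute weights in the block."""
    return sum(abs(w) for row in block for w in row)


def remap_tiles(
    weights: Matrix, tile: int, subarray_temps_k: List[float]
) -> Dict[Tuple[int, int], int]:
    """Map each tile origin (row, col) to a physical subarray index.

    Assumed policy: the most significant tile goes to the coolest subarray.
    Sorting is done over tiles, not over individual weights.
    """
    tiles = split_into_tiles(weights, tile)
    if len(tiles) > len(subarray_temps_k):
        raise ValueError("not enough physical subarrays for all tiles")
    by_importance = sorted(
        tiles, key=lambda t: tile_significance(t[2]), reverse=True
    )
    coolest_first = sorted(
        range(len(subarray_temps_k)), key=lambda i: subarray_temps_k[i]
    )
    return {(r, c): sub for (r, c, _), sub in zip(by_importance, coolest_first)}
```

Because the ranking operates on tiles, its cost grows with the number of subarrays rather than the number of weights, which is the kind of complexity reduction the abstract attributes to subarray-based processing.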


