Publications

Detailed Information

Extreme Partial-Sum Quantization for Analog Computing-In-Memory Neural Network Accelerators

Cited 2 time in Web of Science Cited 0 time in Scopus
Authors

Kim, Yulhwa; Kim, Hyungjun; Kim, Jae-Joon

Issue Date
2022-10
Publisher
Association for Computing Machinary, Inc.
Citation
ACM Journal on Emerging Technologies in Computing Systems, Vol.18 No.4, p. 75
Abstract
In Analog Computing-in-Memory (CIM) neural network accelerators, analog-to-digital converters (ADCs) are required to convert the analog partial sums generated from a CIM array to digital values. The overhead from ADCs substantially degrades the energy efficiency of CIM accelerators so that previous works attempted to lower the ADC resolution considering the distribution of the partial sums. Despite the efforts, the required ADC resolution still remains relatively high. In this article, we propose the data-driven partial sum quantization scheme, which exhaustively searches for the optimal quantization range with little computational burden. We also report that analyzing the characteristics of the partial sum distributions at each layer gives an additional information to further reduce the ADC resolution compared to previous works that mostly used the characteristics of the partial sum distributions of the entire network. Based on the finer-level data-driven approach combined with retraining, we present a methodology for extreme partial-sum quantization. Experimental results show that the proposed method can reduce the ADC resolution to 2 to 3 bits for CIFAR-10 dataset, which is the smaller ADC bit resolution than any previous CIM-based NN accelerators.
ISSN
1550-4832
URI
https://hdl.handle.net/10371/188964
DOI
https://doi.org/10.1145/3528104
Files in This Item:
There are no files associated with this item.
Appears in Collections:

Altmetrics

Item View & Download Count

  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share