Publications

Detailed Information

Weighted-entropy-based quantization for deep neural networks

Cited 0 time in Web of Science Cited 179 time in Scopus
Authors

Park, Eunhyeok; Ahn, Junwhan; Yoo, Sungjoo

Issue Date
2017-11
Publisher
Institute of Electrical and Electronics Engineers Inc.
Citation
Proceedings - 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017, Vol.2017-January, pp.7197-7205
Abstract
© 2017 IEEE. Quantization is considered as one of the most effective methods to optimize the inference cost of neural network models for their deployment to mobile and embedded systems, which have tight resource constraints. In such approaches, it is critical to provide low-cost quantization under a tight accuracy loss constraint (e.g., 1%). In this paper, we propose a novel method for quantizing weights and activations based on the concept of weighted entropy. Unlike recent work on binary-weight neural networks, our approach is multi-bit quantization, in which weights and activations can be quantized by any number of bits depending on the target accuracy. This facilitates much more flexible exploitation of accuracy-performance trade-off provided by different levels of quantization. Moreover, our scheme provides an automated quantization flow based on conventional training algorithms, which greatly reduces the design-time effort to quantize the network. According to our extensive evaluations based on practical neural network models for image classification (AlexNet, GoogLeNet and ResNet-50/101), object detection (R-FCN with ResNet-50), and language modeling (an LSTM network), our method achieves significant reductions in both the model size and the amount of computation with minimal accuracy loss. Also, compared to existing quantization schemes, ours provides higher accuracy with a similar resource constraint and requires much lower design effort.
URI
https://hdl.handle.net/10371/192885
Files in This Item:
There are no files associated with this item.
Appears in Collections:

Altmetrics

Item View & Download Count

  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share