FALCON: FAst and Lightweight CONvolution for Compressing and Accelerating Convolutional Neural Networks

Chun Quan



DC Field	Value	Language
dc.contributor.advisor	Kang, U	-
dc.contributor.author	Chun Quan	-
dc.date.accessioned	2019-10-18T15:44:49Z	-
dc.date.available	2019-10-18T15:44:49Z	-
dc.date.issued	2019-08	-
dc.identifier.other	000000157911	-
dc.identifier.uri	https://hdl.handle.net/10371/161072	-
dc.identifier.uri	http://dcollection.snu.ac.kr/common/orgView/000000157911	ko_KR
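dc.description	Thesis (Master's) -- Seoul National University Graduate School: College of Engineering, Department of Computer Science and Engineering, 2019. 8. Kang, U.	-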
dc.description.abstract	How can we efficiently compress Convolutional Neural Networks (CNNs) while maintaining accuracy on classification tasks? One promising approach is based on depthwise separable convolution, which replaces a standard convolution with a depthwise convolution followed by a pointwise convolution. However, previous works based on depthwise separable convolution are limited because 1) they are mostly heuristic approaches without a precise understanding of their relation to the standard convolution, and 2) their accuracy does not match that of the standard convolution. In this paper, we propose FALCON, an accurate and lightweight method for compressing CNNs. FALCON is derived by interpreting existing convolution methods based on depthwise separable convolution using the Extended Hadamard Product (EHP), our proposed mathematical formulation for approximating the standard convolution kernel. This interpretation leads to a generalized version, rank-k FALCON, which further improves accuracy at the cost of somewhat less compression and computation reduction. Experiments show that FALCON 1) outperforms existing methods based on depthwise separable convolution, and 2) compresses the standard CNN model by up to 8× and reduces its computation by up to 8× while ensuring similar accuracy. We also demonstrate that rank-k FALCON provides even better accuracy than the standard convolution in many cases, while using fewer parameters and floating point operations.	-
dc.description.tableofcontents	I. Introduction. II. Preliminaries: 2.1 Convolution Neural Network; 2.2 Depthwise Separable Convolution; 2.3 Methods Based on Depthwise Separable Convolution; 2.4 Hadamard Product. III. Proposed Method: 3.1 Extended Hadamard Product (EHP); 3.2 Depthwise Separable Convolution and EHP; 3.3 FAst and Lightweight CONvolution (FALCON); 3.4 Rank-k FALCON; 3.5 Quantitative Analysis (3.5.1 FALCON; 3.5.2 Rank-k FALCON). IV. Experiments: 4.1 Experimental Setup; 4.2 Fitting Convolution Unit into Model; 4.3 Accuracy vs. Compression; 4.4 Accuracy vs. Computation; 4.5 Rank-k FALCON. V. Related Works. VI. Conclusion. References. Appendix: A Generality of EHP; B Parameters and FLOPs. Acknowledgements.	-
dc.language.iso	eng	-
dc.publisher	Seoul National University Graduate School	-
dc.subject	CNN compression	-
dc.subject	CNN acceleration	-
dc.subject	convolution	-
dc.subject.ddc	621.39	-
dc.title	FALCON: FAst and Lightweight CONvolution for Compressing and Accelerating Convolutional Neural Networks	-
dc.type	Thesis	-
dc.type	Dissertation	-
dc.contributor.AlternativeAuthor	쿠안춘	-
dc.contributor.department	College of Engineering, Department of Computer Science and Engineering	-
dc.description.degree	Master	-
dc.date.awarded	2019-08	-
dc.identifier.uci	I804:11032-000000157911	-
dc.identifier.holdings	000000000040▲000000000041▲000000157911▲	-
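
Note on the abstract: the parameter savings of depthwise separable convolution can be illustrated with a short count. The sketch below is not taken from the thesis; it uses the textbook parameter counts for a standard convolution and for a depthwise-plus-pointwise pair, ignores biases, and assumes both produce the same output size. The function names and the example layer shape (3×3 kernel, 256 input channels, 512 output channels) are hypothetical, chosen only for illustration.

```python
"""Parameter and multiply-add counts: standard vs. depthwise separable convolution.

Illustrative sketch only; names and layer shapes are assumptions, not the
thesis's code. Biases are ignored throughout.
"""


def standard_conv_params(kernel_size: int, in_channels: int, out_channels: int) -> int:
    # One kernel_size x kernel_size filter per (input channel, output channel) pair.
    return kernel_size * kernel_size * in_channels * out_channels


def depthwise_separable_params(kernel_size: int, in_channels: int, out_channels: int) -> int:
    # Depthwise step: one kernel_size x kernel_size filter per input channel.
    depthwise = kernel_size * kernel_size * in_channels
    # Pointwise step: a 1x1 convolution mixing input channels into output channels.
    pointwise = in_channels * out_channels
    return depthwise + pointwise


def multiply_adds(params: int, out_height: int, out_width: int) -> int:
    # Each weight is applied once per output position, assuming the depthwise
    # and pointwise steps share the same output resolution.
    return params * out_height * out_width


def compression_ratio(kernel_size: int, in_channels: int, out_channels: int) -> float:
    # How many times fewer parameters the separable pair needs.
    standard = standard_conv_params(kernel_size, in_channels, out_channels)
    separable = depthwise_separable_params(kernel_size, in_channels, out_channels)
    return standard / separable


if __name__ == "__main__":
    k, m, n = 3, 256, 512  # hypothetical layer shape
    print("standard params: ", standard_conv_params(k, m, n))
    print("separable params:", depthwise_separable_params(k, m, n))
    print("ratio:           ", round(compression_ratio(k, m, n), 2))
```

For this hypothetical layer the ratio is about 8.8, and for a 3×3 kernel it can never reach 9, since it simplifies to 9N / (9 + N) for N output channels. Under the same-output-size assumption the multiply-add ratio is identical, because both counts are scaled by the same number of output positions.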

Appears in Collections:

College of Engineering/Engineering Practice School
- Dept. of Computer Science and Engineering
  - Theses (Master's Degree, Computer Science and Engineering)

Files in This Item:

000000157911.pdf 1.76 MB
