Neural Tangent Kernel Analysis of Deep Narrow Neural Networks : 좁고 깊은 심층신경망의 뉴럴 탄젠트 커널 분석
DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Ernest K. Ryu | - |
dc.contributor.author | 이종민 | - |
dc.date.accessioned | 2023-11-20T04:49:35Z | - |
dc.date.available | 2023-11-20T04:49:35Z | - |
dc.date.issued | 2023 | - |
dc.identifier.other | 000000179443 | - |
dc.identifier.uri | https://hdl.handle.net/10371/197304 | - |
dc.identifier.uri | https://dcollection.snu.ac.kr/common/orgView/000000179443 | ko_KR |
dc.description | Thesis (Master's) -- Seoul National University Graduate School : College of Natural Sciences, Department of Mathematical Sciences, 2023. 8. Ernest K. Ryu. | - |
dc.description.abstract | The tremendous recent progress in analyzing the training dynamics of overparameterized neural networks has primarily focused on wide networks and therefore does not sufficiently address the role of depth in deep learning. In this work, we present the first trainability guarantee of infinitely deep but narrow neural networks. We study the infinite-depth limit of a multilayer perceptron (MLP) with a specific initialization and establish a trainability guarantee using the NTK theory. We then extend the analysis to an infinitely deep convolutional neural network (CNN) and perform brief experiments. | - |
dc.description.abstract | The tremendous recent progress in analyzing the training dynamics of overparameterized neural networks has focused primarily on wide networks and therefore does not sufficiently address the role of depth in deep learning. In this thesis, we present the first trainability guarantee for infinitely deep but narrow neural networks. We study infinitely deep multilayer perceptrons under a specific initialization and establish a trainability guarantee using neural tangent kernel theory. We then extend the analysis to infinitely deep convolutional neural networks and perform brief experiments. | - |
dc.description.tableofcontents | 1 Introduction 2 -- 1.1 Prior works 3 -- 2 Preliminaries and Notations 5 -- 2.1 Kernel gradient flow 6 -- 2.2 Neural tangent kernel 7 -- 3 NTK analysis of infinitely deep MLP 9 -- 3.1 Initialization 10 -- 3.1.1 Gradient flow and neural tangent kernel 11 -- 3.2 Convergence in infinite-depth limit 13 -- 3.3 Proof outline 15 -- 4 NTK analysis of infinitely deep CNN 19 -- 4.0.1 Initialization 20 -- 4.0.2 Convergence in infinite-depth limit 22 -- 5 Experiments 24 -- 5.1 Convergence of the scaled NTK 24 -- 5.2 Trainability of the deep narrow neural network 24 -- 5.3 Accumulation of the layer-wise effect 26 -- 6 Conclusion 27 -- 7 Appendix 28 -- Bibliography 29 -- Abstract (in Korean) 37 -- Acknowledgements 38 | - |
dc.format.extent | vi, 38 | - |
dc.language.iso | eng | - |
dc.publisher | Seoul National University Graduate School | - |
dc.subject | Neural tangent kernel | - |
dc.subject | Gradient flow | - |
dc.subject | Deep narrow neural network | - |
dc.subject | Artificial intelligence | - |
dc.subject.ddc | 510 | - |
dc.title | Neural Tangent Kernel Analysis of Deep Narrow Neural Networks | - |
dc.title.alternative | 좁고 깊은 심층신경망의 뉴럴 탄젠트 커널 분석 | - |
dc.type | Thesis | - |
dc.type | Dissertation | - |
dc.contributor.AlternativeAuthor | Jongmin Lee | - |
dc.contributor.department | College of Natural Sciences, Department of Mathematical Sciences | - |
dc.description.degree | Master's | - |
dc.date.awarded | 2023-08 | - |
dc.identifier.uci | I804:11032-000000179443 | - |
dc.identifier.holdings | 000000000050▲000000000058▲000000179443▲ | - |
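The abstract centers on the neural tangent kernel, Θ(x, x′) = ⟨∇θ f(x; θ), ∇θ f(x′; θ)⟩, evaluated for deep narrow networks. The following is a minimal illustrative sketch of computing the empirical NTK of a deep narrow tanh MLP in numpy; the plain Gaussian initialization and tanh activation here are assumptions for the example, not the specific initialization scheme analyzed in the thesis.

```python
import numpy as np

def init_mlp(widths, seed=0):
    """Random Gaussian init (illustrative; not the thesis's specific scheme)."""
    rng = np.random.default_rng(seed)
    params = []
    for fan_in, fan_out in zip(widths[:-1], widths[1:]):
        W = rng.normal(0.0, 1.0 / np.sqrt(fan_in), (fan_out, fan_in))
        b = np.zeros(fan_out)
        params.append((W, b))
    return params

def forward(params, x):
    """Forward pass; keeps every activation so backprop can reuse them."""
    acts = [x]
    for i, (W, b) in enumerate(params):
        z = W @ acts[-1] + b
        # tanh on hidden layers, linear scalar output on the last layer
        acts.append(z if i == len(params) - 1 else np.tanh(z))
    return acts

def param_gradient(params, x):
    """Flattened gradient of the scalar output w.r.t. all weights and biases."""
    acts = forward(params, x)
    grads = []
    delta = np.ones(1)  # d(out)/d(z_last) for a scalar linear output
    for i in reversed(range(len(params))):
        W, _ = params[i]
        grads.append(np.concatenate([np.outer(delta, acts[i]).ravel(), delta]))
        if i > 0:
            # tanh'(z) = 1 - tanh(z)^2, and acts[i] = tanh(z_{i-1})
            delta = (W.T @ delta) * (1.0 - acts[i] ** 2)
    return np.concatenate(grads[::-1])

def empirical_ntk(params, x1, x2):
    """Theta(x1, x2) = <grad_theta f(x1), grad_theta f(x2)>."""
    return float(param_gradient(params, x1) @ param_gradient(params, x2))
```

For a deep narrow network in the thesis's sense, `widths` would have many entries of small constant width, e.g. `init_mlp([3] + [4] * 20 + [1])`; the NTK theory then concerns how `empirical_ntk` behaves as the number of layers grows.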
Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.