Publications

Detailed Information

A Unified Approach on Bayesian Optimization of Deep Neural Network : 심층 신경망의 베이지안 최적화에 대한 통합적 접근법

DC Field Value Language
dc.contributor.advisor이원종-
dc.contributor.author노승은-
dc.date.accessioned2017-10-31T08:21:34Z-
dc.date.available2017-10-31T08:21:34Z-
dc.date.issued2017-08-
dc.identifier.other000000145547-
dc.identifier.urihttps://hdl.handle.net/10371/137956-
dc.description학위논문 (석사)-- 서울대학교 융합과학기술대학원 융합과학부, 2017. 8. 이원종.-
dc.description.abstractDue to the success of deep neural network (DNN) in recent years, DNN has begun to be applied in many research and application fields. At the same time, optimizing the hyper-parameters of DNNs has become an important issue because the performance of network is very sensitive to the setting of the hyper-parameters, such as the size of the convolutional filter or the number of neurons in the fully connected layer. Therefore, active research is being conducted on hyper-parameter optimization (HPO), and the Bayesian optimization method has shown promising results. However, there are some limitations with the Bayesian optimization. First, there still exists various options that need to be set in the optimization process itself-
dc.description.abstractmodels and acquisition functions. Although the optimization varies greatly depending on the model and the acquisition function, we do not know which option is the best for a given dataset. Therefore, currently, it is all about choosing one that seems appropriate and hoping to be lucky. Second, the acquisition functions do not reflect the time-cost, which leads to time-inefficient selections. If we can predict how much time a candidate will spend beforehand, we can make a more cost-efficient choice.
In order to solve these two problems, we propose the following method. First, we suggest a unifying approach which adapts to the problem and dataset within a single optimization process. It first explores which models and acquisition function performs well, and then converges to well performing options to exploit them. Second, we devise the acquisition function that takes time-cost into account. To do so, we must be able to predict the training time of the DNN. Hence we also propose a fast and accurate method to predict the training time of the DNN. Based on these two methods, we tried to improve Bayesian optimization, and we conducted several experiments to confirm the effect of unifying approach and time-efficient acquisition functions. Through the experiment, we achieve improvements in terms of time consumption and performance.
-
dc.description.tableofcontentsI. Introduction 1
1.1 The Success of Deep Neural Networks 1
1.2 Hyper-parameter Optimization 3
1.3 Overview of this Study 5

II. Bayesian Optimization 7
2.1 Introduction to Bayesian Optimization 7
2.2 Models 9
2.3 Acquisition Functions 12

III. Research Questions and Related Works 16
3.1 Research Questions 16
3.2 Related Works for Unifying Approach 18
3.3 Related Works for Cost-Efficient Acquisition Function 23

IV. Method 27
4.1 Pre-evaluated Look-up Table 27
4.2 Unifying Models and Acquisition Functions 29
4.3 Time Efficient Acquisition Functions 32

V. Experiment Results 35
5.1 Experimental Setup 35
5.2 Unifying Models 37
5.3 Unifying Models and Acquisition Functions 40
5.4 Time-Efficient Acquisition Functions 44

VI. Conclusion 52
6.1 Summary 52
6.2 Contributions 53
6.3 Limitations 54

References 56
-
dc.formatapplication/pdf-
dc.format.extent2007310 bytes-
dc.format.mediumapplication/pdf-
dc.language.isoen-
dc.publisher서울대학교 융합과학기술대학원-
dc.subjectBayesian Optimization-
dc.subjectHyper-parameter Optimization-
dc.subjectDeep Neural Network-
dc.subjectMulti-armed Bandit-
dc.subjectAcquisition Function-
dc.subject.ddc620.5-
dc.titleA Unified Approach on Bayesian Optimization of Deep Neural Network-
dc.title.alternative심층 신경망의 베이지안 최적화에 대한 통합적 접근법-
dc.typeThesis-
dc.contributor.AlternativeAuthorSeungeun Rho-
dc.description.degreeMaster-
dc.contributor.affiliation융합과학기술대학원 융합과학부-
dc.date.awarded2017-08-
Appears in Collections:
Files in This Item:

Altmetrics

Item View & Download Count

  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share