Publications
Detailed Information
Bayesian Neural Bandit Using Online SWAG : Bayesian Neural Bandit Using Online SWAG
Cited 0 time in
Web of Science
Cited 0 time in Scopus
- Authors
- Advisor
- 오민환
- Issue Date
- 2022
- Publisher
- 서울대학교 대학원
- Keywords
- ContextualBandit ; BayesianDeepLearning ; NeuralBandit ; StochasticWeightAveragingGaussian(SWAG)
- Description
- 학위논문(석사) -- 서울대학교대학원 : 데이터사이언스대학원 데이터사이언스학과, 2022. 8. 오민환.
- Abstract
- In this paper, we propose a Neural SWAG Bandit algorithm that combines a neural network-based bandit algorithm with Stochastic Weight Averaging Gaussian (SWAG), a Bayesian deep learning methodology. Neural Bandit is a bandit algorithm that uses the output of neural networks as an estimated reward. SWAG is a Bayesian Deep Learning method that samples parameters from the gaussian posterior distribution, which has been shown to have state-of-the-art performance and robustness compared to benchmark algorithms. By adapting SWAG into an online setting and combining it with Neural Bandit, we can leverage efficient sampling from deep neural networks while learning online. Our experiment results indicate that Neural SWAG Bandit benefits from Bayesian deep learning as well as exhibits superior performance compared to existing benchmark algorithms.
- Language
- eng
- Files in This Item:
- Appears in Collections:
Item View & Download Count
Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.