
Deep Reinforcement Learning with LSTM-based Exploration Bonus

DC Field Value Language
dc.contributor.advisor유석인-
dc.contributor.author양진호-
dc.date.accessioned2017-07-14T02:36:19Z-
dc.date.available2017-07-14T02:36:19Z-
dc.date.issued2017-02-
dc.identifier.other000000140903-
dc.identifier.urihttps://hdl.handle.net/10371/122687-
dc.descriptionThesis (Master's) -- Seoul National University Graduate School : Dept. of Computer Science and Engineering, 2017. 2. 유석인.-
dc.description.abstractDeep learning is the dominant method in the recent machine learning community. Deep learning outperforms traditional methods on various tasks such as image classification, object recognition, speech recognition, and natural language processing. However, most deep learning algorithms focus on supervised learning. Supervised learning considers only a static environment and is therefore not well suited to dynamic environments.

To address this problem, reinforcement learning combined with deep learning, called deep reinforcement learning, has been proposed. Deep reinforcement learning consists of two parts. The first part extracts features using a deep learning method. The second part learns the proper action for the agent through reinforcement learning, by trial and error.

However, reinforcement learning algorithms face the exploration and exploitation trade-off problem: it is hard to find the optimal ratio of exploration to exploitation. To solve this problem, we propose a novel deep reinforcement learning algorithm with an LSTM-based exploration bonus. The LSTM-based exploration bonus uses a Long Short-Term Memory (LSTM) network as a predictor and derives an exploration bonus from its prediction error. The bonus guides the agent toward more daring actions, and as a result the agent can find the optimal solution in a shorter time.
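The mechanism described above can be sketched roughly as follows. This is a toy illustration, not the thesis's implementation: the network sizes, the `beta` coefficient, the `bonus_reward` helper, and the untrained random-weight LSTM cell are all assumptions made here for clarity; in the thesis the predictor would be trained on the agent's actual observations.

```python
import numpy as np

rng = np.random.default_rng(0)

class LSTMCell:
    """Minimal LSTM cell (forward pass only) standing in for the predictor.
    Weights are random here; a real predictor would be trained."""
    def __init__(self, n_in, n_hid):
        s = 1.0 / np.sqrt(n_hid)
        self.Wx = rng.uniform(-s, s, (4 * n_hid, n_in))
        self.Wh = rng.uniform(-s, s, (4 * n_hid, n_hid))
        self.b = np.zeros(4 * n_hid)
        self.Wo = rng.uniform(-s, s, (n_in, n_hid))  # hidden -> predicted next state
        self.n_hid = n_hid

    def step(self, x, h, c):
        z = self.Wx @ x + self.Wh @ h + self.b
        i, f, g, o = np.split(z, 4)
        sig = lambda a: 1.0 / (1.0 + np.exp(-a))
        i, f, o = sig(i), sig(f), sig(o)
        c = f * c + i * np.tanh(g)
        h = o * np.tanh(c)
        return h, c

    def predict(self, h):
        return self.Wo @ h

def bonus_reward(reward, pred, actual, beta=0.1):
    """Augment the environment reward with a bonus proportional to the
    predictor's squared error: poorly predicted (novel) states earn more."""
    bonus = beta * float(np.sum((pred - actual) ** 2))
    return reward + bonus, bonus

# Toy rollout: the "environment" is random noise, standing in for Atari frames.
n_obs, n_hid = 8, 16
cell = LSTMCell(n_obs, n_hid)
h, c = np.zeros(n_hid), np.zeros(n_hid)
obs = rng.standard_normal(n_obs)
for _ in range(5):
    h, c = cell.step(obs, h, c)
    pred = cell.predict(h)
    next_obs = rng.standard_normal(n_obs)  # stand-in for the environment step
    r_aug, bonus = bonus_reward(0.0, pred, next_obs)
    assert bonus >= 0.0  # the bonus never penalizes the agent
    obs = next_obs
```

The key design point is that the bonus depends only on prediction error, so states the predictor already models well contribute little, while surprising states inflate the reward and draw the agent toward them.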

We test our method on various Atari games. Experimental results show that our method outperforms plain deep reinforcement learning, which demonstrates its effectiveness.
-
dc.description.tableofcontentsIntroduction 1
Related Works 3
Deep Reinforcement Learning 8
Experiment 23
Conclusion 27
Bibliography 28
요약 (Abstract in Korean) 33
-
dc.formatapplication/pdf-
dc.format.extent3028176 bytes-
dc.format.mediumapplication/pdf-
dc.language.isoen-
dc.publisher서울대학교 대학원-
dc.subjectDeep Learning-
dc.subjectDeep Reinforcement Learning-
dc.subjectLong Short-Term Memory Networks-
dc.subjectExploration and Exploitation Trade-Off-
dc.subject.ddc621-
dc.titleDeep Reinforcement Learning with LSTM-based Exploration Bonus-
dc.typeThesis-
dc.description.degreeMaster-
dc.citation.pages33-
dc.contributor.affiliation공과대학 컴퓨터공학부-
dc.date.awarded2017-02-