Understanding features on evolutionary policy optimizations: Feature learning difference between gradient-based and evolutionary policy optimizations

Cited 0 times in Web of Science; cited 1 time in Scopus.

Citation: Proceedings of the ACM Symposium on Applied Computing, pp.1112-1118

Abstract: © 2020 ACM. We analyze two deep reinforcement learning algorithms, a gradient-based policy optimization and an evolutionary one, using several visualization techniques and supplementary experiments. Filter visualization and saliency maps are used to examine whether meaningful features are properly extracted by the two algorithms. In addition to the visual analysis, further experiments are devised to strengthen its validity. We observed that evolutionary policy optimization tends to make use of prior knowledge and, through its powerful exploration ability, learns the prior action distribution of the policy, which a gradient-based algorithm cannot easily do.
