Publications

Detailed Information

Understanding features on evolutionary policy optimizations: Feature learning difference between gradient-based and evolutionary policy optimizations

Cited 0 time in Web of Science Cited 1 time in Scopus
Authors

Lee, Sangyeop; Ha, Myoung Hoon; Moon, Byungro

Issue Date
2020-03
Publisher
Association for Computing Machinery
Citation
Proceedings of the ACM Symposium on Applied Computing, pp.1112-1118
Abstract
© 2020 ACM.We analyze two deep reinforcement learning algorithms, gradient-based policy optimization and evolutionary one, by a number of visualization techniques and supplement experiments. As such techniques, filter visualization and saliency map are used to examine whether meaningful features properly extracted in the two algorithms. In addition to visual analysis, some experiments are devised to enhance the validity of the analysis. We observed that an evolutionary policy optimization tends to make use of the prior knowledge and learn the prior action distribution of the policy by a powerful exploration ability, which a gradient-based algorithm cannot do easily.
URI
https://hdl.handle.net/10371/186549
DOI
https://doi.org/10.1145/3341105.3373966
Files in This Item:
There are no files associated with this item.
Appears in Collections:

Altmetrics

Item View & Download Count

  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share