Publications

Detailed Information

Stochastic Subgradient Methods for Dynamic Programming in Continuous State and Action Spaces

Cited 1 time in Web of Science Cited 1 time in Scopus
Authors

Jang, Sunho; Yang, Insoon

Issue Date
2019-12
Publisher
Institute of Electrical and Electronics Engineers Inc.
Citation
Proceedings of the IEEE Conference on Decision and Control, Vol.2019-December, pp.7287-7293
Abstract
© 2019 IEEE.In this paper, we propose a numerical method for dynamic programming in continuous state and action spaces. We first approximate the Bellman operator by using a convex optimization problem, which has many constraints. This convex program is then solved using stochastic subgradient descent. To avoid the full projection onto the high-dimensional feasible set, we develop a novel algorithm that samples, in a coordinated fashion, a mini-batch for a subgradient and another for projection. We show several salient properties of this algorithm, including convergence, and a reduction in the feasibility error and in the variance of the stochastic subgradient.
ISSN
0191-2216
URI
https://hdl.handle.net/10371/198096
Files in This Item:
There are no files associated with this item.
Appears in Collections:

Altmetrics

Item View & Download Count

  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share