Publications

Detailed Information

Stochastic Subgradient Methods for Dynamic Programming in Continuous State and Action Spaces

DC Field Value Language
dc.contributor.authorJang, Sunho-
dc.contributor.authorYang, Insoon-
dc.date.accessioned2023-12-11T04:11:22Z-
dc.date.available2023-12-11T04:11:22Z-
dc.date.created2020-06-26-
dc.date.issued2019-12-
dc.identifier.citationProceedings of the IEEE Conference on Decision and Control, Vol.2019-December, pp.7287-7293-
dc.identifier.issn0191-2216-
dc.identifier.urihttps://hdl.handle.net/10371/198096-
dc.description.abstract© 2019 IEEE.In this paper, we propose a numerical method for dynamic programming in continuous state and action spaces. We first approximate the Bellman operator by using a convex optimization problem, which has many constraints. This convex program is then solved using stochastic subgradient descent. To avoid the full projection onto the high-dimensional feasible set, we develop a novel algorithm that samples, in a coordinated fashion, a mini-batch for a subgradient and another for projection. We show several salient properties of this algorithm, including convergence, and a reduction in the feasibility error and in the variance of the stochastic subgradient.-
dc.language영어-
dc.publisherInstitute of Electrical and Electronics Engineers Inc.-
dc.titleStochastic Subgradient Methods for Dynamic Programming in Continuous State and Action Spaces-
dc.typeArticle-
dc.citation.journaltitleProceedings of the IEEE Conference on Decision and Control-
dc.identifier.scopusid2-s2.0-85082447218-
dc.citation.endpage7293-
dc.citation.startpage7287-
dc.citation.volume2019-December-
dc.description.isOpenAccessN-
dc.contributor.affiliatedAuthorYang, Insoon-
dc.type.docTypeConference Paper-
dc.description.journalClass1-
Appears in Collections:
Files in This Item:
There are no files associated with this item.

Altmetrics

Item View & Download Count

  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share