Publications

Detailed Information

A convex optimization approach to distributionally robust Markov decision processes with Wasserstein distance

DC Field Value Language
dc.contributor.authorYang, Insoon-
dc.creator양인순-
dc.date.accessioned2019-04-25T00:14:02Z-
dc.date.available2020-04-05T00:14:02Z-
dc.date.created2018-11-23-
dc.date.created2018-11-23-
dc.date.issued2017-06-
dc.identifier.citationIEEE Control Systems Letters, Vol.1 No.1, pp.164-169-
dc.identifier.issn2475-1456-
dc.identifier.urihttps://hdl.handle.net/10371/148928-
dc.description.abstractWe consider the problem of constructing control policies that are robust against distribution errors in the model parameters of Markov decision processes. The Wasserstein metric is used to model the ambiguity set of admissible distributions. We prove the existence and optimality of Markov policies and develop convex optimization-based tools to compute and analyze the policies. Our methods, which are based on the Kantorovich convex relaxation and duality principle, have the following advantages. First, the proposed dual formulation of an associated Bellman equation resolves the infinite dimensionality issue that is inherent in its original formulation when the nominal distribution has a finite support. Second, our duality analysis identifies the structure of a worst-case distribution and provides a simple decentralized method for its construction. Third, a sensitivity analysis tool is developed to quantify the effect of ambiguity set parameters on the performance of distributionally robust policies. The effectiveness of our proposed tools is demonstrated through a human-centered air conditioning problem.-
dc.language영어-
dc.language.isoenen
dc.publisherIEEE-
dc.titleA convex optimization approach to distributionally robust Markov decision processes with Wasserstein distance-
dc.typeArticle-
dc.identifier.doi10.1109/LCSYS.2017.2711553-
dc.citation.journaltitleIEEE Control Systems Letters-
dc.identifier.scopusid2-s2.0-85046301234-
dc.description.srndOAIID:RECH_ACHV_DSTSH_NO:T201735292-
dc.description.srndRECH_ACHV_FG:RR00200001-
dc.description.srndADJUST_YN:-
dc.description.srndEMP_ID:A080662-
dc.description.srndCITE_RATE:0-
dc.description.srndFILENAME:17LCSS_Wasserstein.pdf-
dc.description.srndDEPT_NM:전기·정보공학부-
dc.description.srndEMAIL:insoonyang@snu.ac.kr-
dc.description.srndSCOPUS_YN:N-
dc.description.srndFILEURL:https://srnd.snu.ac.kr/eXrepEIR/fws/file/be749eae-74f9-44e4-921f-71341ffbd584/link-
dc.citation.endpage169-
dc.citation.number1-
dc.citation.startpage164-
dc.citation.volume1-
dc.description.isOpenAccessN-
dc.contributor.affiliatedAuthorYang, Insoon-
dc.identifier.srndT201735292-
dc.type.docTypeArticle-
dc.description.journalClass1-
dc.subject.keywordAuthorOptimal control, stochastic systems, Markov processes, probability distribution, optimization, robustness.-
Appears in Collections:
Files in This Item:
There are no files associated with this item.

Altmetrics

Item View & Download Count

  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share