Publications

Detailed Information

A convex optimization approach to distributionally robust Markov decision processes with Wasserstein distance

Cited 0 time in Web of Science Cited 48 time in Scopus
Authors

Yang, Insoon

Issue Date
2017-06
Publisher
IEEE
Citation
IEEE Control Systems Letters, Vol.1 No.1, pp.164-169
Abstract
We consider the problem of constructing control policies that are robust against distribution errors in the model parameters of Markov decision processes. The Wasserstein metric is used to model the ambiguity set of admissible distributions. We prove the existence and optimality of Markov policies and develop convex optimization-based tools to compute and analyze the policies. Our methods, which are based on the Kantorovich convex relaxation and duality principle, have the following advantages. First, the proposed dual formulation of an associated Bellman equation resolves the infinite dimensionality issue that is inherent in its original formulation when the nominal distribution has a finite support. Second, our duality analysis identifies the structure of a worst-case distribution and provides a simple decentralized method for its construction. Third, a sensitivity analysis tool is developed to quantify the effect of ambiguity set parameters on the performance of distributionally robust policies. The effectiveness of our proposed tools is demonstrated through a human-centered air conditioning problem.
ISSN
2475-1456
Language
English
URI
https://hdl.handle.net/10371/148928
DOI
https://doi.org/10.1109/LCSYS.2017.2711553
Files in This Item:
There are no files associated with this item.
Appears in Collections:

Altmetrics

Item View & Download Count

  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share