Publications

Detailed Information

Characterizing optimal mixed policies: Where to intervene and what to observe

Cited 0 time in Web of Science Cited 11 time in Scopus
Authors

Lee, Sanghack; Bareinboim, Elias

Issue Date
2020-12
Publisher
Neural information processing systems foundation
Citation
Advances in Neural Information Processing Systems, Vol.2020-December
Abstract
© 2020 Neural information processing systems foundation. All rights reserved.Intelligent agents are continuously faced with the challenge of optimizing a policy based on what they can observe (see) and which actions they can take (do) in the environment where they are deployed. Most policies can be parametrized in terms of these two dimensions, i.e., as a function of what can be seen and done given a certain situation, which we call a mixed policy. In this paper, we investigate several properties of the class of mixed policies and provide an efficient and effective characterization, including optimality and non-redundancy. Specifically, we introduce a graphical criterion to identify unnecessary contexts for a set of actions, leading to a natural characterization of non-redundancy of mixed policies. We then derive sufficient conditions under which one strategy can dominate the other with respect to their maximum achievable expected rewards (optimality). This characterization leads to a fundamental understanding of the space of mixed policies and a possible refinement of the agents strategy so that it converges to the optimum faster and more robustly. One surprising result of the causal characterization is that the agent following a more standard approach—intervening on all intervenable variables and observing all available contexts—may be hurting itself, and will never achieve an optimal performance.
ISSN
1049-5258
URI
https://hdl.handle.net/10371/201555
Files in This Item:
There are no files associated with this item.
Appears in Collections:

Related Researcher

  • Graduate School of Data Science
Research Area Causal Decision Making, Causal Discovery, Causal Inference

Altmetrics

Item View & Download Count

  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share