Please use this identifier to cite or link to this item: https://doi.org/10.1287/moor.1120.0540
DC Field: Value
dc.title: Distributionally robust Markov decision processes
dc.contributor.author: Xu, H.
dc.contributor.author: Mannor, S.
dc.date.accessioned: 2014-06-17T06:17:49Z
dc.date.available: 2014-06-17T06:17:49Z
dc.date.issued: 2012-05
dc.identifier.citation: Xu, H., Mannor, S. (2012-05). Distributionally robust Markov decision processes. Mathematics of Operations Research 37 (2) : 288-300. ScholarBank@NUS Repository. https://doi.org/10.1287/moor.1120.0540
dc.identifier.issn: 0364765X
dc.identifier.uri: http://scholarbank.nus.edu.sg/handle/10635/59978
dc.description.abstract: We consider Markov decision processes where the values of the parameters are uncertain. This uncertainty is described by a sequence of nested sets (that is, each set contains the previous one), each of which corresponds to a probabilistic guarantee for a different confidence level. Consequently, a set of admissible probability distributions of the unknown parameters is specified. This formulation models the case where the decision maker is aware of and wants to exploit some (yet imprecise) a priori information of the distribution of parameters, and it arises naturally in practice where methods for estimating the confidence region of parameters abound. We propose a decision criterion based on distributional robustness: the optimal strategy maximizes the expected total reward under the most adversarial admissible probability distributions. We show that finding the optimal distributionally robust strategy can be reduced to the standard robust MDP where parameters are known to belong to a single uncertainty set; hence, it can be computed in polynomial time under mild technical conditions. © 2012 INFORMS.
dc.description.uri: http://libproxy1.nus.edu.sg/login?url=http://dx.doi.org/10.1287/moor.1120.0540
dc.source: Scopus
dc.subject: Distributional robustness
dc.subject: Markov decision process
dc.subject: Parameter uncertainty
dc.type: Article
dc.contributor.department: MECHANICAL ENGINEERING
dc.description.doi: 10.1287/moor.1120.0540
dc.description.sourcetitle: Mathematics of Operations Research
dc.description.volume: 37
dc.description.issue: 2
dc.description.page: 288-300
dc.description.coden: MORED
dc.identifier.isiut: 000304227700005
Appears in Collections: Staff Publications
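
The abstract states that the distributionally robust problem reduces to a standard robust MDP with a single uncertainty set. As an illustration of that target problem only, the following is a minimal sketch of robust value iteration. It is not the authors' algorithm or their reduction. It assumes a finite set of candidate transition models, a worst case chosen independently for each state-action pair, and a discounted infinite horizon. The function name `robust_value_iteration`, the array layout, and the toy numbers are all hypothetical.

```python
import numpy as np


def robust_value_iteration(rewards, kernels, gamma=0.9, tol=1e-8, max_iter=10_000):
    """Worst-case value iteration over a finite uncertainty set (illustrative).

    rewards: array of shape (S, A), reward for each state-action pair.
    kernels: array of shape (K, S, A, S), K candidate transition models.
    Assumption: the adversary picks the worst candidate separately for
    every (state, action) pair.
    """
    n_states, _ = rewards.shape
    v = np.zeros(n_states)
    for _ in range(max_iter):
        # Expected next-state value under each candidate model: shape (K, S, A).
        expected_next = kernels @ v
        # Adversarial choice per (state, action): shape (S, A).
        worst_case = expected_next.min(axis=0)
        v_new = (rewards + gamma * worst_case).max(axis=1)
        converged = np.max(np.abs(v_new - v)) < tol
        v = v_new
        if converged:
            break
    # Greedy policy with respect to the final worst-case value estimate.
    q = rewards + gamma * (kernels @ v).min(axis=0)
    return v, q.argmax(axis=1)


# Toy problem: state 1 is absorbing with zero reward.
# In state 0, action 0 pays 1 and always stays in state 0.
# Action 1 pays 2 but, under the second candidate model, jumps to state 1.
rewards = np.array([[1.0, 2.0],
                    [0.0, 0.0]])
kernels = np.zeros((2, 2, 2, 2))
kernels[:, 0, 0, 0] = 1.0   # action 0 in state 0: stay, in both models
kernels[0, 0, 1, 0] = 1.0   # action 1 in state 0, model 0: stay
kernels[1, 0, 1, 1] = 1.0   # action 1 in state 0, model 1: jump to state 1
kernels[:, 1, :, 1] = 1.0   # state 1 is absorbing in both models

values, policy = robust_value_iteration(rewards, kernels)
```

In this toy problem the robust policy prefers the safe action 0 in state 0, whereas planning against the first candidate model alone would prefer action 1.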
