Advanced search
Start date
Betweenand
(Reference retrieved automatically from Web of Science through information on FAPESP grant and its corresponding number as mentioned in the publication by the authors.)

Using mathematical programming to solve Factored Markov Decision Processes with Imprecise Probabilities

Full text
Author(s):
Delgado, Karina Valdivia [1] ; de Barros, Leliane Nunes [2] ; Cozman, Fabio Gagliardi [3] ; Sanner, Scott [4, 5]
Total Authors: 4
Affiliation:
[1] Univ Sao Paulo, Escola Artes Ciencias & Humanidades, BR-05508 Sao Paulo - Brazil
[2] Univ Sao Paulo, Inst Matemat & Estat, BR-05508 Sao Paulo - Brazil
[3] Univ Sao Paulo, Escola Politecn, BR-05508 Sao Paulo - Brazil
[4] Australian Natl Univ, Canberra, ACT 2601 - Australia
[5] NICTA, Canberra, ACT 2601 - Australia
Total Affiliations: 5
Document type: Journal article
Source: INTERNATIONAL JOURNAL OF APPROXIMATE REASONING; v. 52, n. 7, p. 1000-1017, OCT 2011.
Web of Science Citations: 3
Abstract

This paper investigates Factored Markov Decision Processes with Imprecise Probabilities (MDPIPs); that is, Factored Markov Decision Processes (MDPs) where transition probabilities are imprecisely specified. We derive efficient approximate solutions for Factored MDPIPs based on mathematical programming. To do this, we extend previous linear programming approaches for linear approximations in Factored MDPs, resulting in a multilinear formulation for robust ``maximin{''} linear approximations in Factored MDPIPs. By exploiting the factored structure in MDPIPs we are able to demonstrate orders of magnitude reduction in solution time over standard exact non-factored approaches, in exchange for relatively low approximation errors, on a difficult class of benchmark problems with millions of states. (C) 2011 Elsevier Inc. All rights reserved. (AU)

FAPESP's process: 08/03995-5 - Logprob: probabilistic logic --- foundations and computational applications
Grantee:Marcelo Finger
Support Opportunities: Research Projects - Thematic Grants