Advanced search
Start date
Betweenand

Real Time Dynamic Programming and Monte-Carlo Simulation for Probabilistic Planning

Grant number: 11/16962-0
Support Opportunities:Scholarships in Brazil - Master
Start date: March 01, 2012
End date: September 30, 2014
Field of knowledge:Physical Sciences and Mathematics - Computer Science - Theory of Computation
Principal Investigator:Leliane Nunes de Barros
Grantee:Luis Gustavo Rocha Vianna
Host Institution: Instituto de Matemática e Estatística (IME). Universidade de São Paulo (USP). São Paulo , SP, Brazil
Associated research grant:08/03995-5 - Logprob: probabilistic logic --- foundations and computational applications, AP.TEM
Associated scholarship(s):12/10861-0 - Asynchronous dynamic programming for discrete and continuous Markov decision processes, BE.EP.MS

Abstract

Probabilistic planning problems are often modeled as Markov Decision Processes (MDPs). The Solution to one such problem is an optimal policy, i.e., a mapping from states to actions that will bring the greatest estimated reward for an agent that follows it. A well known class of algorithms in the case that the initial state and the goals are known is based on Real Time Dynamic Programming (RTDP), whilst another possibility is the Monte Carlo Tree Search, based on stochastic simulations, in this work we intend to compare and analyse this two kind of solutions.

News published in Agência FAPESP Newsletter about the scholarship:
More itemsLess items
Articles published in other media outlets ( ):
More itemsLess items
VEICULO: TITULO (DATA)
VEICULO: TITULO (DATA)

Scientific publications
(References retrieved automatically from Web of Science and SciELO through information on FAPESP grants and their corresponding numbers as mentioned in the publications by the authors)
VIANNA, LUIS G. R.; DE BARROS, LELIANE N.; SANNER, SCOTT; AAAI. Real-Time Symbolic Dynamic Programming for Hybrid MDPs. PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, v. N/A, p. 7-pg., . (11/16962-0)
Academic Publications
(References retrieved automatically from State of São Paulo Research Institutions)
VIANNA, Luis Gustavo Rocha. Approximate and asynchronous symbolic dynamic programming for Markov decision processes in continuous spaces. 2015. Master's Dissertation - Universidade de São Paulo (USP). Instituto de Matemática e Estatística (IME/SBI) São Paulo.