| Grant number: | 15/16310-4 |
| Support Opportunities: | Scholarships in Brazil - Doctorate |
| Start date: | December 01, 2015 |
| End date: | September 30, 2019 |
| Field of knowledge: | Physical Sciences and Mathematics - Computer Science |
| Principal Investigator: | Anna Helena Reali Costa |
| Grantee: | Felipe Leno da Silva |
| Host Institution: | Escola Politécnica (EP). Universidade de São Paulo (USP). São Paulo , SP, Brazil |
| Associated scholarship(s): | 18/00344-5 - Reusing previous task solutions in multiagent reinforcement learning, BE.EP.DR |
Abstract Reinforcement Learning is a powerful to train intelligent agents because the learning is performed completely autonomously. This learning is accomplished through repetitive interactions between agents and the environment through trial and error, until agents have enough information to properly actuate in order to solve a given task. However, an agent can take a long time to determine which actions are more appropriate for each situation. In order to work around this problem, researchers have started to utilize Transfer Learning solutions in which, after learning a task, the acquired knowledge is reused to accelerate the learning of a new similar task. If multiple agents are acting at the same time in an environment, a fault robust, scalable and highly parallel system can be obtained. However, in this case new problems arise, for example, the state space explosion and the difficulty of predicting consequences of joint actions. Researches proposed partial solutions for these problems, in which Transfer Learning has been proved to be beneficial also to multi-agent domains. Yet, the existing Transfer Learning methods must be improved to allow their application in complex domains. This research aims to propose methods, which deal some questions that have been only superficially answered by the state of art methods. Among these question are: How to properly abstract the knowledge acquired in the learning? How to represent this knowledge? How communicate between agents to transmit learned task knowledge? How to deal with partial observability? | |
| News published in Agência FAPESP Newsletter about the scholarship: | |
| More itemsLess items | |
| TITULO | |
| Articles published in other media outlets ( ): | |
| More itemsLess items | |
| VEICULO: TITULO (DATA) | |
| VEICULO: TITULO (DATA) | |