Heuristic Selection of Actions in Multiagent Reinforcement Learning

Advanced search

Search - Use quotation marks to get a more specific result

Index

Field of knowledge

Start date

Betweenand

Author(s):	Bianchi, Reinaldo A. C. ; Ribeiro, Carlos H. C. ; Costa, Anna H. R. ; Veloso, MM Total Authors: 4
Document type:	Journal article
Source:	2022 25TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION 2022); v. N/A, p. 6-pg., 2007-01-01.
Abstract
This work presents a new algorithm, called Heuristically Accelerated Minimax-Q (HAMMQ), that allows the use of heuristics to speed up the well-known Multiagent Reinforcement Learning algorithm Minimax-Q. A heuristic function H that influences the choice of the actions characterises the HAMMQ algorithm. This function is associated with a preference policy that indicates that a certain action must be taken instead of another. A set of empirical evaluations were conducted for the proposed algorithm in a simplified simulator for the robot soccer domain, and experimental results show that even very simple heuristics enhances significantly the performance of the multiagent reinforcement learning algorithm. (AU)

FAPESP's process:	06/05667-0 - Heuristic selection of actions in multiagent reinforcement learning.
Grantee:	Reinaldo Augusto da Costa Bianchi
Support Opportunities:	Research Grants - Meeting - Abroad