euristics, Answer Set Programming and Markov Decision Process for Solving a Set of Spatial Puzzles{*

dos Santos, Thiago Freitas; Santos, Paulo E.; Ferreira, Leonardo Anjoletto; Bianchi, Reinaldo A. C.; Cabalar, Pedro

Texto completo
Autor(es):	dos Santos, Thiago Freitas ^[1] ; Santos, Paulo E. ^{[2, 1]} ; Ferreira, Leonardo Anjoletto ^[1] ; Bianchi, Reinaldo A. C. ^[1] ; Cabalar, Pedro ^[3] Número total de Autores: 5
Afiliação do(s) autor(es):	^[1] Ctr Univ FEI, Elect Engn Dept, Sao Bernardo Do Campo - Brazil ^[2] Flinders Univ S Australia, Coll Sci & Engn, Adelaide, SA - Australia ^[3] Univ A Coruna, Dept Comp Sci, La Coruna - Spain Número total de Afiliações: 3
Tipo de documento:	Artigo Científico
Fonte:	APPLIED INTELLIGENCE; v. 52, n. 4 JUL 2021.
Citações Web of Science:	0
Resumo
Spatial puzzles composed of rigid objects, flexible strings and holes offer interesting challenges for reasoning about spatial entities that are common in the human daily-life's activities. This motivates the use of spatial puzzles as domains of study in this work. The goal of this paper is to investigate the automated solution of this kind of problems by extending an algorithm that combines Answer Set Programming (ASP) with Markov Decision Process (MDP) and Reinforcement Learning (RL), called oASP(MDP). This method is capable of constructing the set of domain states online, i.e., while the agent interacts with a changing environment. The aim of the extension proposed in this work is to add heuristics as a mechanism to accelerate the learning process, resulting in the main contribution of this paper: the Heuristic oASP(MDP) (HoASP(MDP)) algorithm. Experiments were performed on deterministic, non-deterministic and non-stationary versions of the puzzles. Results show that the proposed approach can considerably accelerate the learning process, outperforming other state-of-the-art methods. (AU)

Processo FAPESP:	16/21047-3 - ALIS: Aprendizado Autônomo em Sistemas Inteligentes
Beneficiário:	Anna Helena Reali Costa
Modalidade de apoio:	Auxílio à Pesquisa - Regular


Processo FAPESP:	17/07833-9 - Heurística e planejamento eficiente para problemas espaciais
Beneficiário:	Thiago Freitas dos Santos
Modalidade de apoio:	Bolsas no Brasil - Mestrado


Processo FAPESP:	16/18792-9 - Descrição, representação e solução de jogos espaciais
Beneficiário:	Paulo Eduardo Santos
Modalidade de apoio:	Auxílio à Pesquisa - Parceria para Inovação Tecnológica - PITE

URL curto