Autonomous driving: learning to make decisions in uncertain environments

Júnior Anderson Rodrigues da Silva
Total Authors: 1
Document type: Doctoral Thesis
Place of publication: São Carlos.
Institution: Universidade de São Paulo (USP). Instituto de Ciências Matemáticas e de Computação (ICMC/SB)
Defense date:
Examining board members:
Denis Fernando Wolf; Reinaldo Augusto da Costa Bianchi; Diego Furtado Silva; Adriano Almeida Gonçalves Siqueira
Advisor: Denis Fernando Wolf

A vehicle navigating in an urban environment must obey traffic rules by properly setting its speed in order to stay below the road speed limit and to avoid collisions. This is presumably the scenario that autonomous vehicles will face: they will share the roads with other vehicles (autonomous or not), cooperatively interacting with them. In other words, autonomous vehicles should not only follow traffic rules, but should also behave in a way that resembles the behavior of other vehicles. However, manually specifying such behavior is time-consuming and error-prone, since driving on urban roads is a complex task that involves many factors. Furthermore, since interaction between vehicles is inherent to driving, inferring the motion of surrounding vehicles is essential to provide more fluid navigation and to avoid over-reactive behavior. In this sense, the uncertainty arising from noisy sensor measurements and from the unknown behavior of surrounding vehicles cannot be neglected if safe and reliable decisions are to be guaranteed. In this thesis, we propose using Partially Observable Markov Decision Processes (POMDPs) to address the problem of incomplete information inherent to motion planning for autonomous driving. We also propose a variant of Maximum Entropy Inverse Reinforcement Learning (IRL) to learn human expert behavior from demonstration. Three different urban scenarios are covered throughout this work: longitudinal planning at a signalized intersection considering noisy sensor measurements; longitudinal and lateral planning on multi-lane roads in the presence of surrounding vehicles, whose intentions to change lanes are inferred from sequential observations; and longitudinal and lateral planning during merge maneuvers in a highly interactive scenario, in which the autonomous vehicle's behavior is learned from real data containing human demonstrations.
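The core POMDP idea in the abstract — maintaining a belief over hidden state from noisy observations and choosing actions by expected reward — can be illustrated with a minimal single-step sketch. This is not the thesis model: the states, observation likelihoods, and reward values below are hypothetical, and a full POMDP planner would reason over multi-step horizons rather than one greedy decision.

```python
# Minimal POMDP-style longitudinal decision sketch (illustrative only):
# the ego vehicle cannot observe whether the traffic light will stay
# green, so it keeps a belief over that hidden state, updates it with a
# noisy observation, and picks the action with highest expected reward.

STATES = ["stays_green", "turns_red"]   # hidden state (assumed)
ACTIONS = ["keep_speed", "brake"]       # candidate longitudinal actions

# Reward of each action in each hidden state (hypothetical values).
REWARD = {
    "keep_speed": {"stays_green": 1.0, "turns_red": -10.0},
    "brake":      {"stays_green": -0.5, "turns_red": 1.0},
}

# Likelihood of a noisy sensor reading given the hidden state (assumed).
OBS_MODEL = {
    "amber_detected": {"stays_green": 0.2, "turns_red": 0.9},
    "no_amber":       {"stays_green": 0.8, "turns_red": 0.1},
}

def bayes_update(belief, observation):
    """Update the belief over hidden states with one noisy observation."""
    posterior = {s: belief[s] * OBS_MODEL[observation][s] for s in STATES}
    z = sum(posterior.values())
    return {s: p / z for s, p in posterior.items()}

def best_action(belief):
    """One-step greedy choice: maximize reward expected under the belief."""
    expected = {a: sum(belief[s] * REWARD[a][s] for s in STATES)
                for a in ACTIONS}
    return max(expected, key=expected.get)

belief = {"stays_green": 0.5, "turns_red": 0.5}  # uninformative prior
belief = bayes_update(belief, "amber_detected")
action = best_action(belief)
```

Under these assumed numbers, the amber observation shifts the belief toward "turns_red", so the expected-reward criterion selects braking even though a collision-free green crossing is still possible — exactly the kind of uncertainty-aware decision the abstract argues for.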
Results show that our methods compare favorably to approaches that neglect uncertainty during planning, and can also improve IRL performance, adding safety and reliability to decision-making. (AU)
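The Maximum Entropy IRL mentioned above fits a reward function so that the learner's feature expectations match those of the human demonstrations. A heavily simplified single-step sketch follows: one maneuver choice stands in for a full trajectory, the reward is linear in two hand-picked features, and the maneuvers, feature values, and demonstrations are all invented for illustration (none come from the thesis data).

```python
import math

# Single-step Maximum Entropy IRL sketch: the model assigns
# P(a) proportional to exp(w . phi(a)) and ascends the log-likelihood,
# whose gradient is (empirical feature expectation of the demos)
# minus (feature expectation under the current model).

# Candidate merge maneuvers and feature vectors [progress, comfort]
# (hypothetical):
FEATURES = {
    "merge_now":  [1.0, 0.2],
    "wait":       [0.1, 1.0],
    "accelerate": [0.8, 0.0],
}

# Assumed expert demonstrations: the human mostly waits for a gap.
DEMOS = ["wait", "wait", "merge_now", "wait"]

def softmax_policy(w):
    """MaxEnt choice model: P(a) proportional to exp(w . phi(a))."""
    scores = {a: math.exp(sum(wi * fi for wi, fi in zip(w, f)))
              for a, f in FEATURES.items()}
    z = sum(scores.values())
    return {a: s / z for a, s in scores.items()}

def irl_step(w, lr=0.5):
    """One gradient-ascent step on the MaxEnt log-likelihood."""
    k = len(w)
    empirical = [sum(FEATURES[d][i] for d in DEMOS) / len(DEMOS)
                 for i in range(k)]
    policy = softmax_policy(w)
    model = [sum(policy[a] * FEATURES[a][i] for a in FEATURES)
             for i in range(k)]
    return [w[i] + lr * (empirical[i] - model[i]) for i in range(k)]

w = [0.0, 0.0]  # reward weights, initialized to zero
for _ in range(200):
    w = irl_step(w)
policy = softmax_policy(w)
```

After training, the learned reward reproduces the demonstrated preference for waiting: the model's probability mass concentrates on "wait", mirroring its 3-out-of-4 frequency in the demos. The thesis variant operates on sequential trajectories rather than single decisions, but the feature-matching gradient is the same principle.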

FAPESP's process: 18/19732-5 - Decision making and trajectory planning for intelligent vehicles using partially observable Markov decision processes and inverse reinforcement learning
Grantee: Júnior Anderson Rodrigues da Silva
Support Opportunities: Scholarships in Brazil - Doctorate