An effective combination of loss gradients for multi-task learning applied on instance segmentation and depth estimation

Nakamura, Angelica Tiemi Mizuno; Grassi Jr, Valdir; Wolf, Denis Fernando

Texto completo
Autor(es):	Nakamura, Angelica Tiemi Mizuno ^[1] ; Grassi Jr, Valdir ; Wolf, Denis Fernando ^[1] Número total de Autores: 3
Afiliação do(s) autor(es):	^[1] Univ Sao Paulo, Inst Math & Comp Sci, Mobile Robot Lab, Sao Carlos - Brazil Número total de Afiliações: 1
Tipo de documento:	Artigo Científico
Fonte:	ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE; v. 100, APR 2021.
Citações Web of Science:	0
Resumo
Advanced driver assistance systems are responsible for assisting decision making and can play an important role in safety and traffic efficiency. Such systems require robust perception methods to handle complex urban scenes, and one way to achieve this is through instance segmentation. However, due to the difficulty in separating overlapping objects into different instances, this task becomes very challenging. For this, several authors proposed CNN-based methods and used depth information to enhance the instance segmentation performance. A promising way to explore this information is by adopting a multi-task learning approach, in which multiple tasks are learned simultaneously by sharing the same architecture. Usually, this combination is made by the weighted sum of loss functions, in which the weight of each task is defined manually. Nonetheless, when tasks have different natures with variation in the order of magnitude, performing this combination during training so that all tasks converge towards their optimal solution is not trivial. Aiming to get the best possible solution, we modeled the multi-task learning as a multiobjective optimization problem and, as the main contribution of this paper, we proposed a greedy approach to find the weighting coefficients for each task, performing a trade-off between tasks that allow the optimization of multiple loss functions. Experimental results showed that it is possible to enhance instance segmentation when depth information is properly explored. Moreover, not only did depth information help instance segmentation, but also did the instance segmentation help the depth estimations, achieving better performance compared to single-task models. (AU)

Processo FAPESP:	19/03366-2 - Segmentação espacial de instâncias a partir de câmera monocular utilizando redes neurais convolutivas
Beneficiário:	Angelica Tiemi Mizuno Nakamura
Modalidade de apoio:	Bolsas no Brasil - Doutorado

URL curto