Leveraging convergence behavior to balance conflicting tasks in multi-task learning

Mizuno Nakamura, Angelica Tiemi; Grassi Jr, Valdir; Wolf, Denis Fernando

Texto completo
Autor(es):	Mizuno Nakamura, Angelica Tiemi ; Grassi Jr, Valdir ; Wolf, Denis Fernando Número total de Autores: 3
Tipo de documento:	Artigo Científico
Fonte:	Neurocomputing; v. 511, p. 11-pg., 2022-09-13.
Resumo
Multi-Task Learning is a learning paradigm that uses correlated tasks to improve performance generalization. A common way to learn multiple tasks is through the hard parameter sharing approach, in which a single architecture is used to share the same subset of parameters, creating an inductive bias between them during the training process. Due to its simplicity, potential to improve generalization, and reduce computational cost, it has gained the attention of the scientific and industrial communities. However, tasks often conflict with each other, which makes it challenging to define how the gradients of multiple tasks should be combined to allow simultaneous learning. To address this problem, we use the idea of multi-objective optimization to propose a method that takes into account temporal behaviour of the gradients to create a dynamic bias that adjusts the importance of each task during backpropagation. The result of this method is to give more attention to tasks that are diverging or not being benefited during the last iterations, ensuring that the simultaneous learning is heading to the performance maximization of all tasks. As a result, we empirically show that the proposed method outperforms the state-of-the-art approaches on learning conflicting tasks. Unlike the adopted baselines, our method ensures that all tasks reach good generalization performances. (C) 2022 Elsevier B.V. All rights reserved. (AU)

Processo FAPESP:	14/50851-0 - INCT 2014: Instituto Nacional de Ciência e Tecnologia para Sistemas Autônomos Cooperativos Aplicados em Segurança e Meio Ambiente
Beneficiário:	Marco Henrique Terra
Modalidade de apoio:	Auxílio à Pesquisa - Temático


Processo FAPESP:	19/03366-2 - Segmentação espacial de instâncias a partir de câmera monocular utilizando redes neurais convolutivas
Beneficiário:	Angelica Tiemi Mizuno Nakamura
Modalidade de apoio:	Bolsas no Brasil - Doutorado

URL curto