Biblioteca Virtual - Centro de Documentação e Informação da FAPESP

Busca avançada

Pesquisar - Utilize aspas para obter um resultado mais específico

Índice

Área do conhecimento

Ano de início

Entree

(Referência obtida automaticamente do Web of Science, por meio da informação sobre o financiamento pela FAPESP e o número do processo correspondente, incluída na publicação pelos autores.)

Texto completo
Autor(es):	de Aquino Afonso, Bruno Klaus ^[1] ; Berton, Lilian ^[1] Número total de Autores: 2
Afiliação do(s) autor(es):	^[1] Fed Univ Sao Paulo UNIFESP, Inst Sci & Technol, Sao Jose Dos Campos, SP - Brazil Número total de Afiliações: 1
Tipo de documento:	Artigo Científico
Fonte:	PATTERN RECOGNITION LETTERS; v. 140, p. 127-134, DEC 2020.
Citações Web of Science:	0
Resumo
Obtaining data with meaningful labels is often costly and error-prone. In this situation, semi-supervised learning (SSL) approaches are interesting, as they leverage assumptions about the unlabeled data to make up for the limited amount of labels. However, in real-world situations, we cannot assume that the labeling process is infallible, and the accuracy of many SSL classifiers decreases significantly in the presence of label noise. In this work, we introduce the LGC\_LVOf , a leave-one-out filtering approach based on the Local and Global Consistency (LGC) algorithm. Our method aims to detect and remove wrong labels, and thus can be used as a preprocessing step to any SSL classifier. Given the propagation matrix, detecting noisy labels takes O(cl) per step, with c the number of classes and l the number of labels. Moreover, one does not need to compute the whole matrix, but only al x l submatrix corresponding to interactions between labeled instances. As a result, our approach is best suited to datasets with a large amount of unlabeled data but not many labels. Results are provided for a number of datasets, including MNIST and ISOLET . LGC\_LVOf appears to be equally or more precise than the adapted gradient-based filter, and thus can be used in practice for active learning, where it may iteratively send labels for re-evaluation. We show that the best-case accuracy of the embedding of LGC\_LVOf into LGC yields performance comparable to the best-case of t 1-based classifiers designed to be robust to label noise. (C) 2020 Elsevier B.V. All rights reserved. (AU)

Processo FAPESP:	18/01722-3 - Aprendizado semissupervisionado via redes complexas: construção de redes, seleção e propagação de rótulos e aplicações
Beneficiário:	Lilian Berton
Modalidade de apoio:	Auxílio à Pesquisa - Regular


Processo FAPESP:	18/15014-0 - Redução no ruído dos rótulos no aprendizado semi-supervisionado baseado em grafos
Beneficiário:	Bruno Klaus de Aquino Afonso
Modalidade de apoio:	Bolsas no Brasil - Mestrado