Busca avançada
Ano de início
Entree
(Referência obtida automaticamente do Web of Science, por meio da informação sobre o financiamento pela FAPESP e o número do processo correspondente, incluída na publicação pelos autores.)

Multiple instance classification: Bag noise filtering for negative instance noise cleaning

Texto completo
Autor(es):
Luengo, Julian [1] ; Sanchez-Tarrago, Danel [2] ; Prati, Ronaldo C. [3] ; Herrera, Francisco [1]
Número total de Autores: 4
Afiliação do(s) autor(es):
[1] Univ Granada, Dept Comp Sci & Artificial Intelligence, Granada 18071 - Spain
[2] Cent Univ Marta Abreu Las Villas, Dept Comp Sci, Santa Clara - Cuba
[3] Fed Univ ABC UFABC, Ctr Math Comp Sci & Cognit CMCC, Santo Andre, SP - Brazil
Número total de Afiliações: 3
Tipo de documento: Artigo Científico
Fonte: INFORMATION SCIENCES; v. 579, p. 388-400, NOV 2021.
Citações Web of Science: 0
Resumo

Data in the real world is far from being perfect. The appearance of noise is a common issue that arises from the limitations of data acquisition mechanisms and human knowledge. In classification, label noise will hinder the performance of almost all classifiers, inducing a bias in the built model. While label noise has recently attracted researchers' attention in standard classification, it has only recently begun to be studied in multiple instance clas-sification. In this work, we propose the usage of filtering algorithms for multiple instance classification that are able to reduce the impact of negative instances within the bags. In order to do so, we decompose the bags to form a standard classification problem that can be efficiently treated by a specialized noise filter. Such a decomposition is tackled in different ways, with the aim of exploiting the knowledge offered by the examples from opposite bags. The bags are then rebuilt, without the identified noise instances. In our experiments, we show that by applying our approach we can diminish the impact of noise and even obtain better results at 0% noise level for several classifiers. Our approach sets out a promising approach to dealing with noise in the bags of multiple instance datasets and further improve the classification rate of the built models. (c) 2021 Published by Elsevier Inc. (AU)

Processo FAPESP: 15/20606-6 - Rótulos imprecisos em Aprendizado de Máquina: Medidas de avaliação e algoritmos de aprendizado de máquina
Beneficiário:Ronaldo Cristiano Prati
Modalidade de apoio: Bolsas no Exterior - Pesquisa