Busca avançada
Ano de início
Entree
(Referência obtida automaticamente do Web of Science, por meio da informação sobre o financiamento pela FAPESP e o número do processo correspondente, incluída na publicação pelos autores.)

Robust active learning for the diagnosis of parasites

Texto completo
Autor(es):
Saito, Priscila T. M. [1, 2] ; Suzuki, Celso T. N. [1, 3] ; Gomes, Jancarlo F. [1, 4] ; de Rezende, Pedro J. [1] ; Falcao, Alexandre X. [1]
Número total de Autores: 5
Afiliação do(s) autor(es):
[1] Univ Estadual Campinas, Inst Comp, BR-13083852 Campinas, SP - Brazil
[2] Fed Univ Technol, Dept Comp, Curitiba, Parana - Brazil
[3] IMMUNOCAMP Res & Dev Technol, Campinas, SP - Brazil
[4] Univ Estadual Campinas, Inst Biol, BR-13083852 Campinas, SP - Brazil
Número total de Afiliações: 4
Tipo de documento: Artigo Científico
Fonte: PATTERN RECOGNITION; v. 48, n. 11, SI, p. 3572-3583, NOV 2015.
Citações Web of Science: 8
Resumo

We have developed an automated system for the diagnosis of intestinal parasites from optical microscopy images. The objects (species of parasites and impurities) segmented from these images form a large dataset We are interested in the active learning problem of selecting a reasonably small number of objects to be labeled under an expert's supervision for use in training a pattern classifier. However, impurities are very numerous, constitute several clusters in the feature space, and can be quite similar to some species of parasites, leading to a significant challenge for active learning methods. We propose a technique that pre-organizes the data and then properly balances the selection of samples from all classes and uncertain samples for training. Early data organization avoids reprocessing of the large dataset at each learning iteration, enabling the halting of sample selection after a desired number of samples per iteration, yielding interactive response time. We validate our method by comparing it with state-of-the-art approaches, using a previously labeled dataset of almost 6000 objects. Moreover, we report results from experiments on a very realistic scenario, consisting of a dataset with over 140,000 unlabeled objects, under unbalanced classes, the absence of some classes, and the presence of a very large set of impurities. (C) 2015 Elsevier Ltd. All rights reserved. (AU)

Processo FAPESP: 07/52015-0 - Métodos de aproximação para computação visual
Beneficiário:Jorge Stolfi
Linha de fomento: Auxílio à Pesquisa - Temático