Advanced search
Start date
Betweenand


(OPF)-P-2: Oversampling via Optimum-Path Forest for Breast Cancer Detection

Full text
Author(s):
Show less -
Passos, Leandro A. ; Jodas, Danilo S. ; Ribeiro, Luiz C. F. ; Moreira, Thierry ; Papa, Joao P. ; DeHerrera, AGS ; Gonzalez, AR ; Santosh, KC ; Temesgen, Z ; Kane, B ; Soda, P
Total Authors: 11
Document type: Journal article
Source: 2020 IEEE 33RD INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS(CBMS 2020); v. N/A, p. 6-pg., 2020-01-01.
Abstract

Breast cancer is among the most deadly diseases, distressing mostly women worldwide. Although traditional methods for detection have presented themselves as valid for the task, they still commonly present low accuracies and demand considerable time and effort from professionals. Therefore, a computer-aided diagnosis (CAD) system capable of providing early detection becomes hugely desirable. In the last decade, machine learning-based techniques have been of paramount importance in this context, since they are capable of extracting essential information from data and reasoning about it. However, such approaches still suffer from imbalanced data, specifically on medical issues, where the number of healthy people samples is, in general, considerably higher than the number of patients. Therefore this paper proposes the (OPF)-P-2, a data oversampling method based on the unsupervised Optimum-Path Forest Algorithm. Experiments conducted over the full oversampling scenario state the robustness of the model, which is compared against three well-established oversampling methods considering three breast cancer and three general-purpose tasks for medical issues datasets. (AU)

FAPESP's process: 13/07375-0 - CeMEAI - Center for Mathematical Sciences Applied to Industry
Grantee:Francisco Louzada Neto
Support Opportunities: Research Grants - Research, Innovation and Dissemination Centers - RIDC
FAPESP's process: 14/12236-1 - AnImaLS: Annotation of Images in Large Scale: what can machines and specialists learn from interaction?
Grantee:Alexandre Xavier Falcão
Support Opportunities: Research Projects - Thematic Grants
FAPESP's process: 19/18287-0 - Real-time Urban Forest Management Using Machine Learning
Grantee:Danilo Samuel Jodas
Support Opportunities: Scholarships in Brazil - Post-Doctoral
FAPESP's process: 19/07665-4 - Center for Artificial Intelligence
Grantee:Fabio Gagliardi Cozman
Support Opportunities: Research Grants - Research Program in eScience and Data Science - Research Centers in Engineering Program