Busca avançada
Ano de início
Entree


Artificial intelligence in healthcare applications targeting cancer diagnosis-part II: interpreting the model outputs and spotlighting the performance metrics

Texto completo
Autor(es):
Mostrar menos -
Araujo, Anna Luiza Damaceno ; Sperandio, Marcelo ; Calabrese, Giovanna ; Faria, Sarah S. ; Cardenas, Diego Armando Cardona ; Martins, Manoela Domingues ; Vargas, Pablo Agustin ; Lopes, Marcio Ajudarte ; Santos-Silva, Alan Roger ; Kowalski, Luiz Paulo ; Moraes, Matheus Cardoso
Número total de Autores: 11
Tipo de documento: Artigo Científico
Fonte: ORAL SURGERY ORAL MEDICINE ORAL PATHOLOGY ORAL RADIOLOGY; v. 140, n. 1, p. 11-pg., 2025-07-01.
Resumo

Background. The lack of standardized performance assessment metrics and the inconsistent reporting of results can lead to the presentation of overly optimistic outcomes that fail to accurately represent key aspects of the Machine Learning framework and may not align with real-world clinical needs. Methods. This conceptual review of the literature compiled the theoretical basis for performance analysis of binary and multiclass models. Results. Accuracy and error rates are straightforward but not ideal if dataset is imbalanced. Sensitivity (recall) and specificity are essential (cancer patients correctly identified as having cancer and benign patients accurately classified as such), as well as precision (identification of true cancer cases among those predicted to have it without falsely labeling healthy individuals as diseased). F1-Score balances precision and recall, while AuC combines sensitivity and specificity, assessing performance across different distributions. Kaplan-Meier curves and log-rank tests offer further insights into model performance over time, especially in survival contexts. Conclusion. Each evaluation metric highlights specific aspects of Convolutional Neural Network training, making it unfeasible to choose just a few (generally the most "convenient" ones) to report in research. (Oral Surg Oral Med Oral Pathol Oral Radiol 2025;140:89-99) (AU)

Processo FAPESP: 21/14585-7 - Inteligência artificial aplicada ao diagnóstico clínico e histopatológico do Câncer de Cabeça e Pescoço
Beneficiário:Anna Luiza Damaceno Araujo
Modalidade de apoio: Bolsas no Brasil - Pós-Doutorado