Advanced search
Start date
Betweenand


Machine learning for the prediction of toxicities from head and neck cancer treatment: A systematic review with meta-analysis

Full text
Author(s):
Show less -
Araujo, Anna Luiza Damaceno ; Moraes, Matheus Cardoso ; Perez-di-Oliveira, Maria Eduarda ; da Silva, Viviane Mariano ; Saldivia-Siracusa, Cristina ; Pedroso, Caique Mariano ; Lopes, Marcio Ajudarte ; Vargas, Pablo Agustin ; Kochanny, Sara ; Pearson, Alexander ; Khurram, Syed Ali ; Kowalski, Luiz Paulo ; Migliorati, Cesar Augusto ; Santos-Silva, Alan Roger
Total Authors: 14
Document type: Journal article
Source: Oral Oncology; v. 140, p. 15-pg., 2023-04-04.
Abstract

Introduction: The aim of the present systematic review (SR) is to summarize Machine Learning (ML) models currently used to predict head and neck cancer (HNC) treatment-related toxicities, and to understand the impact of image biomarkers (IBMs) in prediction models (PMs). The present SR was conducted following the guidelines of the PRISMA 2022 and registered in PROSPERO database (CRD42020219304). Methods: The acronym PICOS was used to develop the focused review question (Can PMs accurately predict HNC treatment toxicities?) and the eligibility criteria. The inclusion criteria enrolled Prediction Model Studies (PMSs) with patient cohorts that were treated for HNC and developed toxicities. Electronic database search encompassed PubMed, EMBASE, Scopus, Cochrane Library, Web of Science, LILACS, and Gray Literature (Google Scholar and ProQuest). Risk of Bias (RoB) was assessed through PROBAST and the results were synthesized based on the data format (with and without IBMs) to allow comparison. Results: A total of 28 studies and 4,713 patients were included. Xerostomia was the most frequently investigated toxicity (17; 60.71 %). Sixteen (57.14 %) studies reported using radiomics features in combination with clinical or dosimetrics/dosiomics for modelling. High RoB was identified in 23 studies. Meta-analysis (MA) showed an area under the receiver operating characteristics curve (AUROC) of 0.82 for models with IBMs and 0.81 for models without IBMs (p value < 0.001), demonstrating no difference among IBM-and non-IBM-based models. Discussion: The development of a PM based on sample-specific features represents patient selection bias and may affect a model's performance. Heterogeneity of the studies as well as non-standardized metrics prevent proper comparison of studies, and the absence of an independent/external test does not allow the evaluation of the model's generalization ability. Conclusion: IBM-featured PMs are not superior to PMs based on non-IBM predictors. The evidence was appraised as of low certainty. (AU)

FAPESP's process: 22/07276-0 - Clinicopathological and digital predictors of recurrence and malignancy of oral leukoplakia and proliferative verrucous leukoplakia: a clinical trial associated with the use of artificial intelligence
Grantee:Caique Mariano Pedroso
Support Opportunities: Scholarships in Brazil - Doctorate
FAPESP's process: 21/14585-7 - Artificial intelligence applied to the clinical and histopathological diagnosis of Head and Neck Cancer
Grantee:Anna Luiza Damaceno Araujo
Support Opportunities: Scholarships in Brazil - Post-Doctoral
FAPESP's process: 09/53839-2 - Creation of a Digital Pathology Laboratory using a histological slidescanner
Grantee:Oslei Paes de Almeida
Support Opportunities: Multi-user Equipment Program