Busca avançada
Ano de início
Entree


When climate variables improve the dengue forecasting: a machine learning approach

Texto completo
Autor(es):
da Silva, Sidney T. ; Gabrick, Enrique C. ; Protachevicz, Paulo R. ; Iarosz, Kelly C. ; Caldas, Ibere L. ; Batista, Antonio M. ; Kurths, Juergen
Número total de Autores: 7
Tipo de documento: Artigo Científico
Fonte: European Physical Journal-Special Topics; v. N/A, p. 15-pg., 2024-06-17.
Resumo

Dengue is a viral vector-borne infectious disease that affects many countries worldwide, infecting around 390 million people per year. The main outbreaks occur in subtropical and tropical countries. We, therefore, study here the influence of climate on dengue. In particular, we consider dengue and meteorological data from Natal (2016-2019), Brazil, Iquitos (2001-2012), Peru, and Barranquilla (2011-2016), Colombia. For the analysis and simulations, we apply machine learning (ML) techniques, especially the random forest (RF) algorithm. We utilize dengue disease cases and climate data delayed by up to one week to forecast the cases of dengue. In addition, regarding as feature in the ML technique, we analyze three possibilities: only dengue cases (D); climate and dengue cases (CD); humidity and dengue cases (HD). Depending on the city, our results show that the climate data can improve or not the forecast. For instance, for Natal, the case D induces a better forecast. For Iquitos, it is better to use all the climate variables. Nonetheless, for Barranquilla, the forecast is better, when we include cases and humidity data. Another important result is that each city has an optimal region based on the training length. For Natal, when we use more than 64% and less than 80% of the time series for training, we obtain results with correlation coefficients (r) among 0.917 and 0.949 and mean absolute errors (MAE) among 57.783 and 71.768 for the D case in forecasting. The optimal range for Iquitos is obtained when 79% up to 88% of the time series is considered for training. For this case, the best case is CD, having a minimum r equal to 0.850 and maximum 0.887, while values of MAE oscillate among 2.780 and 4.156. For Barranquilla, the optimal range occurs between 72% until 82% of length training. In this case, the better approach is HD, where the measures exhibit a minimum r equal to 0.942 and maximum 0.953, while the minimum and maximum MAE vary among 6.085 and 6.669. We show that the forecast of dengue cases is a challenging problem and climate variables do not always help. However, when we include the mentioned climate variables, the most important one is the humidity. (AU)

Processo FAPESP: 22/13761-9 - Dinâmica de Sistemas Complexos
Beneficiário:Iberê Luiz Caldas
Modalidade de apoio: Auxílio à Pesquisa - Pesquisador Visitante - Brasil
Processo FAPESP: 18/03211-6 - Dinâmica não linear
Beneficiário:Iberê Luiz Caldas
Modalidade de apoio: Auxílio à Pesquisa - Temático
Processo FAPESP: 23/12863-5 - Sincronização em redes neuronais com plasticidade sináptica de longa duração
Beneficiário:Paulo Ricardo Protachevicz
Modalidade de apoio: Bolsas no Exterior - Estágio de Pesquisa - Pós-Doutorado
Processo FAPESP: 20/04624-2 - Plasticidade sináptica em redes neuronais
Beneficiário:Paulo Ricardo Protachevicz
Modalidade de apoio: Bolsas no Brasil - Pós-Doutorado