Evaluation of statistical and machine learning models for time series prediction: Identifying the state-of-the-art and the best conditions for the use of each model

Sabino Parmezan, Antonio Rafael; Souza, Vinicius M. A.; Batista, Gustavo E. A. P. A.

Full text
Author(s):	Sabino Parmezan, Antonio Rafael ^[1] ; Souza, Vinicius M. A. ^[1] ; Batista, Gustavo E. A. P. A. ^[1] Total Authors: 3
Affiliation:	^[1] Univ Sao Paulo, Lab Computat Intelligence, Inst Ciencias Matemat & Comp, Ave Trabalhador Sao Carlense 400, BR-13566590 Sao Carlos, SP - Brazil Total Affiliations: 1
Document type:	Journal article
Source:	INFORMATION SCIENCES; v. 484, p. 302-337, MAY 2019.
Web of Science Citations:	2
Abstract
The choice of the most promising algorithm to model and predict a particular phenomenon is one of the most prominent activities of the temporal data forecasting. Forecasting (or prediction), similarly to other data mining tasks, uses empirical evidence to select the most suitable model for a problem at hand since no modeling method can be considered as the best. However, according to our systematic literature review of the last decade, few scientific publications rigorously expose the benefits and limitations of the most popular algorithms for time series prediction. At the same time, there is a limited performance record of these models when applied to complex and highly nonlinear data. In this paper, we present one of the most extensive, impartial and comprehensible experimental evaluations ever done in the time series prediction field. From 95 datasets, we evaluate eleven predictors, seven parametric and four non-parametric, employing two multi-step-ahead projection strategies and four performance evaluation measures. We report many lessons learned and recommendations concerning the advantages, drawbacks, and the best conditions for the use of each model. The results show that SARIMA is the only statistical method able to outperform, but without a statistical difference, the following machine learning algorithms: ANN, SVM, and kNN-TSPI. However, such forecasting accuracy comes at the expense of a larger number of parameters. The evaluated datasets, as well detailed results achieved by different indexes as MSE, Theil's U coefficient, POCID, and a recently-proposed multi-criteria performance measure are available online in our repository. Such repository is another contribution of this paper since other researchers can replicate our results and evaluate their methods more rigorously. The findings of this study will impact further research on this topic since they provide a broad insight into models selection, parameters setting, evaluation measures, and experimental setup. (C) 2019 Elsevier Inc. All rights reserved. (AU)

FAPESP's process:	18/05859-3 - Intelligent traps and sensors: an innovative approach to control insect pests and disease vectors
Grantee:	Vinícius Mourão Alves de Souza
Support Opportunities:	Scholarships in Brazil - Post-Doctoral


FAPESP's process:	16/04986-6 - Intelligent traps and sensors: an innovative approach to control insect pests and disease vectors
Grantee:	Gustavo Enrique de Almeida Prado Alves Batista
Support Opportunities:	Research Grants - eScience and Data Science Program - Regular Program Grants


FAPESP's process:	13/10978-8 - Similarity-based Time Series Prediction
Grantee:	Antonio Rafael Sabino Parmezan
Support Opportunities:	Scholarships in Brazil - Master

Short URL