Busca avançada
Ano de início
Entree
(Referência obtida automaticamente do Web of Science, por meio da informação sobre o financiamento pela FAPESP e o número do processo correspondente, incluída na publicação pelos autores.)

Applying a kernel function on time-dependent data to provide supervised-learning guarantees

Texto completo
Autor(es):
Pagliosa, Lucas de Carvalho ; de Mello, Rodrigo Fernandes
Número total de Autores: 2
Tipo de documento: Artigo Científico
Fonte: EXPERT SYSTEMS WITH APPLICATIONS; v. 71, p. 216-229, APR 1 2017.
Citações Web of Science: 7
Resumo

The Statistical Learning Theory (SLT) defines five assumptions to ensure learning for supervised algorithms. Data independency is one of those assumptions, once the SLT relies on the Law of Large Numbers to ensure learning bounds. As a consequence, this assumption imposes a strong limitation to guarantee learning on time-dependent scenarios. In order to tackle this issue, some researchers relax this assumption with the detriment of invalidating all theoretical results provided by the SLT. In this paper we apply a kernel function, more precisely the Takens' immersion theorem, to reconstruct time-dependent openended sequences of observations, also referred to as data streams in the context of Machine Learning, into multidimensional spaces (a.k.a. phase spaces) in attempt to hold the data independency assumption. At first, we study the best immersion parameterization for our kernel function using the Distance-Weighted Nearest Neighbors (DWNN). Next, we use this best immersion to recursively forecast next observations based on the prediction horizon, estimated using the Lyapunov exponent. Afterwards, predicted observations are compared against the expected ones using the Mean Distance from the Diagonal Line (MDDL). Theoretical and experimental results based on a cross-validation strategy provide stronger evidences of generalization, what allows us to conclude that one can learn from time-dependent data after using our approach. This opens up a very important possibility for ensuring supervised learning when it comes to time-dependent data, being useful to tackle applications such as in the climate, animal tracking, biology and other domains. (C) 2016 Elsevier Ltd. All rights reserved. (AU)

Processo FAPESP: 15/22406-4 - Otimização de reconstrução do espaço-fase de séries temporais
Beneficiário:Lucas de Carvalho Pagliosa
Modalidade de apoio: Bolsas no Brasil - Doutorado
Processo FAPESP: 14/13323-5 - Abordagem baseada na estabilidade de algoritmos de agrupamento de dados para garantir a detecção de mudanças de conceito em fluxos de dados
Beneficiário:Rodrigo Fernandes de Mello
Modalidade de apoio: Auxílio à Pesquisa - Regular