Busca avançada
Ano de início
Entree


On Convolutional Autoencoders to Speed Up Similarity-Based Time Series Mining

Texto completo
Autor(es):
Mostrar menos -
Aragao da Silva, Yuri Gabriel ; Silva, Diego Furtado ; Wu, XT ; Jermaine, C ; Xiong, L ; Hu, XH ; Kotevska, O ; Lu, SY ; Xu, WJ ; Aluru, S ; Zhai, CX ; Al-Masri, E ; Chen, ZY ; Saltz, J
Número total de Autores: 14
Tipo de documento: Artigo Científico
Fonte: 2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA); v. N/A, p. 10-pg., 2020-01-01.
Resumo

Time series represent a type of data that is increasingly present in research and industry applications. The most commonly used approach to obtain knowledge from these data is similarity-based data mining algorithms. However, in large volumes of data, applying these algorithms may be infeasible. Therefore, several techniques are proposed in the literature to accelerate the distance calculation between time series, such as algorithms' adaptations, indexing, and approximations. Recent results show that the union of techniques in different categories usually improves efficiency in time series mining tasks by similarity. In this way, we evaluate the dimensionality reduction through convolutional autoencoders to speed up time series distance calculation. To this end, we propose an offline training phase, with data previously observed in other domains, before applying the autoencoders. Our proposal is orthogonal to state-of-the-art tools so that we can use it as a complementary technique. We show that our proposal can lead similarity-based algorithms to execute up to two orders of magnitude faster than these tools alone, without loss of quality in the results obtained through all-pairwise distance calculation, similarity search, and motif discovery. (AU)

Processo FAPESP: 19/06080-2 - Aprendizado de representação baseado em stacked autoencoders para dados temporais e trajetórias
Beneficiário:Yuri Gabriel Aragão da Silva
Modalidade de apoio: Bolsas no Brasil - Iniciação Científica
Processo FAPESP: 17/24340-6 - Mineração de Dados para Análise Individual e de Equipe em Esportes Coletivos
Beneficiário:Diego Furtado Silva
Modalidade de apoio: Auxílio à Pesquisa - Regular