Advanced search
Start date
Betweenand

Complexity-invariance for classification, clustering and motif discovery in time series

Abstract

Recently, there is an increasing interest in time series processing due to the large number of application domains that generate data with such property. Such interest can be measured by the vast amount of methods recently proposed in literature to tasks such as classification, clustering, summarization, abnormality detection and motif discovery. Recent studies have shown for several problems that methods based on similarity present an efficacy that is hardly surpassed, even when compared to more sophisticated methods. This is mainly due to the fact that the community has studied and proposed several invariances to distance measures for time series. The invariances make the distance measures ignore certain undesired data properties. The most well-known example is the invariance to local differences in time scale, obtained with the warping technique. Other invariances include the invariance to differences in amplitude and offset, phase and occlusion. Recently, we demonstrated to the scientific community that time series similarity classification methods can be largely benefited by a new invariance: complexity invariance. The main objective of this research project is to investigate new complexity-invariant distance measures and assess how such measures can improve the efficacy especially of clustering and motif discovery algorithms. (AU)

Articles published in Agência FAPESP Newsletter about the research grant:
More itemsLess items
Articles published in other media outlets ( ):
More itemsLess items
VEICULO: TITULO (DATA)
VEICULO: TITULO (DATA)

Scientific publications (4)
(References retrieved automatically from Web of Science and SciELO through information on FAPESP grants and their corresponding numbers as mentioned in the publications by the authors)
BATISTA, GUSTAVO; SILVA, DIEGO; PRATI, RONALDO; WANI, MA; KHOSHGOFTAAR, T; ZHU, X; SELIYA, N. An Experimental Design to Evaluate Class Imbalance Treatment Methods. 2012 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2012), VOL 2, v. N/A, p. 7-pg., . (12/07295-3, 11/04054-2)
PRATI, RONALDO C.; BATISTA, GUSTAVO E. A. P. A.; SILVA, DIEGO F.. Class imbalance revisited: a new experimental setup to assess the performance of treatment methods. KNOWLEDGE AND INFORMATION SYSTEMS, v. 45, n. 1, p. 247-270, . (12/07295-3)
BATISTA, GUSTAVO E. A. P. A.; KEOGH, EAMONN J.; TATAW, OBEN MOSES; DE SOUZA, VINICIUS M. A.. CID: an efficient complexity-invariant distance for time series. DATA MINING AND KNOWLEDGE DISCOVERY, v. 28, n. 3, p. 634-669, . (09/06349-0, 12/07295-3)
SILVA, DIEGO FURTADO; ALVES DE SOUZA, VINICIUS MOURAO; PRADO ALVES BATISTA, GUSTAVO ENRIQUE DE ALMEIDA. A comparative study between MFCC and LSF coefficients in automatic recognition of isolated digits pronounced in Portuguese and English. ACTA SCIENTIARUM-TECHNOLOGY, v. 35, n. 4, p. 621-628, . (11/04054-2, 12/07295-3, 11/17698-5, 12/50714-7)