Advanced search
Start date

Complexity-invariance for classification, clustering and motif discovery in time series


Recently, there is an increasing interest in time series processing due to the large number of application domains that generate data with such property. Such interest can be measured by the vast amount of methods recently proposed in literature to tasks such as classification, clustering, summarization, abnormality detection and motif discovery. Recent studies have shown for several problems that methods based on similarity present an efficacy that is hardly surpassed, even when compared to more sophisticated methods. This is mainly due to the fact that the community has studied and proposed several invariances to distance measures for time series. The invariances make the distance measures ignore certain undesired data properties. The most well-known example is the invariance to local differences in time scale, obtained with the warping technique. Other invariances include the invariance to differences in amplitude and offset, phase and occlusion. Recently, we demonstrated to the scientific community that time series similarity classification methods can be largely benefited by a new invariance: complexity invariance. The main objective of this research project is to investigate new complexity-invariant distance measures and assess how such measures can improve the efficacy especially of clustering and motif discovery algorithms. (AU)

Articles published in Agência FAPESP Newsletter about the research grant:
Articles published in other media outlets (0 total):
More itemsLess items

Scientific publications
(References retrieved automatically from Web of Science and SciELO through information on FAPESP grants and their corresponding numbers as mentioned in the publications by the authors)
PRATI, RONALDO C.; BATISTA, GUSTAVO E. A. P. A.; SILVA, DIEGO F. Class imbalance revisited: a new experimental setup to assess the performance of treatment methods. KNOWLEDGE AND INFORMATION SYSTEMS, v. 45, n. 1, p. 247-270, OCT 2015. Web of Science Citations: 48.
BATISTA, GUSTAVO E. A. P. A.; KEOGH, EAMONN J.; TATAW, OBEN MOSES; DE SOUZA, VINICIUS M. A. CID: an efficient complexity-invariant distance for time series. DATA MINING AND KNOWLEDGE DISCOVERY, v. 28, n. 3, p. 634-669, MAY 2014. Web of Science Citations: 76.
SILVA, DIEGO FURTADO; ALVES DE SOUZA, VINICIUS MOURAO; PRADO ALVES BATISTA, GUSTAVO ENRIQUE DE ALMEIDA. A comparative study between MFCC and LSF coefficients in automatic recognition of isolated digits pronounced in Portuguese and English. ACTA SCIENTIARUM-TECHNOLOGY, v. 35, n. 4, p. 621-628, 2013. Web of Science Citations: 2.

Please report errors in scientific publications list by writing to: