Unsupervised Context Switch for Classification Tasks on Data Streams with Recurrent Concepts

Author(s):
dos Reis, Denis M. ; Maletzke, Andre G. ; Batista, Gustavo E. A. P. A.
Total Authors: 3
Document type: Journal article
Source: 33rd Annual ACM Symposium on Applied Computing; v. N/A, 7 pp., 2018-01-01.
Abstract

In this paper, we propose a novel approach to deal with concept drift in data streams. We assume we can collect labeled data for different concepts in the training phase; however, in the test phase, no labels are available. Our approach consists of storing a limited number of classification models and identifying, without supervision, the most suitable one for the current concept. Several real-world classification problems with extreme label latency fit this setting. One example is the identification of insect species using wing-beat data gathered by sensors in field conditions. Flying insects have their wing-beat frequency indirectly affected by temperature, among other factors. In this work, we show that we can dynamically identify the most appropriate classification model among models trained on data from different temperature conditions, without any temperature information. We then extend the method to other data sets and obtain accurate results. (AU)
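
The setting described in the abstract (one stored model per training concept, with label-free selection at test time) can be illustrated with a minimal sketch. Everything specific here is an assumption for illustration and is not taken from the paper: the nearest-centroid classifier, the single wing-beat-frequency feature, the toy numbers, the names `ConceptModel`, `ks_statistic` and `select_model`, and in particular the selection rule, which compares the unlabeled window to each model's training features with a two-sample Kolmogorov-Smirnov statistic. The abstract does not state which criterion the authors use.

```python
from bisect import bisect_right
from statistics import mean


def ks_statistic(a, b):
    """Two-sample Kolmogorov-Smirnov statistic: largest gap between empirical CDFs."""
    a, b = sorted(a), sorted(b)
    gap = 0.0
    for x in a + b:
        cdf_a = bisect_right(a, x) / len(a)
        cdf_b = bisect_right(b, x) / len(b)
        gap = max(gap, abs(cdf_a - cdf_b))
    return gap


class ConceptModel:
    """Hypothetical stored model: a nearest-centroid classifier for one concept."""

    def __init__(self, name, labeled):
        # labeled: (feature_value, class_label) pairs collected under one concept
        self.name = name
        self.reference = [x for x, _ in labeled]  # features kept for unsupervised matching
        by_class = {}
        for x, y in labeled:
            by_class.setdefault(y, []).append(x)
        self.centroids = {y: mean(xs) for y, xs in by_class.items()}

    def predict(self, x):
        # Assign the class whose centroid is closest to x
        return min(self.centroids, key=lambda y: abs(x - self.centroids[y]))


def select_model(models, window):
    """Assumed selection rule: pick the model whose training features best match the window."""
    return min(models, key=lambda m: ks_statistic(m.reference, window))


# Toy wing-beat frequencies (Hz) for two species under two temperature regimes
cool = ConceptModel("cool", [(400, "A"), (410, "A"), (420, "A"),
                             (500, "B"), (510, "B"), (520, "B")])
warm = ConceptModel("warm", [(480, "A"), (490, "A"), (500, "A"),
                             (580, "B"), (590, "B"), (600, "B")])

window = [485, 495, 585, 595]  # unlabeled test window; no temperature reading used
chosen = select_model([cool, warm], window)
predictions = [chosen.predict(x) for x in window]
print(chosen.name, predictions)
```

In this toy example the same frequency (485 Hz) is classified differently by the two stored models, which is why choosing the model that matches the current concept matters even though no labels or temperature readings are available at test time.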

FAPESP's process: 16/04986-6 - Intelligent traps and sensors: an innovative approach to control insect pests and disease vectors
Grantee: Gustavo Enrique de Almeida Prado Alves Batista
Support Opportunities: Research Grants - eScience and Data Science Program - Regular Program Grants