Busca avançada
Ano de início
Entree


Classifying and Counting with Recurrent Contexts

Texto completo
Autor(es):
Reis, Denis ; Maletzke, Andre ; Silva, Diego F. ; Batista, Gustavo E. A. P. A. ; ACM
Número total de Autores: 5
Tipo de documento: Artigo Científico
Fonte: KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING; v. N/A, p. 10-pg., 2018-01-01.
Resumo

Many real-world applications in the batch and data stream settings with data shift pose restrictions to the access to class labels after the deployment of a classification or quantification model. However, a significant portion of the data stream literature assumes that actual labels are instantaneously available after issuing their corresponding classifications. In this paper, we explore a different set of assumptions without relying on the availability of class labels. We assume that, although the distribution of the data may change over time, it will switch between one of a handful of well-known distributions. Still, we allow the proportions of the classes to vary. In these conditions, we propose the first method that can accurately identify the correct context of data samples and simultaneously estimate the proportion of the positive class. This estimate can be further used to adjust a classification decision threshold and improve classification accuracy. Finally, the method is very efficient regarding time and memory requirements, fitting data stream applications. (AU)

Processo FAPESP: 16/04986-6 - Armadilhas e sensores inteligentes: uma abordagem inovadora para controle de insetos peste e vetores de doenças
Beneficiário:Gustavo Enrique de Almeida Prado Alves Batista
Modalidade de apoio: Auxílio à Pesquisa - Programa eScience e Data Science - Regular