Advanced search
Start date
Betweenand


Non-stationary and unpredictable data distributions in classification and quantification

Full text
Author(s):
Denis Moreira dos Reis
Total Authors: 1
Document type: Doctoral Thesis
Press: São Carlos.
Institution: Universidade de São Paulo (USP). Instituto de Ciências Matemáticas e de Computação (ICMC/SB)
Defense date:
Examining board members:
Elaine Parros Machado de Sousa; Renato Martins Assunção; Robson Leonardo Ferreira Cordeiro; Fabrício Enembreck
Advisor: Gustavo Enrique de Almeida Prado Alves Batista; Elaine Parros Machado de Sousa
Abstract

In the last years, we observed a crescent academic interest on nonstationary data. On the one hand, differences between the data that was used to induce a model and the data that is found after the model is deployed cause a decrease of performance for several tasks, such as classification. On the other hand, in several tasks, such as quantification, we are explicitly interested in measuring how a distribution changes over time. For any of these problems, however, we generally run into solutions that rely on strong assumptions, which are impractical or even impracticable in real world applications. In this thesis, we provide solutions that rely on less restrictive and/or more realistic assumptions in order to allow such methods to be employed in real applications. In the concept drift detection area, we introduce unsupervised drift detection methods that allow for performing classification and quantification without ever requesting true labels after deployment. In the quantification area, we bootstrap a new research topic called one-class quantification. Similarly to one-class classification, in one-class quantification we are able to avoid strong assumptions regarding the negative class, which is deemed unpredictable. Our experimental results are promising and provide favorable evidences about the viability of solutions that are focused on solving real-world problems. (AU)

FAPESP's process: 17/22896-7 - Unsupervised Context Detection of Streaming Data For Classification
Grantee:Denis Moreira dos Reis
Support Opportunities: Scholarships in Brazil - Doctorate