Advanced search
Start date
Betweenand
(Reference retrieved automatically from Web of Science through information on FAPESP grant and its corresponding number as mentioned in the publication by the authors.)

Multiclass from Binary: Expanding One-Versus-All, One-Versus-One and ECOC-Based Approaches

Full text
Author(s):
Rocha, Anderson [1] ; Goldenstein, Siome Klein [1]
Total Authors: 2
Affiliation:
[1] Univ Estadual Campinas, Inst Comp, BR-13083970 Sao Paulo - Brazil
Total Affiliations: 1
Document type: Journal article
Source: IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS; v. 25, n. 2, p. 289-302, FEB 2014.
Web of Science Citations: 51
Abstract

Recently, there has been a lot of success in the development of effective binary classifiers. Although many statistical classification techniques have natural multiclass extensions, some, such as the support vector machines, do not. The existing techniques for mapping multiclass problems onto a set of simpler binary classification problems run into serious efficiency problems when there are hundreds or even thousands of classes, and these are the scenarios where this paper's contributions shine. We introduce the concept of correlation and joint probability of base binary learners. We learn these properties during the training stage, group the binary leaner's based on their independence and, with a Bayesian approach, combine the results to predict the class of a new instance. Finally, we also discuss two additional strategies: one to reduce the number of required base learners in the multiclass classification, and another to find new base learners that might best complement the existing set. We use these two new procedures iteratively to complement the initial solution and improve the overall performance. This paper has two goals: finding the most discriminative binary classifiers to solve a multiclass problem and keeping up the efficiency, i.e., small number of base learners. We validate and compare the method with a diverse set of methods of the literature in several public available datasets that range from small (10 to 26 classes) to large multiclass problems (1000 classes) always using simple reproducible scenarios. (AU)

FAPESP's process: 10/05647-4 - Digital forensics: collection, organization, classification and analysis of digital evidences
Grantee:Anderson de Rezende Rocha
Support Opportunities: Research Grants - Young Investigators Grants