1-to-N Large Margin Classifier

Author(s):
Layza, Jaime Rocca; Pedrini, Helio; Torres, Ricardo da Silva; IEEE
Total Authors: 4
Document type: Conference paper
Source: 2020 33rd SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI 2020); 8 pp.; 2020.
Abstract

Cross entropy with softmax is the standard loss function for classification in neural networks. However, this function can suffer from limited discriminative power, poor generalization, and a propensity to overfit. To address these limitations, several approaches propose enforcing a margin at the top of the neural network, specifically at the softmax function. In this work, we present a novel formulation that aims to improve generalization and label-noise robustness not only by imposing a margin at the top of the neural network, but also by using the entire structure of the mini-batch data. Based on the distance used by SVMs to obtain a maximal margin, we propose a broader distance definition, called the 1-to-N distance, and an approximated probability function as the basis for our proposed loss function. We perform empirical experiments on the MNIST, CIFAR-10, and ImageNet32 datasets to demonstrate that our loss function has better generalization and label-noise robustness than the traditional cross-entropy method, showing improvements in the following tasks: generalization robustness, robustness on data with noisy labels, and robustness against adversarial example attacks. (AU)
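For context, the baseline the abstract argues against is softmax cross entropy, into which a margin can be injected by shifting the true-class logit before the softmax. The sketch below shows a generic additive-margin cross entropy in PyTorch; it is not the paper's 1-to-N loss, and the margin value, function name, and choice of PyTorch are assumptions made purely for illustration.

    # Illustrative sketch only: a generic additive-margin softmax cross entropy,
    # NOT the 1-to-N loss proposed in the paper. Margin value is an assumption.
    import torch
    import torch.nn.functional as F

    def margin_cross_entropy(logits: torch.Tensor, targets: torch.Tensor,
                             margin: float = 0.5) -> torch.Tensor:
        """Cross entropy where the true-class logit is reduced by a fixed margin,
        so the network must separate the correct class by at least that margin."""
        # Build a one-hot mask and subtract the margin from the correct-class logit.
        one_hot = F.one_hot(targets, num_classes=logits.size(1)).to(logits.dtype)
        adjusted = logits - margin * one_hot
        return F.cross_entropy(adjusted, targets)

    # Usage with random data (batch of 4, 10 classes, e.g. MNIST or CIFAR-10).
    logits = torch.randn(4, 10, requires_grad=True)
    targets = torch.randint(0, 10, (4,))
    loss = margin_cross_entropy(logits, targets)
    loss.backward()
    print(loss.item())

Setting margin to 0 recovers standard cross entropy, which is the baseline the paper's experiments compare against.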

FAPESP's process: 17/12646-3 - Déjà vu: feature-space-time coherence from heterogeneous data for media integrity analytics and interpretation of events
Grantee: Anderson de Rezende Rocha
Support Opportunities: Research Projects - Thematic Grants