Advanced search
Start date
Betweenand


Optimizing the class information divergence for transductive classification of texts using propagation in bipartite graphs

Full text
Author(s):
Faleiros, Thiago de Paulo ; Rossi, Rafael Geraldeli ; Lopes, Alneu de Andrade
Total Authors: 3
Document type: Journal article
Source: PATTERN RECOGNITION LETTERS; v. 87, p. 12-pg., 2017-02-01.
Abstract

Transductive classification is an useful way to classify a collection of unlabeled textual documents when only a small fraction of this collection can be manually labeled. Graph-based algorithms have aroused considerable interests in recent years to perform transductive classification since the graph-based representation facilitates label propagation through the graph edges. In a bipartite graph representation, nodes represent objects of two types, here documents and terms, and the edges between documents and terms represent the occurrences of the terms in the documents. In this context, the label propagation is performed from documents to terms and then from terms to documents iteratively. In this paper we propose a new graph-based transductive algorithm that use the bipartite graph structure to associate the available class information of labeled documents and then propagate these class information to assign labels for unlabeled documents. By associating the class information to edges linking documents to terms we guarantee that a single term can propagate different class information to its distinct neighbors. We also demonstrated that the proposed method surpasses the algorithms for transductive classification based on vector space model or graphs when only a small number of labeled documents is available. (C) 2016 Elsevier B.V. All rights reserved. (AU)

FAPESP's process: 15/14228-9 - Social Network Analysis and Mining
Grantee:Alneu de Andrade Lopes
Support Opportunities: Regular Research Grants
FAPESP's process: 11/12823-6 - Pattern extraction from textual document collections using heterogeneous networks
Grantee:Rafael Geraldeli Rossi
Support Opportunities: Scholarships in Brazil - Doctorate
FAPESP's process: 11/22749-8 - Challenges in exploratory visualization of multidimensional data: paradigms, scalability and applications
Grantee:Luis Gustavo Nonato
Support Opportunities: Research Projects - Thematic Grants