Advanced search
Start date
Betweenand


Semi-supervised Coarsening of Bipartite Graphs for Text Classification via Graph Neural Network

Full text
Author(s):
dos Santos, Nicolas Roque ; Minatel, Diego ; Baria Valejo, Alan Demetrius ; Lopes, Alneu de Andrade
Total Authors: 4
Document type: Journal article
Source: 2024 IEEE 11TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS, DSAA 2024; v. N/A, p. 10-pg., 2024-01-01.
Abstract

Graph Neural Networks (GNNs) have recently received extensive attention due to their applicability in a wide range of tasks, including drug discovery, text classification, traffic forecasting, hardware design, and recommendation. However, GNNs face significant challenges regarding scalability and the ability to handle large-scale graphs. Several strategies have been proposed to address these challenges, with multilevel optimization being a prominent approach. This technique involves hierarchically generating compact graphs through a coarsening step, applying a target algorithm (e.g., community detection) to the coarsest graph, and then projecting the initial solution back to the original input to derive the final solution. In this work, we introduce a method for graph-based text classification using GNNs. Our approach involves generating ten smaller graphs from an input bipartite graph using the coarsening step within the multilevel optimization and applying a GNN to learn node representations at various levels of granularity. Moreover, we propose a novel semi-supervised coarsening algorithm called Greedy Sorted Matching using Class and Split Information for Bipartite Graphs (GMCb). GMCb leverages class and train-test split information to select document nodes to merge during the graph coarsening step. We perform three types of reductions by either coarsening only one of the partitions of the graph or both simultaneously. Our method is evaluated on eight diverse datasets using three different GNN architectures. We assess each model's performance, memory usage, and training time to understand the impacts of graph reduction. Our experiments demonstrate that contracting the document nodes can improve performance while reducing memory consumption and training time. (AU)

FAPESP's process: 20/09835-1 - IARA - Artificial Intelligence in the Remaking of Urban Environments
Grantee:André Carlos Ponce de Leon Ferreira de Carvalho
Support Opportunities: Research Grants - Research Centers in Engineering Program
FAPESP's process: 22/09091-8 - Criminality, insecurity, and legitimacy: a transdisciplinary approach
Grantee:Luis Gustavo Nonato
Support Opportunities: Research Grants - eScience and Data Science Program - Thematic Grants
FAPESP's process: 21/06210-3 - Urban spaces-aware services via federated learning in intelligent transport systems
Grantee:Geraldo Pereira Rocha Filho
Support Opportunities: Regular Research Grants