Advanced search
Start date
Betweenand


Text characterization based on recurrence networks

Full text
Author(s):
Souza, Barbara C. E. ; Silva, Filipi N. ; de Arruda, Henrique F. ; da Silva, Giovana D. ; Costa, Luciano Da F. ; Amancio, Diego R.
Total Authors: 6
Document type: Journal article
Source: INFORMATION SCIENCES; v. 641, p. 15-pg., 2023-05-12.
Abstract

Several complex systems are characterized by exhibiting intricate properties that occur at multiple scales. These multi-scale characterizations are used in various applications. In particular, texts can be characterized by a hierarchical structure, which can be approached by using multi-scale concepts and methods. Here, we adopt an extension of the multi-scale, mesoscopic approach - hereafter referred to as a recurrence network - to represent text narratives, in which only the recurrent relationships among tagged parts of speech (subject, verb and direct object) are considered to establish connections among sequential pieces of text. The characterization of the texts was then achieved by considering scale-dependent complementary methods: accessibility and symmetry. To evaluate the potential of these concepts, we approached the problem of distinguishing between meaningful and meaningless texts and different literary genres (namely, fiction and non-fiction). A set of 300 books was considered and compared by using the above approaches. The recurrence network characterization was able to discriminate to some extent between real and meaningless and between the two genres assessed. Thus, our results indicate that recurrence networks are able to capture subtleties in book plots, suggesting that a similar methodology can be used in related networked applications. (AU)

FAPESP's process: 19/07665-4 - Center for Artificial Intelligence
Grantee:Fabio Gagliardi Cozman
Support Opportunities: Research Grants - Research Program in eScience and Data Science - Research Centers in Engineering Program
FAPESP's process: 21/01744-0 - Using complex networks to characterize and identify the success of literary works
Grantee:Giovana Daniele da Silva
Support Opportunities: Scholarships in Brazil - Scientific Initiation
FAPESP's process: 20/06271-0 - Combining complex networks and word embeddings in text classification tasks
Grantee:Diego Raphael Amancio
Support Opportunities: Regular Research Grants
FAPESP's process: 18/10489-0 - Transformations of complex networks and their implication in topology and dynamics of complex systems
Grantee:Henrique Ferraz de Arruda
Support Opportunities: Scholarships in Brazil - Post-Doctoral
FAPESP's process: 15/22308-2 - Intermediate representations in Computational Science for knowledge discovery
Grantee:Roberto Marcondes Cesar Junior
Support Opportunities: Research Projects - Thematic Grants