Advanced search
Start date
Betweenand

Using complex networks to recognize patterns in written texts

Grant number: 14/20830-0
Support type:Regular Research Grants
Duration: February 01, 2015 - January 31, 2017
Field of knowledge:Physical Sciences and Mathematics - Computer Science
Principal Investigator:Diego Raphael Amancio
Grantee:Diego Raphael Amancio
Home Institution: Instituto de Ciências Matemáticas e de Computação (ICMC). Universidade de São Paulo (USP). São Carlos , SP, Brazil

Abstract

Complex networks (CN) have been widely employed to model texts. Although some theoretical results have investigated the structural and functional properties of the language via the CN framework, the applicability of the topological analysis of CNs to solve linguistic problems have been restricted to a few studies. The proposed project aims at improving current CN-based models modeling traditional and novel applications. More specifically, we propose the combination of traditional and CN-based techniques based on time series analysis in order to improve the performance of natural language processing tasks, such as the authorship recognition and the disambiguation problems. Upon combining traditional and CN-based techniques in a hybrid way, we expect to generate competitive unsupervised and supervised classifiers. We also expect that the generated models will provide relevant insights into the language functional mechanisms. (AU)

Scientific publications (22)
(References retrieved automatically from Web of Science and SciELO through information on FAPESP grants and their corresponding numbers as mentioned in the publications by the authors)
COMIN, CESAR H.; PERON, THOMAS; SILVA, FILIPI N.; AMANCIO, DIEGO R.; RODRIGUES, FRANCISCO A.; COSTA, LUCIANO DA F. Complex systems: Features, similarity and connectivity. PHYSICS REPORTS-REVIEW SECTION OF PHYSICS LETTERS, v. 861, p. 1-41, MAY 25 2020. Web of Science Citations: 0.
CORREA, JR., EDILSON A.; AMANCIO, DIEGO R. Word sense induction using word embeddings and community detection in complex networks. PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, v. 523, p. 180-190, JUN 1 2019. Web of Science Citations: 0.
DE ARRUDA, HENRIQUE F.; SILVA, FILIPI N.; COMIN, CESAR H.; AMANCIO, DIEGO R.; COSTA, LUCIANO DA F. Connecting network science and information theory. PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, v. 515, p. 641-648, FEB 1 2019. Web of Science Citations: 0.
RODRIGUEZ, MAYRA Z.; COMIN, CESAR H.; CASANOVA, DALCIMAR; BRUNO, ODEMIR M.; AMANCIO, DIEGO R.; COSTA, LUCIANO DA F.; RODRIGUES, FRANCISCO A. Clustering algorithms: A comparative approach. PLoS One, v. 14, n. 1 JAN 15 2019. Web of Science Citations: 8.
MARINHO, VANESSA QUEIROZ; HIRST, GRAEME; AMANCIO, DIEGO RAPHAEL. Labelled network subgraphs reveal stylistic subtleties in written texts. JOURNAL OF COMPLEX NETWORKS, v. 6, n. 4, p. 620-638, AUG 2018. Web of Science Citations: 0.
CORREA, JR., EDILSON A.; LOPES, ALNEU A.; AMANCIO, DIEGO R. Word sense disambiguation: A complex network approach. INFORMATION SCIENCES, v. 442, p. 103-113, MAY 2018. Web of Science Citations: 6.
AKIMUSHKIN, CAMILO; AMANCIO, DIEGO R.; OLIVEIRA, JR., OSVALDO N. On the role of words in the network structure of texts: Application to authorship attribution. PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, v. 495, p. 49-58, APR 1 2018. Web of Science Citations: 1.
MACHICAO, JEANETH; CORREA, JR., EDILSON A.; MIRANDA, GISELE H. B.; AMANCIO, DIEGO R.; BRUNO, ODEMIR M. Authorship attribution based on Life-Like Network Automata. PLoS One, v. 13, n. 3 MAR 22 2018. Web of Science Citations: 0.
DE ARRUDA, HENRIQUE FERRAZ; SILVA, FILIPI NASCIMENTO; MARINHO, VANESSA QUEIROZ; AMANCIO, DIEGO RAPHAEL; COSTA, LUCIANO DA FONTOURA. Representation of texts as complex networks: a mesoscopic approach. JOURNAL OF COMPLEX NETWORKS, v. 6, n. 1, p. 125-144, FEB 2018. Web of Science Citations: 3.
DE ARRUDA, HENRIQUE F.; SILVA, FILIPI N.; COSTA, LUCIANO DA F.; AMANCIO, DIEGO R. Knowledge acquisition: A Complex networks approach. INFORMATION SCIENCES, v. 421, p. 154-166, DEC 2017. Web of Science Citations: 13.
CORREA, JR., EDILSON A.; SILVA, FILIPI N.; COSTA, LUCIANO DA F.; AMANCIO, DIEGO R. Patterns of authors contribution in scientific manuscripts. Journal of Informetrics, v. 11, n. 2, p. 498-510, MAY 2017. Web of Science Citations: 8.
AKIMUSHKIN, CAMILO; AMANCIO, DIEGO RAPHAEL; OLIVEIRA, JR., OSVALDO NOVAIS. Text Authorship Identified Using the Dynamics of Word Co-Occurrence Networks. PLoS One, v. 12, n. 1 JAN 26 2017. Web of Science Citations: 16.
AMANCIO, DIEGO RAPHAEL. Network analysis of named entity co-occurrences in written texts. EPL, v. 114, n. 5 JUN 2016. Web of Science Citations: 2.
DE ARRUDA, HENRIQUE F.; COSTA, LUCIANO DA F.; AMANCIO, DIEGO R. Topic segmentation via community detection in complex networks. Chaos, v. 26, n. 6 JUN 2016. Web of Science Citations: 5.
SILVA, FILIPI N.; AMANCIO, DIEGO R.; BARDOSOVA, MARIA; COSTA, LUCIANO DA F.; OLIVEIRA, JR., OSVALDO N. Using network science and text analytics to produce surveys in a scientific topic. Journal of Informetrics, v. 10, n. 2, p. 487-502, MAY 2016. Web of Science Citations: 25.
DE ARRUDA, HENRIQUE F.; COSTA, LUCIANO DA F.; AMANCIO, DIEGO R. Using complex networks for text classification: Discriminating informative and imaginative documents. EPL, v. 113, n. 2 JAN 2016. Web of Science Citations: 6.
AMANCIO, DIEGO RAPHAEL. Comparing the topological properties of real and artificially generated scientific manuscripts. SCIENTOMETRICS, v. 105, n. 3, p. 1763-1779, DEC 2015. Web of Science Citations: 17.
AMANCIO, DIEGO RAPHAEL. A Complex Network Approach to Stylometry. PLoS One, v. 10, n. 8 AUG 27 2015. Web of Science Citations: 33.
AMANCIO, DIEGO R.; SILVA, FILIPI N.; COSTA, LUCIANO DA F. Concentric network symmetry grasps authors' styles in word adjacency networks. EPL, v. 110, n. 6 JUN 2015. Web of Science Citations: 8.
AMANCIO, DIEGO R. Authorship recognition via fluctuation analysis of network topology and word intermittency. JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, MAR 2015. Web of Science Citations: 27.
AMANCIO, DIEGO R.; OLIVEIRA, JR., OSVALDO N.; COSTA, L. DA F. Robustness of community structure to node removal. JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, MAR 2015. Web of Science Citations: 4.
AMANCIO, DIEGO R. Probing the Topological Properties of Complex Networks Modeling Short Written Texts. PLoS One, v. 10, n. 2 FEB 26 2015. Web of Science Citations: 22.

Please report errors in scientific publications list by writing to: cdi@fapesp.br.