Using complex networks to recognize patterns in written texts

Grant number: 14/20830-0
Support Opportunities: Regular Research Grants
Start date: February 01, 2015
End date: January 31, 2017
Field of knowledge: Physical Sciences and Mathematics - Computer Science
Principal Investigator: Diego Raphael Amancio
Grantee: Diego Raphael Amancio
Host Institution: Instituto de Ciências Matemáticas e de Computação (ICMC). Universidade de São Paulo (USP). São Carlos, SP, Brazil

Abstract

Complex networks (CN) have been widely employed to model texts. Although some theoretical studies have investigated the structural and functional properties of language via the CN framework, the application of topological analysis of CNs to linguistic problems has been restricted to a few studies. The proposed project aims to improve current CN-based models for both traditional and novel applications. More specifically, we propose combining traditional techniques with CN-based techniques grounded in time series analysis in order to improve performance in natural language processing tasks such as authorship recognition and disambiguation. By combining traditional and CN-based techniques in a hybrid way, we expect to obtain competitive unsupervised and supervised classifiers. We also expect the resulting models to provide relevant insights into the functional mechanisms of language. (AU)

Scientific publications (25)
(References retrieved automatically from Web of Science and SciELO through information on FAPESP grants and their corresponding numbers as mentioned in the publications by the authors)
MARINHO, VANESSA QUEIROZ; HIRST, GRAEME; AMANCIO, DIEGO RAPHAEL. Labelled network subgraphs reveal stylistic subtleties in written texts. JOURNAL OF COMPLEX NETWORKS, v. 6, n. 4, p. 620-638, . (15/05676-8, 14/20830-0, 15/23803-7, 16/19069-9)
AMANCIO, DIEGO RAPHAEL. Comparing the topological properties of real and artificially generated scientific manuscripts. SCIENTOMETRICS, v. 105, n. 3, p. 1763-1779, . (14/20830-0)
DE ARRUDA, HENRIQUE FERRAZ; SILVA, FILIPI NASCIMENTO; MARINHO, VANESSA QUEIROZ; AMANCIO, DIEGO RAPHAEL; COSTA, LUCIANO DA FONTOURA. Representation of texts as complex networks: a mesoscopic approach. JOURNAL OF COMPLEX NETWORKS, v. 6, n. 1, p. 125-144, . (16/19069-9, 11/50761-2, 15/05676-8, 14/20830-0, 15/08003-4)
DE ARRUDA, HENRIQUE F.; SILVA, FILIPI N.; COSTA, LUCIANO DA F.; AMANCIO, DIEGO R.. Knowledge acquisition: A Complex networks approach. INFORMATION SCIENCES, v. 421, p. 154-166, . (14/20830-0, 16/19069-9, 11/50761-2, 15/08003-4)
DE ARRUDA, HENRIQUE F.; SILVA, FILIPI N.; COMIN, CESAR H.; AMANCIO, DIEGO R.; COSTA, LUCIANO DA F.. Connecting network science and information theory. PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, v. 515, p. 641-648, . (15/18942-8, 16/19069-9, 14/20830-0, 15/08003-4, 11/50761-2)
DE ARRUDA, HENRIQUE F.; COSTA, LUCIANO DA F.; AMANCIO, DIEGO R.. Topic segmentation via community detection in complex networks. Chaos, v. 26, n. 6, . (14/20830-0, 11/50761-2)
CORREA, JR., EDILSON A.; SILVA, FILIPI N.; COSTA, LUCIANO DA F.; AMANCIO, DIEGO R.. Patterns of authors contribution in scientific manuscripts. Journal of Informetrics, v. 11, n. 2, p. 498-510, . (14/20830-0, 16/19069-9, 11/50761-2, 15/08003-4)
CORREA, JR., EDILSON A.; AMANCIO, DIEGO R.. Word sense induction using word embeddings and community detection in complex networks. PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, v. 523, p. 180-190, . (14/20830-0, 17/13464-6, 16/19069-9)
MACHICAO, JEANETH; CORREA, JR., EDILSON A.; MIRANDA, GISELE H. B.; AMANCIO, DIEGO R.; BRUNO, ODEMIR M.. Authorship attribution based on Life-Like Network Automata. PLoS One, v. 13, n. 3, . (17/13464-6, 14/20830-0, 15/05899-7, 16/19069-9, 14/08026-1)
COMIN, CESAR H.; PERON, THOMAS; SILVA, FILIPI N.; AMANCIO, DIEGO R.; RODRIGUES, FRANCISCO A.; COSTA, LUCIANO DA F.. Complex systems: Features, similarity and connectivity. PHYSICS REPORTS-REVIEW SECTION OF PHYSICS LETTERS, v. 861, p. 1-41, . (15/22308-2, 15/08003-4, 16/23827-6, 18/09125-4, 16/19069-9, 14/20830-0, 13/26416-9)
AKIMUSHKIN, CAMILO; AMANCIO, DIEGO RAPHAEL; OLIVEIRA, JR., OSVALDO NOVAIS. Text Authorship Identified Using the Dynamics of Word Co-Occurrence Networks. PLoS One, v. 12, n. 1, . (14/20830-0, 16/19069-9)
MARINHO, VANESSA QUEIROZ; HIRST, GRAEME; AMANCIO, DIEGO RAPHAEL; IEEE. Authorship attribution via network motifs identification. PROCEEDINGS OF 2016 5TH BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS 2016), v. N/A, p. 6-pg., . (15/05676-8, 14/20830-0, 15/23803-7)
AMANCIO, DIEGO R.; OLIVEIRA, OSVALDO N., JR.; COSTA, L. DA F.. Robustness of community structure to node removal. JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, v. N/A, p. 18-pg., . (13/06717-4, 14/20830-0, 11/50761-2)
AMANCIO, DIEGO R.. Authorship recognition via fluctuation analysis of network topology and word intermittency. JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, v. N/A, p. 20-pg., . (14/20830-0)
RODRIGUEZ, MAYRA Z.; COMIN, CESAR H.; CASANOVA, DALCIMAR; BRUNO, ODEMIR M.; AMANCIO, DIEGO R.; COSTA, LUCIANO DA F.; RODRIGUES, FRANCISCO A.. Clustering algorithms: A comparative approach. PLoS One, v. 14, n. 1, . (16/19069-9, 14/20830-0, 15/18942-8, 15/22308-2, 14/08026-1, 18/09125-4, 11/50761-2)
AMANCIO, DIEGO RAPHAEL. A Complex Network Approach to Stylometry. PLoS One, v. 10, n. 8, . (14/20830-0)
AMANCIO, DIEGO RAPHAEL. Network analysis of named entity co-occurrences in written texts. EPL, v. 114, n. 5, . (14/20830-0)
AMANCIO, DIEGO R.; SILVA, FILIPI N.; COSTA, LUCIANO DA F.. Concentric network symmetry grasps authors' styles in word adjacency networks. EPL, v. 110, n. 6, . (14/20830-0, 11/50761-2)
SILVA, FILIPI N.; AMANCIO, DIEGO R.; BARDOSOVA, MARIA; COSTA, LUCIANO DA F.; OLIVEIRA, JR., OSVALDO N.. Using network science and text analytics to produce surveys in a scientific topic. Journal of Informetrics, v. 10, n. 2, p. 487-502, . (14/20830-0, 11/50761-2, 15/08003-4)
DE ARRUDA, HENRIQUE F.; COSTA, LUCIANO DA F.; AMANCIO, DIEGO R.. Using complex networks for text classification: Discriminating informative and imaginative documents. EPL, v. 113, n. 2, . (14/20830-0, 12/50986-7, 11/50761-2)
CORREA, JR., EDILSON A.; LOPES, ALNEU A.; AMANCIO, DIEGO R.. Word sense disambiguation: A complex network approach. INFORMATION SCIENCES, v. 442, p. 103-113, . (17/13464-6, 16/19069-9, 15/14228-9, 14/20830-0, 11/22749-8)
AKIMUSHKIN, CAMILO; AMANCIO, DIEGO R.; OLIVEIRA, JR., OSVALDO N.. On the role of words in the network structure of texts: Application to authorship attribution. PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, v. 495, p. 49-58, . (14/20830-0, 13/14262-7, 16/19069-9)