| Grant number: | 15/05676-8 |
| Support Opportunities: | Scholarships in Brazil - Master |
| Start date: | July 01, 2015 |
| End date: | July 31, 2017 |
| Field of knowledge: | Physical Sciences and Mathematics - Computer Science - Computing Methodologies and Techniques |
| Agreement: | Coordination of Improvement of Higher Education Personnel (CAPES) |
| Principal Investigator: | Diego Raphael Amancio |
| Grantee: | Vanessa Queiroz Marinho |
| Host Institution: | Instituto de Ciências Matemáticas e de Computação (ICMC). Universidade de São Paulo (USP). São Carlos , SP, Brazil |
| Associated scholarship(s): | 15/23803-7 - Authorship attribution with traditional methods and complex networks, BE.EP.MS |
Abstract The modeling of graphs and complex networks has been successfully applied in different fields, being the object of study in different areas including, for example, mathematics and computer science. The discovery that methods derived from the study of complex networks can be used to analyze texts in their different complexity levels provided great advances in natural language processing tasks. Examples of applications analyzed with the methods and tools of complex networks are the detection of relevant concepts, development of automatic summarizers and authorship recognition systems. The latter task, which is the focus of this research project, has been studied with some success through the representation of words adjacency networks that connect only the closest words. The purpose of this project is to extend the traditional modeling, choosing the optimal connection window to the problem, for a given training set. In addition, we intend to use the connectivity information of function words to complement the characterization of authors' style. Finally, we inted to create hybrid classifiers that are able to combine traditional factors with properties provided by the topological analysis of complex networks. By adapting, combining and improving the model, we aim not only improve the performance of textual stylistic characterization and authorship recognition systems, but also better understand what are the textual quantitative factors (measured through networks) that can be used in stylometry. The advances obtained during this project may be useful tostudy related applications, such as the analysis of stylistic inconsistences and plagiarism. (AU) | |
| News published in Agência FAPESP Newsletter about the scholarship: | |
| More itemsLess items | |
| TITULO | |
| Articles published in other media outlets ( ): | |
| More itemsLess items | |
| VEICULO: TITULO (DATA) | |
| VEICULO: TITULO (DATA) | |