Clarice.ai: a linguistic intelligence aids to writing nonfiction based on web
Aspectos da reanálise sintático-semântica de sem que em perspectiva diacrônica
Full text | |
Author(s): |
Total Authors: 3
|
Affiliation: | [1] Correa Jr, Jr., Edilson A., Univ Sao Paulo, Inst Math & Comp Sci, Sao Carlos, SP - Brazil
Total Affiliations: 1
|
Document type: | Journal article |
Source: | PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS; v. 557, NOV 1 2020. |
Web of Science Citations: | 0 |
Abstract | |
We propose a framework to characterize documents based on their semantic flow. The proposed framework encompasses a network-based model that connected sentences based on their semantic similarity. Semantic fields are detected using standard community detection methods. As the story unfolds, transitions between semantic fields are represented in Markov networks, which in turn are characterized via network motifs (subgraphs). Here we show that different book characteristics (such as genre and publication date) are discriminated by the adopted semantic flow representation. Remarkably, even without a systematic optimization of parameters, philosophy and investigative books were discriminated with an accuracy rate of 92.5%. While the objective of this study is not to create a text classification method, we believe that semantic flow features could be used in traditional network-based models of texts that capture only syntactical/stylistic information to improve the characterization of texts. (C) 2020 Elsevier B.V. All rights reserved. (AU) | |
FAPESP's process: | 15/05676-8 - Development of new models for authorship recognition using complex networks |
Grantee: | Vanessa Queiroz Marinho |
Support Opportunities: | Scholarships in Brazil - Master |
FAPESP's process: | 16/19069-9 - Using semantical information to classify texts modelled as complex networks |
Grantee: | Diego Raphael Amancio |
Support Opportunities: | Regular Research Grants |