Advanced search
Start date

Text phylogeny

Grant number: 14/13433-5
Support Opportunities:Scholarships in Brazil - Scientific Initiation
Effective date (Start): September 01, 2014
Effective date (End): August 31, 2015
Field of knowledge:Physical Sciences and Mathematics - Computer Science
Principal Investigator:Zanoni Dias
Grantee:Guilherme Duarte Marmerola
Host Institution: Instituto de Computação (IC). Universidade Estadual de Campinas (UNICAMP). Campinas , SP, Brazil


Content redistribution throughout the web, by lawful or unlawful means, has attracted attention in recent years in fields like forensics, copyright enforcement, security and social network analysis. Very often the digital objects involved in this process go through an evolutionary chain, in which different versions of an original document emerge. In this case, the relationship among the documents can be represented by a directed acyclic graph, known in the field as a phylogenetic tree, due to the direct analogy with the ones used in evolution studies in Biology. From the analysis of these trees, it is possible to discover clues pointing to criminals, or gain insights about how information spreads through the web. Thus, the automatic reconstruction of phylogeny trees associated to multimedia presents itself as an important challenge, with great potential for generating value and benefits to the society. The sub-field which studies this problem is known as Multimedia Phylogeny, and it has achieved significant results in some types of media, namely images and video. In preliminary tests, done by the candidate, promising results were achieved in another, but less explored, particular type of media: text documents. In this project, we propose to expand text phylogeny research, using synthetic and real data, aiming to improve the performance of the existent reconstruction process, addressing the problems that were found the most challenging in our preliminary studies.

News published in Agência FAPESP Newsletter about the scholarship:
Articles published in other media outlets (0 total):
More itemsLess items

Scientific publications
(References retrieved automatically from Web of Science and SciELO through information on FAPESP grants and their corresponding numbers as mentioned in the publications by the authors)
MARMEROLA, GUILHERME D.; OIKAWA, MARINA A.; DIAS, ZANONI; GOLDENSTEIN, SIOME; ROCHA, ANDERSON. On the Reconstruction of Text Phylogeny Trees: Evaluation and Analysis of Textual Relationships. PLoS One, v. 11, n. 12, . (14/19401-8, 15/19222-9, 14/13433-5, 13/08293-7, 14/03535-5)

Please report errors in scientific publications list by writing to: