Linguistic description of complementarity for Multi-document Summarization
Investigation of automatic multi-document summarization methods based on conceptua...
Methods for Redundancy Detection in Multidocument Summarization
Grant number: 13/13107-8
Support Opportunities: Scholarships in Brazil - Scientific Initiation
Start date: September 01, 2013
End date: August 31, 2014
Field of knowledge: Linguistics, Literature and Arts - Linguistics - Linguistic Theory and Analysis
Principal Investigator: Ariani Di Felippo
Grantee: Vinícius Felix dos Santos
Host Institution: Centro de Educação e Ciências Humanas (CECH). Universidade Federal de São Carlos (UFSCAR). São Carlos, SP, Brazil
Abstract Several studies have demonstrated that human summaries produced from collections of news texts from different sources on the same topic (i.e., multi-document summaries) contain specific aspects that depend on their category. The "aspects" are defined as basic units of information. For example, a summary of "natural disasters" news has the following aspects: what, when, where, why, who_affected, damages, and countermeasures. Based on that, some Automatic Multi-document Summarization methods produce summaries by selecting sentences from the source texts that convey the aspects expected for the collection's category. This project aims at: (I) revising the annotation of the aspects in the 50 human multi-document summaries of the Portuguese CSTNews corpus, and (II) annotating the aspects in the 140 source texts of the CSTNews corpus. The annotation revision is motivated by the fact that there is no clear and well-defined theory of aspects, so the criteria for identifying and defining these aspects need to be refined. The annotation of the CSTNews corpus is essential for developing aspect-based multi-document summarization methods, especially for Portuguese, since such methods require a corpus of annotated source texts. As part of the SUSTENTO project (2012/13246-5 FAPESP/CNPq 483231/2012-6), which aims at generating linguistic knowledge for Automatic Multi-document Summarization of Portuguese, this undergraduate research project aims to help refine the theoretical knowledge on textual aspects and to characterize the human multi-document summaries of CSTNews. (AU)