Extractive multi-document summarization using multilayer networks

Tohalino, Jorge V.; Amancio, Diego R.

Author(s):	Tohalino, Jorge V. [1]; Amancio, Diego R. [1, 2] Total number of authors: 2
Author affiliation(s):	[1] Univ Sao Paulo, Inst Math & Comp Sci, Sao Carlos, SP - Brazil [2] Indiana Univ, Sch Informat Comp & Engn, Bloomington, IN 47408 - USA Total number of affiliations: 2
Document type:	Scientific article
Source:	PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS; v. 503, p. 526-539, AUG 1 2018.
Web of Science citations:	3
Abstract
Huge volumes of textual information are produced every single day. To organize and understand such large datasets, summarization techniques have become popular in recent years. These techniques aim at finding relevant, concise and non-redundant content in such large volumes of data. While network methods have been adopted to model texts in some scenarios, a systematic evaluation of multilayer network models in the multi-document summarization task has been limited to a few studies. Here, we evaluate the performance of a multilayer-based method to select the most relevant sentences in the context of an extractive multi-document summarization (MDS) task. In the adopted model, nodes represent sentences and edges are created based on the number of shared words between sentences. Unlike previous studies in multi-document summarization, we make a distinction between edges linking sentences from different documents (inter-layer) and those connecting sentences from the same document (intra-layer). As a proof of principle, our results reveal that such a discrimination between intra- and inter-layer edges in a multilayered representation is able to improve the quality of the generated summaries. This piece of information could be used to improve current statistical methods and related textual models. (C) 2018 Elsevier B.V. All rights reserved. (AU)
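
The network model described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract only states that nodes are sentences, that edges depend on the number of shared words, and that intra-layer and inter-layer edges are distinguished. The naive tokenizer, the `inter_scale` factor applied to inter-layer edges, and the use of node strength (sum of edge weights) to rank sentences are all assumptions made here for illustration, as are the function names.

```python
from itertools import combinations


def tokenize(sentence):
    """Naive tokenizer (an assumption): lowercase words with punctuation stripped."""
    return {w.strip(".,;:!?()\"'").lower() for w in sentence.split()} - {""}


def build_multilayer_edges(documents, inter_scale=1.0):
    """Build a sentence network from a list of documents (each a list of sentences).

    Each document is treated as one layer. The edge weight is the number of
    shared words; inter-layer edges are multiplied by the hypothetical
    `inter_scale` factor so the two edge types can be treated differently.
    """
    nodes = [(d, s) for d, doc in enumerate(documents) for s in doc]
    words = [tokenize(s) for _, s in nodes]
    edges = {}
    for i, j in combinations(range(len(nodes)), 2):
        shared = len(words[i] & words[j])
        if shared == 0:
            continue  # no shared words, no edge
        same_doc = nodes[i][0] == nodes[j][0]
        edges[(i, j)] = shared if same_doc else inter_scale * shared
    return nodes, edges


def rank_by_strength(nodes, edges):
    """Rank sentences by node strength (an assumed relevance measure)."""
    strength = [0.0] * len(nodes)
    for (i, j), w in edges.items():
        strength[i] += w
        strength[j] += w
    order = sorted(range(len(nodes)), key=lambda k: strength[k], reverse=True)
    return [(nodes[k][1], strength[k]) for k in order]


if __name__ == "__main__":
    docs = [["red apple pie", "red apple"], ["green apple"]]
    nodes, edges = build_multilayer_edges(docs, inter_scale=0.5)
    for sentence, score in rank_by_strength(nodes, edges):
        print(f"{score:.1f}  {sentence}")
```

An extractive summary would then be assembled from the top-ranked sentences. Setting `inter_scale` to 1.0 recovers a single-layer network in which the two edge types are not distinguished, which is the baseline the abstract contrasts against.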

FAPESP grant:	17/13464-6 - Modeling citation and information graphs: an approach based on complex networks
Grantee:	Diego Raphael Amancio
Support type:	Scholarships Abroad - Research


FAPESP grant:	16/19069-9 - Document classification using semantic information in complex networks
Grantee:	Diego Raphael Amancio
Support type:	Research Grants - Regular
