Advanced search
Start date
Betweenand


C-Rank: A Concept Linking Approach to Unsupervised Keyphrase Extraction

Full text
Author(s):
Lucca Tosi, Mauro Dalle ; dos Reis, Julio Cesar ; Garoufallou, E ; Fallucchi, F ; DeLuca, EW
Total Authors: 5
Document type: Journal article
Source: METADATA AND SEMANTIC RESEARCH, MTSR 2019; v. 1057, p. 12-pg., 2019-01-01.
Abstract

Keyphrase extraction is the task of identifying a set of phrases that best represent a natural language document. It is a fundamental and challenging task that assists publishers to index and recommend relevant documents to readers. In this article, we introduce C-Rank, a novel unsupervised approach to automatically extract keyphrases from single documents by using concept linking. Our method explores Babelfy to identify candidate keyphrases, which are weighted based on heuristics and their centrality inside a co-occurrence graph where keyphrases appear as vertices. It improves the results obtained by graph-based techniques without training nor background data inserted by users. Evaluations are performed on SemEval and INSPEC datasets, producing competitive results with state-of-the-art tools. Furthermore, C-Rank generates intermediate structures with semantically annotated data that can be used to analyze larger textual compendiums, which might improve domain understatement and enrich textual representation methods. (AU)

FAPESP's process: 17/02325-5 - EvOLoD: linked data evolution on the Semantic Web
Grantee:Julio Cesar dos Reis
Support Opportunities: Research Grants - Young Investigators Grants
FAPESP's process: 13/08293-7 - CCES - Center for Computational Engineering and Sciences
Grantee:Munir Salomao Skaf
Support Opportunities: Research Grants - Research, Innovation and Dissemination Centers - RIDC