(Reference retrieved automatically from Web of Science, based on the FAPESP funding information and the corresponding grant number included in the publication by the authors.)

Evaluating Richer Features and Varied Machine Learning Models for Subjectivity Classification of Book Review Sentences in Portuguese

Author(s):
Belisario, Luana Balador [1] ; Ferreira, Luiz Gabriel [1] ; Salgueiro Pardo, Thiago Alexandre [1]
Total number of authors: 3
Author affiliation(s):
[1] Univ Sao Paulo, Interinst Ctr Computat Linguist NILC, Inst Math & Comp Sci, BR-13566590 Sao Carlos, SP - Brazil
Total number of affiliations: 1
Document type: Review article
Source: INFORMATION; v. 11, n. 9 SEP 2020.
Web of Science citations: 0
Abstract

Texts published on social media have been a valuable source of information for companies and users, as analyzing this data helps improve and select products and services of interest. Due to the huge amount of data, techniques for automatically analyzing user opinions are necessary. The research field that investigates these techniques is called sentiment analysis. This paper focuses specifically on the task of subjectivity classification, which aims to predict whether a text passage conveys an opinion. We report the study and comparison of machine learning methods of different paradigms to perform subjectivity classification of book review sentences in Portuguese, which have proven to be a challenging domain in the area. Specifically, we explore richer features for the task, using several lexical, centrality-based and discourse features. We show the contributions of the different feature sets and provide evidence that the combination of lexical, centrality-based and discourse features produces better results than any of the feature sets individually. Additionally, by analyzing the achieved results and the knowledge acquired by some symbolic machine learning methods, we show that some discourse relations may clearly signal subjectivity. Our corpus annotation also reveals some distinctive discourse structuring patterns for sentence subjectivity. (AU)
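
The idea of combining lexical, centrality-based and discourse features into one representation per sentence can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and not the authors' implementation: the toy opinion lexicon `OPINION_WORDS`, the word-overlap graph used for degree centrality, the relation label set `DISCOURSE_RELATIONS`, and the English example sentences (the paper works on Portuguese).

```python
# Hypothetical toy lexicon; the paper's actual lexical resources are not given here.
OPINION_WORDS = {"great", "boring", "loved", "terrible", "amazing"}

# Hypothetical label set; the abstract does not name the discourse relations used.
DISCOURSE_RELATIONS = ["elaboration", "evaluation", "contrast"]


def tokenize(sentence):
    """Lowercase whitespace tokens with surrounding punctuation stripped."""
    stripped = (w.strip(".,!?;:").lower() for w in sentence.split())
    return [w for w in stripped if w]


def lexical_features(tokens):
    """Count and proportion of tokens found in the toy opinion lexicon."""
    hits = sum(1 for t in tokens if t in OPINION_WORDS)
    return [hits, hits / len(tokens) if tokens else 0.0]


def centrality_features(index, all_tokens):
    """Degree centrality in a graph linking sentences that share a word."""
    own = set(all_tokens[index])
    neighbours = sum(
        1 for j, other in enumerate(all_tokens)
        if j != index and own & set(other)
    )
    denom = len(all_tokens) - 1
    return [neighbours / denom if denom else 0.0]


def discourse_features(relation):
    """One-hot encoding of the sentence's (externally annotated) relation."""
    return [1.0 if relation == r else 0.0 for r in DISCOURSE_RELATIONS]


def combined_features(review, relations):
    """Concatenate the three feature sets into one vector per sentence."""
    all_tokens = [tokenize(s) for s in review]
    return [
        lexical_features(toks)
        + centrality_features(i, all_tokens)
        + discourse_features(rel)
        for i, (toks, rel) in enumerate(zip(all_tokens, relations))
    ]
```

The resulting vectors could then be passed to any classifier, which is what makes it possible to compare feature sets individually and in combination across different learning methods.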

FAPESP grant: 18/11479-9 - Subjectivity classification for the Portuguese language
Grantee: Luiz Gabriel Ferreira
Support type: Scholarships in Brazil - Scientific Initiation