Busca avançada
Ano de início
Entree


Word sense disambiguation: an evaluation study of semi-supervised approaches with word embeddings

Texto completo
Autor(es):
Sousa, Samuel ; Milios, Evangelos ; Berton, Lilian ; IEEE
Número total de Autores: 4
Tipo de documento: Artigo Científico
Fonte: 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN); v. N/A, p. 8-pg., 2020-01-01.
Resumo

Word Sense Disambiguation (WSD) is a well-known problem in the field of Natural Language Processing (NLP) related to automatically determining the most appropriate sense of words in context. Several machine learning-based approaches have been proposed to tackle the ambiguity of language, but the lack of labeled data to train supervised models made semi-supervised learning (SSL) appear as an attractive option. Furthermore, the use of word embeddings to enhance the results of NLP tasks was shown to be an efficient strategy. Thus, this paper aims at adapting semi-supervised algorithms for WSD using word embeddings from Word2Vec, FastText, and BERT models combined with part-of-speech tags as input. We conduct a systematic evaluation of four graph-based SSL models analyzing the influence of their hyperparameters on the results, as well as the distances to build the graphs, the percentages of labeled data, and the word embeddings architectural variations. As a result, we show that SSL algorithms which received 10% of labeled data are strong baselines on the subsets of nouns and adjectives. Additionally, these algorithms do not need further training to disambiguate new words, hence being competitive to supervised systems. (AU)

Processo FAPESP: 18/01722-3 - Aprendizado semissupervisionado via redes complexas: construção de redes, seleção e propagação de rótulos e aplicações
Beneficiário:Lilian Berton
Modalidade de apoio: Auxílio à Pesquisa - Regular
Processo FAPESP: 18/09465-0 - Desambiguação de palavras via algoritmos semissupervisionados baseados em grafos
Beneficiário:Samuel Bruno da Silva Sousa
Modalidade de apoio: Bolsas no Brasil - Mestrado