Verb Clustering for Brazilian Portuguese

Author(s):
Scarton, Carolina ; Sun, Lin ; Kipper-Schuler, Karin ; Duran, Magali Sanches ; Palmer, Martha ; Korhonen, Anna ; Gelbukh, A
Total number of authors: 7
Document type: Scientific article
Source: COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, CICLING 2014, PT I; v. 8403, 15 pp., 2014-01-01.
Abstract

Levin-style classes, which capture the shared syntax and semantics of verbs, have proven useful for many Natural Language Processing (NLP) tasks and applications. However, lexical resources that provide information about such classes are available for only a handful of the world's languages. Because manual development of such resources is extremely time-consuming and cannot reliably capture domain variation in classification, methods for automatic induction of verb classes from texts have gained popularity. However, to date such methods have been applied only to English and a handful of other, mainly resource-rich, languages. In this paper, we apply the methods to Brazilian Portuguese, a language for which no VerbNet or automatic class induction work exists yet. Since Levin-style classification is said to have a strong cross-linguistic component, we use unsupervised clustering techniques similar to those developed for English, without language-specific feature engineering. This yields interesting results which line up well with those obtained for other languages, demonstrating the cross-linguistic nature of this type of classification. However, we also discover and discuss issues which require specific consideration when aiming to optimise the performance of verb clustering for Brazilian Portuguese and other less-resourced languages.

FAPESP grant: 10/03785-0 - VerbNet.Br: semi-automatic construction of an online, domain-independent verb lexicon for Brazilian Portuguese
Grantee: Carolina Evaristo Scarton
Support type: Scholarships in Brazil - Master's
FAPESP grant: 11/22882-0 - Automatic classification of verbs in the VerbNet taxonomy
Grantee: Carolina Evaristo Scarton
Support type: Scholarships abroad - Research Internship - Master's