Verb Clustering for Brazilian Portuguese

Author(s):
Scarton, Carolina ; Sun, Lin ; Kipper-Schuler, Karin ; Duran, Magali Sanches ; Palmer, Martha ; Korhonen, Anna ; Gelbukh, A
Total Authors: 7
Document type: Journal article
Source: COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, CICLING 2014, PT I; v. 8403, p. 15-pg., 2014-01-01.
Abstract

Levin-style classes which capture the shared syntax and semantics of verbs have proven useful for many Natural Language Processing (NLP) tasks and applications. However, lexical resources which provide information about such classes are only available for a handful of the world's languages. Because manual development of such resources is extremely time-consuming and cannot reliably capture domain variation in classification, methods for automatic induction of verb classes from texts have gained popularity. However, to date such methods have been applied to English and a handful of other, mainly resource-rich languages. In this paper, we apply the methods to Brazilian Portuguese - a language for which no VerbNet or automatic class induction work exists yet. Since Levin-style classification is said to have a strong cross-linguistic component, we use unsupervised clustering techniques similar to those developed for English without language-specific feature engineering. This yields interesting results which line up well with those obtained for other languages, demonstrating the cross-linguistic nature of this type of classification. However, we also discover and discuss issues which require specific consideration when aiming to optimise the performance of verb clustering for Brazilian Portuguese and other less-resourced languages. (AU)

FAPESP's process: 10/03785-0 - VerbNet.Br: semiautomatic building of an online and domain-independent verb lexicon for the Brazilian Portuguese Language
Grantee: Carolina Evaristo Scarton
Support Opportunities: Scholarships in Brazil - Master
FAPESP's process: 11/22882-0 - Automatic verb classification using VerbNet-style
Grantee: Carolina Evaristo Scarton
Support Opportunities: Scholarships abroad - Research Internship - Master's degree