Scholarship 11/22882-0 - Artificial intelligence, Natural language processing - BV FAPESP

Automatic verb classification using VerbNet-style

Grant number: 11/22882-0
Support Opportunities: Scholarships abroad - Research Internship - Master's degree
Start date: April 01, 2012
End date: August 31, 2012
Field of knowledge: Physical Sciences and Mathematics - Computer Science - Computing Methodologies and Techniques
Principal Investigator: Sandra Maria Aluísio
Grantee: Carolina Evaristo Scarton
Supervisor: Anna Korhonen
Host Institution: Instituto de Ciências Matemáticas e de Computação (ICMC). Universidade de São Paulo (USP). São Carlos, SP, Brazil
Institution abroad: University of Cambridge, England
Associated with the scholarship: 10/03785-0 - VerbNet.Br: semiautomatic building of an online and domain-independent verb lexicon for the Brazilian Portuguese Language, BP.MS

Abstract

Building computational lexical resources manually is impractical, mainly because it is laborious and time-consuming. Natural Language Processing aims to ease this task with automatic and semiautomatic methods for building these resources. One approach applies machine learning to corpora. Another is cross-linguistic: it uses existing computational lexical resources to build a new one. In this project we will follow the first approach, exploiting clustering techniques to find syntactic-semantic verb classes for Brazilian Portuguese. Specifically, we intend to follow the VerbNet style to perform verb clustering. VerbNet is a domain-independent verb lexicon with syntactic and semantic information about English verbs, based on Levin's verb classes and with mappings to the Princeton WordNet. The results of this project will be compared with those of VerbNet.Br (the grantee's Master's project). VerbNet.Br is being built semiautomatically, following the cross-linguistic approach, from the mappings among existing computational lexical resources: VerbNet, WordNet and WordNet.Br. The aim of the comparison is to verify whether the semiautomatic method (more expensive) yields more accurate results than the clustering method (cheaper). (AU)
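
As a rough illustration of the corpus-based approach, the sketch below groups verbs by how similarly they distribute over syntactic frames. The abstract does not name a clustering algorithm or feature set, so everything here is an assumption: the verbs, frame labels, and counts are invented toy data, and simple average-link agglomerative clustering over cosine similarity stands in for whatever method the project actually used.

```python
import math

# Hypothetical toy data: how often each verb occurs with each syntactic
# frame in a corpus. Frame labels and counts are invented for illustration.
FRAMES = ["NP", "NP_PP", "S_COMP", "INTRANS"]
VERB_FRAME_COUNTS = {
    "dizer":   [5, 2, 40, 1],
    "afirmar": [3, 1, 35, 0],
    "colocar": [4, 45, 0, 1],
    "pôr":     [6, 38, 1, 0],
    "correr":  [2, 5, 0, 30],
    "andar":   [1, 6, 0, 28],
}


def cosine(u, v):
    """Cosine similarity between two non-zero count vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def average_link(c1, c2, vectors):
    """Mean pairwise similarity between the members of two clusters."""
    sims = [cosine(vectors[a], vectors[b]) for a in c1 for b in c2]
    return sum(sims) / len(sims)


def cluster_verbs(vectors, k):
    """Merge the two most similar clusters until only k remain."""
    clusters = [[verb] for verb in vectors]
    while len(clusters) > k:
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                sim = average_link(clusters[i], clusters[j], vectors)
                if best is None or sim > best[0]:
                    best = (sim, i, j)
        _, i, j = best
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]
    return [sorted(c) for c in clusters]


clusters = cluster_verbs(VERB_FRAME_COUNTS, k=3)
for members in clusters:
    print(members)
```

On this toy data the verbs that share a dominant frame end up together, which is the intuition behind inducing verb classes from corpus evidence. Comparing such induced clusters against the classes of a hand-checked lexicon is one way the accuracy comparison described in the abstract could be framed.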

Scientific publications
(References retrieved automatically from Web of Science and SciELO through information on FAPESP grants and their corresponding numbers as mentioned in the publications by the authors)
SCARTON, CAROLINA; SUN, LIN; KIPPER-SCHULER, KARIN; DURAN, MAGALI SANCHES; PALMER, MARTHA; KORHONEN, ANNA; GELBUKH, A. Verb Clustering for Brazilian Portuguese. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, CICLING 2014, PT I, v. 8403, 15 pp. (10/03785-0, 11/22882-0)