Busca avançada
Ano de início
Entree


Building a Corpus for Personality-dependent Natural Language Understanding and Generation

Autor(es):
Mostrar menos -
Ramos, R. M. S. ; Neto, G. B. S. ; Silva, B. B. C. ; Monteiro, D. S. ; Paraboni, I ; Dias, R. F. S. ; Declerck, T ; Calzolari, N ; Choukri, K ; Cieri, C ; Hasida, K ; Isahara, H ; Maegaard, B ; Mariani, J ; Moreno, A ; Odijk, J ; Piperidis, S ; Tokunaga, T ; Goggi, S ; Mazo, H
Número total de Autores: 20
Tipo de documento: Artigo Científico
Fonte: PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018); v. N/A, p. 8-pg., 2018-01-01.
Resumo

The computational treatment of human personality - both for the recognition of personality traits from text and for the generation of text so as to reflect a particular set of traits - is central to the development of NLP applications. As a means to provide a basic resource for studies of this kind, this article describes the b5 corpus, a collection of controlled and free (non-topic specific) texts produced in different (e.g., referential or descriptive) communicative tasks, and accompanied by inventories of personality of their authors and additional demographics. The present discussion is mainly focused on the various corpus components and on the data collection task itself, but preliminary results of personality recognition from text are presented in order to illustrate how the corpus data may be reused. The b5 corpus aims to provide support for a wide range of NLP studies based on personality information and it is, to the best of our knowledge, the largest resource of this kind to be made available for research purposes in the Brazilian Portuguese language. (AU)

Processo FAPESP: 16/14223-0 - Tratamento Computacional da Personalidade Humana para Aplicações de Processamento de Língua Natural
Beneficiário:Ivandre Paraboni
Modalidade de apoio: Auxílio à Pesquisa - Regular