Multilingual and multimodal learning for Brazilian Portuguese

Grant number: 20/15995-1
Support Opportunities: Scholarships in Brazil - Scientific Initiation
Start date: April 01, 2021
End date: December 31, 2022
Field of knowledge: Physical Sciences and Mathematics - Computer Science
Principal Investigator: Helena de Medeiros Caseli
Grantee: Júlia Yumi Araújo Sato
Host Institution: Centro de Ciências Exatas e de Tecnologia (CCET). Universidade Federal de São Carlos (UFSCAR). São Carlos, SP, Brazil
Associated scholarship(s): 22/04442-7 - Multilingual and multimodal learning for Brazilian Portuguese, BE.EP.IC

Abstract

Humans constantly deal with multimodal information, that is, data in different modalities, such as text and images. For machines to process information as humans do, they must be able to handle multimodal data and model the joint relationship between modalities, rather than treating text or images in isolation. This multimodal aspect of learning can be very useful in multilingual applications, that is, applications that involve two or more languages. This project proposes an extension of the VTLM (Visual Translation Language Modelling) framework, an approach recently published by Caglayan et al. (2021). To accomplish this goal, we will use the multimodal and multilingual dataset How2 (Sanabria et al., 2018) as three parallel streams of aligned English, Portuguese, and visual information, and explore more informed masking strategies for visual regions. Language will thus be grounded in the image regions jointly for the source and target languages, producing a multilingual and multimodal model useful for several NLP applications. (AU)

Scientific publications
(References retrieved automatically from Web of Science and SciELO through information on FAPESP grants and their corresponding numbers as mentioned in the publications by the authors)
SATO, JULIA; CASELI, HELENA; SPECIA, LUCIA; MARIANI, J; CALZOLARI, N; BECHET, F; BLACHE, P; CHOUKRI, K; CIERI, C; DECLERCK, T; et al. Multilingual and Multimodal Learning for Brazilian Portuguese. LREC 2022: THIRTEENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 9 p. (20/15995-1)