Scholarship 24/07969-1 - Artificial intelligence, Natural language processing

Grant number:	24/07969-1
Support Opportunities:	Scholarships in Brazil - Doctorate (Direct)
Start date:	October 01, 2024
Status:	Discontinued
Field of knowledge:	Physical Sciences and Mathematics - Computer Science

Principal Investigator:	Sandra Eliza Fontes de Avila
Grantee:	Gabriel Oliveira dos Santos

Host Institution:	Instituto de Computação (IC). Universidade Estadual de Campinas (UNICAMP). Campinas, SP, Brazil

Associated scholarship(s):	25/00837-5 - Multilingual Vision Language Model with In-Context Learning Ability, BE.EP.DD

Abstract

The field of Natural Language Processing (NLP) has undergone significant transformations, driven primarily by Large Language Models (LLMs). However, an inherent limitation of these models is their inability to process data modalities other than text. To address this, several Multimodal Large Language Models (MLLMs) have been proposed in recent years to extend LLMs to other modalities. Despite these advances, the existing literature predominantly focuses on high-resource languages and neglects cultural aspects, perpetuating biases towards dominant worldviews. In light of this, this research proposes constructing an MLLM tailored to the Portuguese language and the Brazilian context. Specifically, we aim to develop a framework for building an MLLM capable of generating image descriptions in Portuguese, and to allow its knowledge of the Brazilian context to be continuously updated by integrating a Retrieval Augmented Generation (RAG) pipeline into the MLLM. Furthermore, since we are working under a data-restricted scenario, we intend to leverage pre-trained LLMs specialized in Portuguese and to propose a block that connects the visual encoder to the LLM so that our MLLM can perform tasks through in-context learning. Existing proposals in the literature are computationally expensive; in contrast, we aim to train our model at a low cost. Additionally, we aim to conduct a case study in which our framework is applied to identify manifestations of Brazilian culture. We hypothesize that conditioning caption generation on Brazil-centered data will enhance our model's capacity to recognize elements of Brazilian culture. In this sense, we seek to contribute to advancing NLP beyond English-centric paradigms and to empowering Brazilians with linguistically accurate and contextually relevant systems. (AU)

Scientific publications

(References retrieved automatically from Web of Science and SciELO through information on FAPESP grants and their corresponding numbers as mentioned in the publications by the authors)

CAETANO, CARLOS; DOS SANTOS, GABRIEL O.; PETRUCCI, CAIO; BARROS, ARTUR; LARANJEIRA, CAMILA; FERRAZ RIBEIRO, LEO SAMPAIO; DE MENDONCA, JULIA FERNANDES; DOS SANTOS, JEFERSSON A.; AVILA, SANDRA. Neglected Risks: The Disturbing Reality of Children's Images in Datasets and the Urgent Call for Accountability. PROCEEDINGS OF THE 2025 ACM CONFERENCE ON FAIRNESS, ACCOUNTABILITY, AND TRANSPARENCY, ACM FACCT 2025, 12 pp., 2025-01-01. (22/14690-8, 24/01210-3, 13/08293-7, 20/09838-0, 23/12086-9, 24/09375-1, 24/07969-1, 24/09372-2)
