Multimodal Models for Images and 3D Representations in a Unified Vision and Langua...
Noisy scene graph with self-supervised learning on graph neural network for visual...
Brazil in focus: a multimodal large language model aware of the brazilian context ...
Multilingual and multimodal learning for Brazilian Portuguese
Expanded Sound Body (ESB): expanded spaces between sound, movement and technology ...