Multimodal Models for Images and 3D Representations in a Unified Vision and Langua...
Noisy scene graph with self-supervised learning on graph neural network for visual...
Multimodal Domain Adaptation and Deep Learning for Medical Image Analysis
Brazil in focus: a multimodal large language model aware of the brazilian context ...