Multimodal Models for Images and 3D Representations in a Unified Vision and Langua...
Visual question answering task with graph convolution networks
Brazil in focus: a multimodal large language model aware of the brazilian context ...
Image Foresting Transform using non-smooth connectivity functions and its applicat...