A rank aggregation framework for video multimodal geocoding

Li, Lin Tzy; Guimaraes Pedronette, Daniel Carlos; Almeida, Jurandy; Penatti, Otavio A. B.; Calumby, Rodrigo Tripodi; Torres, Ricardo da Silva

Texto completo
Autor(es):	Li, Lin Tzy ^{[1, 2]} ; Guimaraes Pedronette, Daniel Carlos ^{[3, 1]} ; Almeida, Jurandy ^[1] ; Penatti, Otavio A. B. ^[1] ; Calumby, Rodrigo Tripodi ^{[1, 4]} ; Torres, Ricardo da Silva ^[1] Número total de Autores: 6
Afiliação do(s) autor(es):	^[1] Univ Campinas UNICAMP, Inst Comp, RECOD Lab, BR-13083852 Campinas, SP - Brazil ^[2] CPqD Fdn, Telecommun Res & Dev Ctr, BR-13086902 Campinas, SP - Brazil ^[3] Univ Estadual Paulista UNESP, Dept Stat Appl Math & Comp, BR-13506900 Rio Claro, SP - Brazil ^[4] Univ Feira Santana UEFS, Dept Exact Sci, BR-44036900 Feira De Santana, BA - Brazil Número total de Afiliações: 4
Tipo de documento:	Artigo Científico
Fonte:	MULTIMEDIA TOOLS AND APPLICATIONS; v. 73, n. 3, p. 1323-1359, DEC 2014.
Citações Web of Science:	3
Resumo
This paper proposes a rank aggregation framework for video multimodal geocoding. Textual and visual descriptions associated with videos are used to define ranked lists. These ranked lists are later combined, and the resulting ranked list is used to define appropriate locations for videos. An architecture that implements the proposed framework is designed. In this architecture, there are specific modules for each modality (e. g, textual and visual) that can be developed and evolved independently. Another component is a data fusion module responsible for combining seamlessly the ranked lists defined for each modality. We have validated the proposed framework in the context of the MediaEval 2012 Placing Task, whose objective is to automatically assign geographical coordinates to videos. Obtained results show how our multimodal approach improves the geocoding results when compared to methods that rely on a single modality (either textual or visual descriptors). We also show that the proposed multimodal approach yields comparable results to the best submissions to the Placing Task in 2012 using no extra information besides the available development/training data. Another contribution of this work is related to the proposal of a new effectiveness evaluation measure. The proposed measure is based on distance scores that summarize how effective a designed/tested approach is, considering its overall result for a test dataset. (AU)

Processo FAPESP:	09/10554-8 - Explorando Dicionários Visuais em Buscas de Imagens na Web
Beneficiário:	Otávio Augusto Bizetto Penatti
Modalidade de apoio:	Bolsas no Brasil - Doutorado


Processo FAPESP:	11/11171-5 - Gestão de Séries Temporais do e-Phenology
Beneficiário:	Jurandy Gomes de Almeida Junior
Modalidade de apoio:	Bolsas no Brasil - Pós-Doutorado

URL curto