

Artist Similarity Based on Heterogeneous Graph Neural Networks

Author(s):
da Silva, Angelo Cesar Mendes ; Silva, Diego Furtado ; Marcacini, Ricardo Marcondes
Total number of authors: 3
Document type: Scientific article
Source: IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING; v. 32, 13 pp., 2024-01-01.
Abstract

Music streaming platforms rely on recommending similar artists to maintain user engagement, with artists benefiting from these suggestions to boost their popularity. Another important feature is music information retrieval, allowing users to explore new content. In both scenarios, performance depends on how to compute the similarity between musical content. This is a challenging process since musical data is inherently multimodal, containing textual and audio data. We propose a novel graph-based artist representation that integrates audio, lyrics features, and artist relations. Thus, a multimodal representation on a heterogeneous graph is proposed, along with a network regularization process followed by a GNN model to aggregate multimodal information into a more robust unified representation. The proposed method explores this final multimodal representation for the task of artist similarity as a link prediction problem. Our method introduces a new importance matrix to emphasize related artists in this multimodal space. We compare our approach with other strong baselines based on combining input features, importance matrix construction, and GNN models. Experimental results highlight the superiority of multimodal representation through the transfer learning process and the value of the importance matrix in enhancing GNN models for artist similarity. (AU)
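The idea of treating artist similarity as link prediction over aggregated multimodal features can be illustrated with a toy sketch. This is not the authors' model: it assumes a plain homogeneous graph, simple feature concatenation, a single round of mean neighbor aggregation in place of a trained GNN, and cosine similarity as the link score. The names `aggregate` and `link_scores`, and all feature values, are hypothetical and chosen only for illustration.

```python
import numpy as np


def aggregate(features, adjacency):
    """One round of mean aggregation over each node and its neighbors.

    Assumption: an untrained stand-in for a GNN layer, with self-loops.
    """
    a_hat = adjacency + np.eye(adjacency.shape[0])  # add self-loops
    degree = a_hat.sum(axis=1, keepdims=True)       # neighbors + self
    return (a_hat / degree) @ features              # row-normalized mean


def link_scores(embeddings):
    """Cosine similarity between every pair of node embeddings."""
    norms = np.linalg.norm(embeddings, axis=1, keepdims=True)
    unit = embeddings / np.clip(norms, 1e-12, None)  # avoid division by zero
    return unit @ unit.T


# Toy data (made up): 4 artists, 2-d audio and 2-d lyrics features each.
audio = np.array([[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]])
lyrics = np.array([[1.0, 0.0], [0.8, 0.2], [0.0, 1.0], [0.2, 0.8]])
features = np.concatenate([audio, lyrics], axis=1)  # naive multimodal fusion

# Known artist relations: 0-1 and 2-3 are related.
adjacency = np.array(
    [[0, 1, 0, 0],
     [1, 0, 0, 0],
     [0, 0, 0, 1],
     [0, 0, 1, 0]],
    dtype=float,
)

embeddings = aggregate(features, adjacency)
scores = link_scores(embeddings)  # scores[i, j]: predicted similarity of i and j
```

In this sketch, candidate links would be ranked by `scores[i, j]`; a trained model would instead learn the aggregation weights and scoring function from known artist relations.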

FAPESP grant: 19/07665-4 - Center for Artificial Intelligence
Grantee: Fabio Gagliardi Cozman
Support type: Research Grants - eScience and Data Science Program - Research Centers in Engineering
FAPESP grant: 22/14903-1 - Knowledge Transfer in Multimodal Learning for Emotion Recognition in Videos
Grantee: Gabriel Natal Coutinho
Support type: Scholarships in Brazil - Scientific Initiation