Busca avançada
Ano de início
Entree


Walk-Based Diversification for Data Summarization

Texto completo
Autor(es):
Oliva, Samuel Zanferdini ; Felipe, Joaquim Cezar ; Rocha, A ; Ferras, C ; Marin, CEM ; Garcia, VHM
Número total de Autores: 6
Tipo de documento: Artigo Científico
Fonte: INFORMATION TECHNOLOGY AND SYSTEMS, ICITS 2020; v. 1137, p. 10-pg., 2020-01-01.
Resumo

Due to the large amount of data stored in current information systems, new strategies are required in order to extract useful information from databases. Hereupon, data summarization is an interesting process that allows reducing a large database maintaining just the relevant parts of the whole collection. In this study, we propose a new approach for data summarization based on a recently proposed tourist walk diversification method. This approach allows setting two ways of selecting elements considering density and hyper volume of each class. In order to evaluate the proposed approach, we compared it with two known methods of the literature considering one real world dataset and one artificial dataset. The artificial dataset was created considering different data distribution aspects. The conducted experiments outcomes demonstrate that our proposed data summarization approach is a promising alternative for addressing the problem of selecting elements from large databases considering different aspects of distribution. (AU)

Processo FAPESP: 16/17078-0 - Mineração, indexação e visualização de Big Data no contexto de sistemas de apoio à decisão clínica (MIVisBD)
Beneficiário:Agma Juci Machado Traina
Modalidade de apoio: Auxílio à Pesquisa - Temático