Busca avançada
Ano de início
Entree
(Referência obtida automaticamente do Web of Science, por meio da informação sobre o financiamento pela FAPESP e o número do processo correspondente, incluída na publicação pelos autores.)

Attribute-based Decision Graphs: A framework for multiclass data classification

Texto completo
Autor(es):
Bertini Junior, Joao Roberto ; Nicoletti, Maria do Carmo ; Zhao, Liang
Número total de Autores: 3
Tipo de documento: Artigo Científico
Fonte: NEURAL NETWORKS; v. 85, p. 69-84, JAN 2017.
Citações Web of Science: 2
Resumo

Graph-based algorithms have been successfully applied in machine learning and data mining tasks. A simple but, widely used, approach to build graphs from vector-based data is to consider each data instance as a vertex and connecting pairs of it using a similarity measure. Although this abstraction presents some advantages, such as arbitrary shape representation of the original data, it is still tied to some drawbacks, for example, it is dependent on the choice of a pre-defined distance metric and is biased by the local information among data instances. Aiming at exploring alternative ways to build graphs from data, this paper proposes an algorithm for constructing a new type of graph, called Attribute-based Decision Graph - AbDG. Given a vector-based data set, an AbDG is built by partitioning each data attribute range into disjoint intervals and representing each interval as a vertex. The edges are then established between vertices from different attributes according to a pre-defined pattern. Classification is performed through a matching process among the attribute values of the new instance and AbDG. Moreover, AbDG provides an inner mechanism to handle missing attribute values, which contributes for expanding its applicability. Results of classification tasks have shown that AbDG is a competitive approach when compared to well-known multiclass algorithms. The main contribution of the proposed framework is the combination of the advantages of attribute-based and graph-based techniques to perform robust pattern matching data classification, while permitting the analysis the input data considering only a subset of its attributes. (C) 2016 Elsevier Ltd. All rights reserved. (AU)

Processo FAPESP: 12/00544-8 - Classificação de dados com distribuição estacionária, não estacionária e com escassez de dados rotulados por meio de abordagens baseadas em grafos
Beneficiário:João Roberto Bertini Junior
Modalidade de apoio: Bolsas no Brasil - Pós-Doutorado