Advanced search
Start date
Betweenand
(Reference retrieved automatically from Web of Science through information on FAPESP grant and its corresponding number as mentioned in the publication by the authors.)

Explaining dimensionality reduction results using Shapley values

Full text
Author(s):
Marcilio-Jr, Wilson E. ; Eler, Danilo M. [1]
Total Authors: 2
Affiliation:
[1] Marcilio-Jr, Jr., Wilson E., Sao Paulo State Univ UNESP, Fac Sci & Technol, BR-19060900 Presidente Prudente, SP - Brazil
Total Affiliations: 1
Document type: Journal article
Source: EXPERT SYSTEMS WITH APPLICATIONS; v. 178, SEP 15 2021.
Web of Science Citations: 0
Abstract

Dimensionality reduction (DR) techniques have been consistently supporting high-dimensional data analysis in various applications. Besides the patterns uncovered by these techniques, the interpretation of DR results based on each feature's contribution to the low-dimensional representation supports new finds through exploratory analysis. Current literature approaches designed to interpret DR techniques do not explain the features' contributions well since they focus only on the low-dimensional representation or do not consider the relationship among features. This paper presents ClusterShapley to address these problems, using Shapley values to generate explanations of dimensionality reduction techniques and interpret these algorithms using a cluster-oriented analysis. ClusterShapley explains the formation of clusters and the meaning of their relationship, which is useful for exploratory data analysis in various domains. We propose novel visualization techniques to guide the interpretation of features' contributions on clustering formation and validate our methodology through case studies of publicly available datasets. The results demonstrate our approach's interpretability and analysis power to generate insights about pathologies and patients in different conditions using DR results. (AU)

FAPESP's process: 18/17881-3 - Multilevel visual representation to assist the exploration of data sets
Grantee:Danilo Medeiros Eler
Support Opportunities: Regular Research Grants
FAPESP's process: 18/25755-8 - Multilevel visual representation to assist the exploration of data sets
Grantee:Wilson Estécio Marcílio Júnior
Support Opportunities: Scholarships in Brazil - Technical Training Program - Technical Training