Exploratory Data Analysis in Electronic Health Records Graphs: Intuitive Features and Visualization Tools

Author(s):
Cazzolato, Mirela T. ; Gutierrez, Marco Antonio ; Traina, Caetano, Jr. ; Faloutsos, Christos ; Traina, Agma J. M. ; Almeida, JR ; Spiliopoulou, M ; Andrades, JAB ; Placidi, G ; Gonzalez, AR ; Sicilia, R ; Kane, B
Total Authors: 12
Document type: Journal article
Source: 2023 IEEE 36TH INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS, CBMS; v. N/A, p. 6-pg., 2023-01-01.
Abstract

Given a large, unlabeled set of Electronic Health Records (EHRs) acquired from multiple hospitals, how can we analyze the available entities and identify relationships in the data? Also, how can we perform Exploratory Data Analysis (EDA) over such EHR data? Many medical institutions generate EHRs as tabular data with entities and attributes in common. However, due to the large number of records and attributes and their high cardinality, exploring the different datasets and finding patterns and insights becomes laborious and error-prone. In this work, we propose GraF-EDA for EDA over EHR data from different institutions. GraF-EDA models EHRs as time-evolving graphs, integrating such data into a single, interoperable representation. We extract meaningful features from the graph nodes and provide intuitive visualizations to improve data explainability. We evaluate GraF-EDA with four COVID-19 datasets from hospitals in the state of São Paulo, Brazil, resulting in million-scale graphs. Our method identified correlations, similarities, and dissimilarities among medical treatments, exams, clinics, and outcomes. With the visual tools provided by GraF-EDA, we were able to spot cases of interest and inspect them in more detail. Our results indicate that GraF-EDA is a fast, effective, open-source tool for EDA of EHRs from multiple institutions. (AU)
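
As an illustration of the general idea in the abstract (tabular EHR rows turned into a time-stamped graph, with simple features computed per node), a minimal sketch might look like the following. It is not the GraF-EDA implementation: the toy records, the node typing, and the three features (`edge_count`, `unique_neighbors`, `active_days`) are assumptions chosen for illustration only.

```python
from collections import defaultdict
from datetime import date

# Hypothetical toy rows: (patient_id, exam_name, exam_date).
records = [
    ("p1", "hemogram", date(2020, 3, 1)),
    ("p1", "pcr_covid", date(2020, 3, 2)),
    ("p2", "pcr_covid", date(2020, 3, 2)),
    ("p2", "hemogram", date(2020, 3, 5)),
    ("p3", "pcr_covid", date(2020, 3, 5)),
]


def build_temporal_edges(rows):
    """Turn each tabular row into a time-stamped edge between typed nodes."""
    return [(("patient", pid), ("exam", exam), day) for pid, exam, day in rows]


def node_features(edges):
    """Compute simple per-node features (illustrative, not the paper's set)."""
    acc = defaultdict(
        lambda: {"edge_count": 0, "neighbors": set(), "first": None, "last": None}
    )
    for u, v, day in edges:
        # The graph is treated as undirected: update both endpoints.
        for node, other in ((u, v), (v, u)):
            entry = acc[node]
            entry["edge_count"] += 1
            entry["neighbors"].add(other)
            entry["first"] = day if entry["first"] is None else min(entry["first"], day)
            entry["last"] = day if entry["last"] is None else max(entry["last"], day)
    return {
        node: {
            "edge_count": entry["edge_count"],
            "unique_neighbors": len(entry["neighbors"]),
            # Days between the first and last event touching this node.
            "active_days": (entry["last"] - entry["first"]).days,
        }
        for node, entry in acc.items()
    }


features = node_features(build_temporal_edges(records))
for node, feats in sorted(features.items()):
    print(node, feats)
```

Typing each node as a `(kind, identifier)` pair keeps patient and exam identifiers from colliding when records from several institutions are merged into one graph.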

FAPESP's process: 16/17078-0 - Mining, indexing and visualizing Big Data in clinical decision support systems (MIVisBD)
Grantee: Agma Juci Machado Traina
Support Opportunities: Research Projects - Thematic Grants
FAPESP's process: 21/11403-5 - Mining multimodal records: explainable patterns and anomalies discovery
Grantee: Mirela Teixeira Cazzolato
Support Opportunities: Scholarships abroad - Research Internship - Post-doctor
FAPESP's process: 20/11258-2 - Interoperability and similarity queries on medical databases
Grantee: Mirela Teixeira Cazzolato
Support Opportunities: Scholarships in Brazil - Post-Doctoral