Deep Generative Models: The winning key for large and easily accessible ECG datasets?

Giuliana Monachino; Beatrice Zanchi; Luigi Fiorillo; Giulio Conte; Angelo Auricchio; Athina Tzovara; Francesca Dalia Faraci

doi:10.1016/j.compbiomed.2023.107655

Comput Biol Med. 2023 Dec:167:107655. doi: 10.1016/j.compbiomed.2023.107655. Epub 2023 Nov 2.

Authors

Giuliana Monachino¹, Beatrice Zanchi², Luigi Fiorillo³, Giulio Conte⁴, Angelo Auricchio⁴, Athina Tzovara⁵, Francesca Dalia Faraci³

Affiliations

¹ Institute of Digital Technologies for Personalized Healthcare - MeDiTech, Department of Innovative Technologies, University of Applied Sciences and Arts of Southern Switzerland, Via la Santa 1, Lugano 6900, Switzerland; Institute of Informatics, University of Bern, Neubrückstrasse 10, Bern 3012, Switzerland.
² Institute of Digital Technologies for Personalized Healthcare - MeDiTech, Department of Innovative Technologies, University of Applied Sciences and Arts of Southern Switzerland, Via la Santa 1, Lugano 6900, Switzerland; Department of Quantitative Biomedicine, University of Zurich, Schmelzbergstrasse 26, Zurich 8091, Switzerland.
³ Institute of Digital Technologies for Personalized Healthcare - MeDiTech, Department of Innovative Technologies, University of Applied Sciences and Arts of Southern Switzerland, Via la Santa 1, Lugano 6900, Switzerland.
⁴ Division of Cardiology, Fondazione Cardiocentro Ticino, Via Tesserete 48, Lugano 6900, Switzerland; Centre for Computational Medicine in Cardiology, Faculty of Informatics, Università della Svizzera Italiana, Via la Santa 1, Lugano 6900, Switzerland.
⁵ Institute of Informatics, University of Bern, Neubrückstrasse 10, Bern 3012, Switzerland; Sleep Wake Epilepsy Center | NeuroTec, Department of Neurology, Inselspital, Bern University Hospital, University of Bern, Freiburgstrasse 16, Bern 3010, Switzerland.

PMID: 37976830
DOI: 10.1016/j.compbiomed.2023.107655

Abstract

Large, high-quality datasets are essential for building powerful artificial intelligence (AI) algorithms capable of supporting advances in cardiac clinical research. However, researchers working with electrocardiogram (ECG) signals struggle to access such datasets or to build their own. The aim of the present work is to shed light on a potential solution to the lack of large and easily accessible ECG datasets. First, the main causes of this lack are identified and examined. Then, the potential and limitations of cardiac data generation via deep generative models (DGMs) are analyzed in depth. These very promising algorithms have been found capable not only of generating large quantities of ECG signals but also of supporting data anonymization, which simplifies data sharing while respecting patients' privacy. Their application could help research progress and cooperation in the name of open science. However, several aspects, such as standardized evaluation of synthetic data quality and algorithm stability, need to be explored further.

Keywords: Anonymization; Data augmentation; Data scarcity; Data sharing; Deep generative models; Diffusion models; ECG synthesis; GAN; Open science; Variational autoencoders.

Publication types

Review
Research Support, Non-U.S. Gov't

MeSH terms

Algorithms
Artificial Intelligence*
Data Accuracy
Electrocardiography*
Heart
Humans