TY - JOUR
T1 - Deep representation learning of electrocardiogram reveals biological insights in cardiac phenotypes and cardiovascular diseases
AU - Yeung, Ming Wai
AU - van de Leur, Rutger R.
AU - Benjamins, Jan Walter
AU - Vessies, Melle B.
AU - Ruijsink, Bram
AU - Puyol-Antón, Esther
AU - van Tintelen, J. Peter
AU - Verweij, Niek
AU - van Es, René
AU - van der Harst, Pim
N1 - Publisher Copyright:
© 2025 The Author(s)
PY - 2025/8/15
Y1 - 2025/8/15
N2 - Conventional approaches to analyzing electrocardiograms (ECG) in discrete parameters (such as the PR interval) ignored the high dimensionality of data omitted subtle but relevant information. We applied a variational auto-encoder to learn the underlying distributions of the ECG of 41,927 UK Biobank participants, generating 32-dimensional representation (latent factors). The latent factors showed correlations to conventional ECG parameters and strong associations to cardiac phenotypes estimated from magnetic resonance imaging. We found definitive associations of the latent factors to conduction, rhythm, and structural disorders (all p < 4.51 × 10−308) and additionally value in mortality prediction. Genome wide association study (GWAS) of the latent factors, revealed 170 genetic loci with 29 not previously associated with electrocardiographic phenotypes. Further characterization of the genetic signals suggested involvement in cardiac development, contractility, and electrophysiology. Our results supported that the deep representation learning of 12-lead ECG could provide clinically meaningful and interpretable insights into cardiovascular biology and health.
AB - Conventional approaches to analyzing electrocardiograms (ECG) in discrete parameters (such as the PR interval) ignored the high dimensionality of data omitted subtle but relevant information. We applied a variational auto-encoder to learn the underlying distributions of the ECG of 41,927 UK Biobank participants, generating 32-dimensional representation (latent factors). The latent factors showed correlations to conventional ECG parameters and strong associations to cardiac phenotypes estimated from magnetic resonance imaging. We found definitive associations of the latent factors to conduction, rhythm, and structural disorders (all p < 4.51 × 10−308) and additionally value in mortality prediction. Genome wide association study (GWAS) of the latent factors, revealed 170 genetic loci with 29 not previously associated with electrocardiographic phenotypes. Further characterization of the genetic signals suggested involvement in cardiac development, contractility, and electrophysiology. Our results supported that the deep representation learning of 12-lead ECG could provide clinically meaningful and interpretable insights into cardiovascular biology and health.
KW - Artificial intelligence
KW - Cardiovascular medicine
UR - https://www.scopus.com/pages/publications/105012963567
U2 - 10.1016/j.isci.2025.113226
DO - 10.1016/j.isci.2025.113226
M3 - Article
AN - SCOPUS:105012963567
VL - 28
JO - iScience
JF - iScience
IS - 8
M1 - 113226
ER -