Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jul 3;112(7):1562-1579.
doi: 10.1016/j.ajhg.2025.05.015. Epub 2025 Jun 20.

Applying multimodal AI to physiological waveforms improves genetic prediction of cardiovascular traits

Affiliations

Applying multimodal AI to physiological waveforms improves genetic prediction of cardiovascular traits

Yuchen Zhou et al. Am J Hum Genet. .

Abstract

Electronic health records, biobanks, and wearable biosensors enable the collection of multiple health modalities from many individuals. Access to multimodal health data provides a unique opportunity for genetic studies of complex traits because different modalities relevant to a single physiological system (e.g., circulatory system) encode complementary and overlapping information. We propose a multimodal deep learning method, multimodal representation learning for genetic discovery on low-dimensional embeddings (M-REGLE), for discovering genetic associations from a joint representation of complementary electrophysiological waveform modalities. M-REGLE jointly learns a lower representation (i.e., latent factors) of multimodal physiological waveforms using a convolutional variational autoencoder, performs genome-wide association studies (GWASs) on each latent factor, then combines the results to study the genetics of the underlying system. To validate the advantages of M-REGLE and multimodal learning, we apply it to common cardiovascular modalities (photoplethysmogram [PPG] and electrocardiogram [ECG]) and compare its results to unimodal learning methods in which representations are learned from each data modality separately but are statistically combined for downstream genetic comparison. M-REGLE identifies 19.3% more loci on the 12-lead ECG dataset, 13.0% more loci on the ECG lead I + PPG dataset, and its genetic risk score significantly outperforms the unimodal risk score at predicting cardiac phenotypes, such as atrial fibrillation (Afib), in multiple biobanks.

Keywords: AI; GWAS; PRS; cardiovascular disease; deep learning; fusion model; multimodal; representation learning; variational autoencoder.

PubMed Disclaimer

Conflict of interest statement

Declaration of interests Y.Z., J.K., T.Y., H.Y., A.C., C.Y.M., B.B., and F.H. are current or former employees of Google, and own Alphabet Inc stocks. This study was funded by Google LLC. A.P.K. has acted as a paid consultant or lecturer to Abbvie, Aerie, Allergan, Google Health, Heidelberg Engineering, Novartis, Reichert, Santen, Thea, and Topcon. A.F.S. has received funding from New Amsterdam Pharma for an unrelated project.

Figures

None
Graphical abstract
Figure 1
Figure 1
Overview of M-REGLE (A) M-REGLE steps on lead I ECG and PPG, (B) U-REGLE steps on lead I ECG and PPG, (C) M-REGLE steps on 12-lead ECG, and (D) U-REGLE steps on 12-lead ECG. Step 1 in M-REGLE (A and C) obtains the raw embeddings from multimodal health data in a joint fit while step 1 in U-REGLE (B and D) obtains the raw embeddings for each modality separately. In step 2, to ensure completely uncorrelated embeddings, we applied PCA on the raw embeddings. Lastly, we ran GWAS on the uncorrelated embeddings and combined them.
Figure 2
Figure 2
M-REGLE embeddings improve cardiovascular trait prediction (A) Validation reconstruction losses in log scale of U-REGLE and M-REGLE on 12-lead ECG data. The x axis is the numbers of latent dimensions (1, 2, 4, 8, 16, 32, 64) per ECG lead. Standard errors (SEs) are too small to plot (see Table S9 for values of reconstruction losses and SEs. See Figure S25 for the normal scaled reconstruction loss plots.). (B) Validation reconstruction losses in log scale of M-REGLE and U-REGLE across numbers of latent dimensions on lead I ECG and PPG data. The x axis is the numbers of latent dimensions used for each modality (see Table S13 for values of M-REGLE and U-REGLE reconstruction losses and SEs). All the difference between M-REGLE and U-REGLE in (A) and (B) are significant. (C) AUROC prediction of nine phenotypes utilizing ElasticNet trained on the 12 embeddings obtained from ECG lead I and PPG. (D) AUPRC prediction of nine phenotypes utilizing ElasticNet trained on the 12 embeddings obtained from ECG lead I and PPG. Star sign indicates a statistically significant difference between the two methods using paired bootstrapping (100 repetitions) with 95% confidence.
Figure 3
Figure 3
The effect of perturbing distinctive M-REGLE embedding dimensions on decoded ECG and PPG waveforms from a healthy toward an unhealthy sample (A) The impact of the fourth dimension. As the embedding transitions from healthy to unhealthy, the decoded ECG (left) exhibits a prolonged QT interval, while the PPG (right) loses its notch. (B) The impact of the 10th dimension. A shift from healthy to unhealthy along this dimension results in a progressive decrease in ECG amplitude.
Figure 4
Figure 4
M-REGLE on 12 ECG leads increases genomic discovery (A and B) (A) Manhattan plot depicting M-REGLE GWAS p values for all 22 autosomal chromosomes. Black gene names indicate the closest gene for each locus with log10p>20. Purple dots denote the GWS loci uniquely detected by M-REGLE. Orange dots indicate loci also identified in U-REGLE. (B) Comparison of M-REGLE GWS variants in hits with U-REGLE. The x axis is the logp value of U-REGLE. The y axis is the logp value of the M-REGLE. All p values in (A) and (B) are computed by summing the chi-squared statistics for all 96 embeddings to perform a single joint chi-squared test. The vertical and horizontal red lines indicate the GWS level. The diagonal red line indicates y=x. The orange dots indicate variants in hits that are significant for U-REGLE but not significant for our M-REGLE and green dots indicate variants in hits that are significant for our M-REGLE but not significant for U-REGLE. (C) A three-way Venn diagram of the GWAS Catalog loci, loci discovered by M-REGLE, and loci discovered by U-REGLE. (D) Comparison of the chi-squared statistics for all known significant variants in GWAS Catalog for both U-REGLE and M-REGLE. The difference is statistically significant.
Figure 5
Figure 5
M-REGLE on ECG lead I and PPG increases genomic discovery (A) Manhattan plot depicting M-REGLE GWAS p values. Black gene names indicate the closest gene for each locus with log10p>20. Purple dots denote the GWS loci detected uniquely by M-REGLE. Orange dots indicate loci also identified in U-REGLE. (B) Comparison of M-REGLE GWS variants-in-hits with U-REGLE. The x axis is the logp value of baseline. The y axis is the logp value of the M-REGLE. All p values (A) and (B) are computed by summing the chi-squared statistics for all 12 embeddings to perform a single joint chi-squared test. The vertical and horizontal red lines indicate the GWS level. The diagonal red line indicates y=x. The orange dots indicate variants in hits that are significant for U-REGLE but not significant for our M-REGLE and green dots indicate variants in hits that are significant for our M-REGLE but not significant for baseline. (C) A three-way Venn diagram of the GWAS Catalog loci, loci discovered by M-REGLE, and loci discovered by U-REGLE. (D) Comparison of the chi-squared statistics for all known significant variants in GWAS Catalog for both U-REGLE and M-REGLE. The difference is statistically significant.
Figure 6
Figure 6
M-REGLE improves Afib genetic risk score (A) The x axis is genetic risk score percentile and y axis is the prevalence. Lower is better for the bottom percentiles; higher is better for the top percentiles. (B and C) (B) AUROC and (C) AUPRC (precision recall) Star sign indicates a statistically significant difference between the two methods using paired bootstrapping (100 repetitions) with 95% confidence.

Similar articles

References

    1. Elliott L.T., Sharp K., Alfaro-Almagro F., Shi S., Miller K.L., Douaud G., Marchini J., Smith S.M. Genome-wide association studies of brain imaging phenotypes in UK biobank. Nature. 2018;562:210–216. doi: 10.1038/s41586-018-0571-7. - DOI - PMC - PubMed
    1. Jónsson B., Bjornsdottir G., Thorgeirsson T., Ellingsen L.M., Walters G.B., Gudbjartsson D.F., Stefansson H., Stefansson K., Ulfarsson M.O. Brain age prediction using deep learning uncovers associated sequence variants. Nat. Commun. 2019;10:5409. doi: 10.1038/s41467-019-13163-9. - DOI - PMC - PubMed
    1. Verweij N., Benjamins J.-W., Morley M.P., van de Vegte Y.J., Teumer A., Trenkwalder T., Reinhard W., Cappola T.P., van der Harst P. The genetic makeup of the electrocardiogram. Cell Syst. 2020;11:229–238.e5. doi: 10.1016/j.cels.2020.08.005. - DOI - PMC - PubMed
    1. Alipanahi B., Hormozdiari F., Behsaz B., Cosentino J., McCaw Z.R., Schorsch E., Sculley D., Dorfman E.H., Foster P.J., Peng L.H., et al. Large-scale machine-learning-based phenotyping significantly improves genomic discovery for optic nerve head morphology. Am. J. Hum. Genet. 2021;108:1217–1230. doi: 10.1016/j.ajhg.2021.05.004. - DOI - PMC - PubMed
    1. Aung N., Vargas J.D., Yang C., Fung K., Sanghvi M.M., Piechnik S.K., Neubauer S., Manichaikul A., Rotter J.I., Taylor K.D., et al. Genome-wide association analysis reveals insights into the genetic architecture of right ventricular structure and function. Nat. Genet. 2022;54:783–791. doi: 10.1038/s41588-022-01083-2. - DOI - PMC - PubMed

LinkOut - more resources