Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Nov 22;25(1):bbad492.
doi: 10.1093/bib/bbad492.

Identify gestational diabetes mellitus by deep learning model from cell-free DNA at the early gestation stage

Affiliations

Identify gestational diabetes mellitus by deep learning model from cell-free DNA at the early gestation stage

Yipeng Wang et al. Brief Bioinform. .

Abstract

Gestational diabetes mellitus (GDM) is a common complication of pregnancy, which has significant adverse effects on both the mother and fetus. The incidence of GDM is increasing globally, and early diagnosis is critical for timely treatment and reducing the risk of poor pregnancy outcomes. GDM is usually diagnosed and detected after 24 weeks of gestation, while complications due to GDM can occur much earlier. Copy number variations (CNVs) can be a possible biomarker for GDM diagnosis and screening in the early gestation stage. In this study, we proposed a machine-learning method to screen GDM in the early stage of gestation using cell-free DNA (cfDNA) sequencing data from maternal plasma. Five thousand and eighty-five patients from north regions of Mainland China, including 1942 GDM, were recruited. A non-overlapping sliding window method was applied for CNV coverage screening on low-coverage (~0.2×) sequencing data. The CNV coverage was fed to a convolutional neural network with attention architecture for the binary classification. The model achieved a classification accuracy of 88.14%, precision of 84.07%, recall of 93.04%, F1-score of 88.33% and AUC of 96.49%. The model identified 2190 genes associated with GDM, including DEFA1, DEFA3 and DEFB1. The enriched gene ontology (GO) terms and KEGG pathways showed that many identified genes are associated with diabetes-related pathways. Our study demonstrates the feasibility of using cfDNA sequencing data and machine-learning methods for early diagnosis of GDM, which may aid in early intervention and prevention of adverse pregnancy outcomes.

Keywords: cell-free DNA; copy number variations; deep learning; gestational diabetes mellitus.

PubMed Disclaimer

Figures

Figure 1
Figure 1
An overview of the GDM classification model. (A) The pipeline for processing sequencing data to generate count matrix for deep learning model. (B) An example for the count matrix constructure. (C) The deep learning architecture of our model.
Figure 2
Figure 2
Clinical information of the collected samples. (A) The age distribution of the normal group and GDM group. (B) The weight distribution of the normal group and GDM group. (C) The sample time distribution of the normal and GDM group. (D) PCA scatter plot of normal samples and GDM samples.
Figure 3
Figure 3
Copy number diversity on different window size along the whole chromosome.
Figure 4
Figure 4
Model learning performance on different window size. (AD) Learning accuracy curve for window size 50, 20, 10 and 5 k, respectively. (EH) Learning loss curve for window size 50, 20, 10 and 5 k, respectively.
Figure 5
Figure 5
Model performance on test dataset. (AD) ROC curve for window size 50, 20, 10 and 5 k, respectively. The black line represents the ROC curve generated by the test dataset, while the gray lines represent the ROCs obtained from cross-validation. (E) The AUC distribution of 10 k window size on 10 000 bootstrapping experiments. (F) The AUCs of variant window sizes on bootstrapping experiments.
Figure 6
Figure 6
The results of GO term and KEGG pathway enrichment of annotated genes. (A) Enriched GO terms with adjusted P-value smaller than 0.05. (B) Enriched KEGG pathways with adjusted P-value smaller than 0.05.

Similar articles

Cited by

References

    1. McIntyre HD, Catalano P, Zhang C, et al. Gestational diabetes mellitus. Nat Rev Dis Primers 2019;5(1):47. - PubMed
    1. Monteiro LJ, Norman JE, Rice GE, Illanes SE. Fetal programming and gestational diabetes mellitus. Placenta 2016;48:S54–60. - PubMed
    1. Pinney SE, Simmons RA. Metabolic programming, epigenetics, and gestational diabetes mellitus. Curr Diab Rep 2012;12:67–74. - PubMed
    1. Alejandro EU, Mamerto TP, Chung G, et al. Gestational diabetes mellitus: a harbinger of the vicious cycle of diabetes. Int J Mol Sci 2020;21(14):5003. - PMC - PubMed
    1. Hod M, Kapur A, Sacks DA, et al. The International Federation of Gynecology and Obstetrics (FIGO) initiative on gestational diabetes mellitus: a pragmatic guide for diagnosis, management, and care. Int J Gynaecol Obstet 2015;131:S173–211. - PubMed

Publication types