Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
[Preprint]. 2023 Apr 10:rs.3.rs-2635419.
doi: 10.21203/rs.3.rs-2635419/v1.

A Comprehensive and Bias-Free Machine Learning Approach for Risk Prediction of Preeclampsia with Severe Features in a Nulliparous Study Cohort

Affiliations

A Comprehensive and Bias-Free Machine Learning Approach for Risk Prediction of Preeclampsia with Severe Features in a Nulliparous Study Cohort

Yun Lin et al. Res Sq. .

Update in

Abstract

Objective: Preeclampsia is one of the leading causes of maternal morbidity, with consequences during and after pregnancy. Because of its diverse clinical presentation, preeclampsia is an adverse pregnancy outcome that is uniquely challenging to predict and manage. In this paper, we developed machine learning models that predict the onset of preeclampsia with severe features or eclampsia at discrete time points in a nulliparous pregnant study cohort.

Materials and methods: The prospective study cohort to which we applied machine learning is the Nulliparous Pregnancy Outcomes Study: Monitoring Mothers-to-be (nuMoM2b) study, which contains information from eight clinical sites across the US. Maternal serum samples were collected for 1,857 individuals between the first and second trimesters. These patients with serum samples collected are selected as the final cohort.

Results: Our prediction models achieved an AUROC of 0.72 (95% CI, 0.69-0.76), 0.75 (95% CI, 0.71-0.79), and 0.77 (95% CI, 0.74-0.80), respectively, for the three visits. Our initial models were biased toward non-Hispanic black participants with a high predictive equality ratio of 1.31. We corrected this bias and reduced this ratio to 1.14. The top features stress the importance of using several tests, particularly for biomarkers and ultrasound measurements. Placental analytes were strong predictors for screening for the early onset of preeclampsia with severe features in the first two trimesters.

Conclusion: Experiments suggest that it is possible to create racial bias-free early screening models to predict the patients at risk of developing preeclampsia with severe features or eclampsia nulliparous pregnant study cohort.

PubMed Disclaimer

Conflict of interest statement

CONFLICTS OF INTEREST All authors declare no financial or non-financial competing interests.

Figures

Figure 1
Figure 1. Data process timeline
This figure shows the gestational weeks at each visit. For each visit, the number of features at that visit is listed and the category of new feature included is also shown. 1 Specific medical condition can be found in Table Supplement 5 2 Specific placental analytes can be found in Table Supplement 3
Figure 2
Figure 2. Final study cohort selection process.
Out of the participants from the placental analytes sub-study, we excluded participants with conditions such as chronic hypertension, mild preeclampsia, and missing label for preeclampsia to focus on the participants that are most at risk.
Figure 3
Figure 3. The training process of PEPrML pipeline.
Samples were balanced for train and test sets. 5-fold grid search cross-validation was used to select the hyperparameters for each trial. We repeated 100 trials and recorded the results.
Figure 4
Figure 4. sPE+E vs NPH and early sPE vs late sPE+E model performance.
a) Average AUC for 100 trials per visit for 4 classifiers. b) RF classifier has best performance across visits for both comparisons. The ROC curve demonstrated the tradeoff between the true positive rate versus false positive rate. This summarizes the results for 100 trials.
Figure 6
Figure 6. Fairness check for sPE+E vs NPH mode
The threshold set based on the four-fifth rule are 0.8 and 1.25. Ceribus Paribus plot was used to adjust prediction threshold for the Black population.

Similar articles

References

    1. Lockwood CJ, Moore T, Copel J, Silver RM, Resnik R. Creasy and Resnik’s Maternal-Fetal Medicine: Principles and Practice. 9th ed. (Dugoff L, Louis J, eds.). Elsevier - Health Sciences Division; 2022.
    1. Poon LC, Nicolaides KH. Early prediction of preeclampsia. Obstetrics and Gynecology International. 2014;2014. - PMC - PubMed
    1. Wójtowicz A, Zembala-Szczerba M, Babczyk D, Kołodziejczyk-Pietruszka M, Lewaczyńska O, Huras H. Early- and Late-Onset Preeclampsia: A Comprehensive Cohort Study of Laboratory and Clinical Findings according to the New ISHHP Criteria. International Journal of Hypertension. 2019;2019:1–9. - PMC - PubMed
    1. Sroka D, Verlohren S. Short Term Prediction of Preeclampsia, 2021.
    1. Facco FL, Lappen J, Lim C, Zee PC, Grobman WA. Preeclampsia and sleep-disordered breathing: A case-control study. Pregnancy Hypertension: An International Journal of Women's Cardiovascular Health. April 2013;3:133–139. - PMC - PubMed

Publication types

LinkOut - more resources