Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Meta-Analysis
. 2019 Jul;300(1):7-14.
doi: 10.1007/s00404-019-05151-7. Epub 2019 May 3.

Use of artificial intelligence (AI) in the interpretation of intrapartum fetal heart rate (FHR) tracings: a systematic review and meta-analysis

Affiliations
Meta-Analysis

Use of artificial intelligence (AI) in the interpretation of intrapartum fetal heart rate (FHR) tracings: a systematic review and meta-analysis

Jacques Balayla et al. Arch Gynecol Obstet. 2019 Jul.

Abstract

Objectives: To determine the degree of inter-rater reliability (IRR) between human and artificial intelligence (AI) interpretation of fetal heart rate tracings (FHR), and to determine whether AI-assisted electronic fetal monitoring interpretation improves neonatal outcomes amongst laboring women.

Data sources: We searched Medline, EMBASE, Google Scholar, Scopus, ISI Web of Science and Cochrane database search, as well as PubMed ( www.pubmed.gov ) and RCT registry ( www.clinicaltrials.gov ) until the end of October 2018 to conduct a systematic review and meta-analysis comparing visual and AI interpretation of EFM in labor. Similarly, we sought out all studies evaluating the IRR between AI and expert interpretation of EFM.

Tabulation, integration and results: Weighed mean Cohen's Kappa was calculated to assess the global IRR. Risk of bias was assessed using the Cochrane Handbook for Systematic Reviews of Interventions. We used relative risks (RR) and a random effects (RE) model to calculate weighted estimates. Statistical homogeneity was checked by the χ2 test and I2 using Review Manager 5.3.5 (The Cochrane Collaboration, 2014.) We obtained 201 records, of which 9 met inclusion criteria. Three RCT's were used to compare the neonatal outcomes and 6 cohort studies were used to establish the degree of IRR between both approaches of EFM evaluation. With regards to the neonatal outcomes, a total of 55,064 patients were included in the analysis. Relative to the use of clinical (visual) evaluation of the FHR, the use of AI did not change the incidence rates of neonatal acidosis, cord pH below < 7.20, 5-min APGAR scores < 7, mode of delivery, NICU admission, neonatal seizures, or perinatal death. With regards to the degrees of inter-rater reliability, a weighed mean Cohen's Kappa of 0.49 [0.32-0.66] indicates moderate agreement between expert observers and computerized systems.

Conclusion: The use of AI and computer analysis for the interpretation of EFM during labor does not improve neonatal outcomes. Inter-rater reliability between experts and computer systems is moderate at best. Future studies should aim at further elucidating these findings.

Keywords: Artificial intelligence; Computer; Fetal heart rate; Fetal monitoring; Inter-rater reliability; Neonatal outcomes.

PubMed Disclaimer

LinkOut - more resources