Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jul 30;37(8):1746-1759.
doi: 10.1093/humrep/deac131.

Development of an artificial intelligence model for predicting the likelihood of human embryo euploidy based on blastocyst images from multiple imaging systems during IVF

Affiliations

Development of an artificial intelligence model for predicting the likelihood of human embryo euploidy based on blastocyst images from multiple imaging systems during IVF

S M Diakiw et al. Hum Reprod. .

Abstract

Study question: Can an artificial intelligence (AI) model predict human embryo ploidy status using static images captured by optical light microscopy?

Summary answer: Results demonstrated predictive accuracy for embryo euploidy and showed a significant correlation between AI score and euploidy rate, based on assessment of images of blastocysts at Day 5 after IVF.

What is known already: Euploid embryos displaying the normal human chromosomal complement of 46 chromosomes are preferentially selected for transfer over aneuploid embryos (abnormal complement), as they are associated with improved clinical outcomes. Currently, evaluation of embryo genetic status is most commonly performed by preimplantation genetic testing for aneuploidy (PGT-A), which involves embryo biopsy and genetic testing. The potential for embryo damage during biopsy, and the non-uniform nature of aneuploid cells in mosaic embryos, has prompted investigation of additional, non-invasive, whole embryo methods for evaluation of embryo genetic status.

Study design, size, duration: A total of 15 192 blastocyst-stage embryo images with associated clinical outcomes were provided by 10 different IVF clinics in the USA, India, Spain and Malaysia. The majority of data were retrospective, with two additional prospectively collected blind datasets provided by IVF clinics using the genetics AI model in clinical practice. Of these images, a total of 5050 images of embryos on Day 5 of in vitro culture were used for the development of the AI model. These Day 5 images were provided for 2438 consecutively treated women who had undergone IVF procedures in the USA between 2011 and 2020. The remaining images were used for evaluation of performance in different settings, or otherwise excluded for not matching the inclusion criteria.

Participants/materials, setting, methods: The genetics AI model was trained using static 2-dimensional optical light microscope images of Day 5 blastocysts with linked genetic metadata obtained from PGT-A. The endpoint was ploidy status (euploid or aneuploid) based on PGT-A results. Predictive accuracy was determined by evaluating sensitivity (correct prediction of euploid), specificity (correct prediction of aneuploid) and overall accuracy. The Matthew correlation coefficient and receiver-operating characteristic curves and precision-recall curves (including AUC values), were also determined. Performance was also evaluated using correlation analyses and simulated cohort studies to evaluate ranking ability for euploid enrichment.

Main results and the role of chance: Overall accuracy for the prediction of euploidy on a blind test dataset was 65.3%, with a sensitivity of 74.6%. When the blind test dataset was cleansed of poor quality and mislabeled images, overall accuracy increased to 77.4%. This performance may be relevant to clinical situations where confounding factors, such as variability in PGT-A testing, have been accounted for. There was a significant positive correlation between AI score and the proportion of euploid embryos, with very high scoring embryos (9.0-10.0) twice as likely to be euploid than the lowest-scoring embryos (0.0-2.4). When using the genetics AI model to rank embryos in a cohort, the probability of the top-ranked embryo being euploid was 82.4%, which was 26.4% more effective than using random ranking, and ∼13-19% more effective than using the Gardner score. The probability increased to 97.0% when considering the likelihood of one of the top two ranked embryos being euploid, and the probability of both top two ranked embryos being euploid was 66.4%. Additional analyses showed that the AI model generalized well to different patient demographics and could also be used for the evaluation of Day 6 embryos and for images taken using multiple time-lapse systems. Results suggested that the AI model could potentially be used to differentiate mosaic embryos based on the level of mosaicism.

Limitations, reasons for caution: While the current investigation was performed using both retrospectively and prospectively collected data, it will be important to continue to evaluate real-world use of the genetics AI model. The endpoint described was euploidy based on the clinical outcome of PGT-A results only, so predictive accuracy for genetic status in utero or at birth was not evaluated. Rebiopsy studies of embryos using a range of PGT-A methods indicated a degree of variability in PGT-A results, which must be considered when interpreting the performance of the AI model.

Wider implications of the findings: These findings collectively support the use of this genetics AI model for the evaluation of embryo ploidy status in a clinical setting. Results can be used to aid in prioritizing and enriching for embryos that are likely to be euploid for multiple clinical purposes, including selection for transfer in the absence of alternative genetic testing methods, selection for cryopreservation for future use or selection for further confirmatory PGT-A testing, as required.

Study funding/competing interest(s): Life Whisperer Diagnostics is a wholly owned subsidiary of the parent company, Presagen Holdings Pty Ltd. Funding for the study was provided by Presagen with grant funding received from the South Australian Government: Research, Commercialisation, and Startup Fund (RCSF). 'In kind' support and embryology expertise to guide algorithm development were provided by Ovation Fertility. 'In kind' support in terms of computational resources provided through the Amazon Web Services (AWS) Activate Program. J.M.M.H., D.P. and M.P. are co-owners of Life Whisperer and Presagen. S.M.D., M.A.D. and T.V.N. are employees or former employees of Life Whisperer. S.M.D, J.M.M.H, M.A.D, T.V.N., D.P. and M.P. are listed as inventors of patents relating to this work, and also have stock options in the parent company Presagen. M.V. sits on the advisory board for the global distributor of the technology described in this study and also received support for attending meetings.

Trial registration number: N/A.

Keywords: ICSI outcome; IVF; PGT-A; artificial intelligence; assisted reproduction; embryo quality; genetics; machine learning; preimplantation genetic testing for aneuploidy.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Performance of the Day 5 artificial intelligence (AI) algorithm for predicting the likelihood of human embryo euploidy on uncleansed and cleansed blind test datasets. (A) Confusion matrices depicting true positives (TP), false positives (FP), false negatives (FN) and true negatives (TN) for the Day 5 AI model predicting embryo euploid status. Matrices are shown for uncleansed (top panel) and cleansed (bottom panel) blind test datasets. (B) Receiver-operating characteristic (ROC) curves for uncleansed (top panel) and untrainable data cleansing (UDC)-cleansed (bottom panel) Day 5 blind test datasets. The AUC values are depicted. (C) Precision-recall curves (PRC) for uncleansed (top panel) and UDC-cleansed (bottom panel) Day 5 blind test datasets. The AUC values are depicted. (D) The correlation between the genetics AI score and the proportion of euploid embryos was evaluated using four defined euploid likelihood categories as depicted. The statistical method used was Chi-square test for trend (df, degrees of freedom).
Figure 2.
Figure 2.
Correlations between the Day 5 genetics artificial intelligence (AI) score and level of mosaicism, monosomic abnormalities, and performance on Day 6 human embryos. (A) Correlation between average genetics AI score and embryos based on ploidy status, including euploid, aneuploid, or mosaic status. (B) Correlation between average AI score and embryo ploidy status, separating mosaic embryos according to level of mosaicism. (C) The correlation between the AI score and the proportion of euploid embryos was evaluated using euploid likelihood categories on a dataset of images taken of blastocyst-stage embryos on Day 6 of in vitro culture. (D) Average genetics AI score in embryos with monosomic or trisomic changes compared to euploid embryos. (E) Correlation between AI score and the proportion of embryos with monosomic changes in different AI score categories. Average AI scores were compared using one-way ANOVA with Tukey’s multiple comparisons post-test (Student’s t-test was used to compare monosomic with trisomic changes), and Chi-square test for trend was used where indicated (df = degrees of freedom). P-values are represented as follows: *P <0.05, ***P <0.001.
Figure 3.
Figure 3.
Performance of the Day 5 genetics artificial intelligence (AI) model in different demographics and using time-lapse images. (A) The correlation between the AI score and the proportion of human euploid embryos was evaluated using euploid likelihood categories on a double-blind test dataset of images from a clinic in India. (B) The correlation between AI score and the proportion of euploid embryos evaluated on a double-blind test dataset of images taken using the GERI time-lapse imaging system by three clinics in Spain. (C) The correlation between AI score and the proportion of euploid embryos evaluated on a blind test dataset of images taken using the EmbryoScope time-lapse imaging system by a clinic in Malaysia. The statistical method used was Chi-square test for trend (df, degrees of freedom).

Similar articles

Cited by

References

    1. Abhari S, Kawwass JF.. Pregnancy and neonatal outcomes after transfer of mosaic embryos: a review. J Clin Med 2021;10:1369. - PMC - PubMed
    1. Bouba I, Hatzi E, Ladias P, Sakaloglou P, Kostoulas C, Georgiou I.. Biological and Clinical Significance of Mosaicism in Human Preimplantation Embryos. J Dev Biol 2021;9:18. - PMC - PubMed
    1. Capalbo A, Rienzi L.. Mosaicism between trophectoderm and inner cell mass. Fertil Steril 2017;107:1098–1106. - PubMed
    1. Capalbo A, Rienzi L, Cimadomo D, Maggiulli R, Elliott T, Wright G, Nagy ZP, Ubaldi FM.. Correlation between standard blastocyst morphology, euploidy and implantation: an observational study in two centers involving 956 screened blastocysts. Hum Reprod 2014;29:1173–1181. - PubMed
    1. Chavez-Badiola A, Flores-Saiffe-Farías A, Mendizabal-Ruiz G, Drakeley AJ, Cohen J.. Embryo Ranking Intelligent Classification Algorithm (ERICA): artificial intelligence clinical assistant predicting embryo ploidy and implantation. Reprod Biomed Online 2020;41:585–593. - PubMed

Publication types