. 2022 Sep 1;22(1):228.

doi: 10.1186/s12911-022-01974-8.

Developing machine learning-based models to predict intrauterine insemination (IUI) success by address modeling challenges in imbalanced data and providing modification solutions for them

Sajad Khodabandelu^#¹, Zahra Basirat^#², Sara Khaleghi¹, Soraya Khafri³, Hussain Montazery Kordy⁴, Masoumeh Golsorkhtabaramiri²

Affiliations

¹ Student Research Committee, Babol University of Medical Sciences, Babol, Iran.
² Infertility and Reproductive Health Research Center, Health Research Institute, Babol University of Medical Sciences, Babol, Iran.
³ Infertility and Reproductive Health Research Center, Health Research Institute, Babol University of Medical Sciences, Babol, Iran. khafri@yahoo.com.
⁴ Faculty of Electrical and Computer Engineering, Babol Noshirvani University of Technology, Babol, Iran.

^# Contributed equally.

PMID: 36050710
PMCID: PMC9434923
DOI: 10.1186/s12911-022-01974-8

Developing machine learning-based models to predict intrauterine insemination (IUI) success by address modeling challenges in imbalanced data and providing modification solutions for them

Sajad Khodabandelu et al. BMC Med Inform Decis Mak. 2022.

. 2022 Sep 1;22(1):228.

doi: 10.1186/s12911-022-01974-8.

Authors

Sajad Khodabandelu^#¹, Zahra Basirat^#², Sara Khaleghi¹, Soraya Khafri³, Hussain Montazery Kordy⁴, Masoumeh Golsorkhtabaramiri²

Affiliations

¹ Student Research Committee, Babol University of Medical Sciences, Babol, Iran.
² Infertility and Reproductive Health Research Center, Health Research Institute, Babol University of Medical Sciences, Babol, Iran.
³ Infertility and Reproductive Health Research Center, Health Research Institute, Babol University of Medical Sciences, Babol, Iran. khafri@yahoo.com.
⁴ Faculty of Electrical and Computer Engineering, Babol Noshirvani University of Technology, Babol, Iran.

^# Contributed equally.

PMID: 36050710
PMCID: PMC9434923
DOI: 10.1186/s12911-022-01974-8

Abstract

Background: This study sought to provide machine learning-based classification models to predict the success of intrauterine insemination (IUI) therapy. Additionally, we sought to illustrate the effect of models fitting with balanced data vs original data with imbalanced data labels using two different types of resampling methods. Finally, we fit models with all features against optimized feature sets using various feature selection techniques.

Methods: The data for the cross-sectional study were collected from 546 infertile couples with IUI at the Fatemehzahra Infertility Research Center, Babol, North of Iran. Logistic regression (LR), support vector classification, random forest, Extreme Gradient Boosting (XGBoost) and, Stacking generalization (Stack) as the machine learning classifiers were used to predict IUI success by Python v3.7. We employed the Smote-Tomek (Stomek) and Smote-ENN (SENN) resampling methods to address the imbalance problem in the original dataset. Furthermore, to increase the performance of the models, mutual information classification (MIC-FS), genetic algorithm (GA-FS), and random forest (RF-FS) were used to select the ideal feature sets for model development.

Results: In this study, 28% of patients undergoing IUI treatment obtained a successful pregnancy. Also, the average age of women and men was 24.98 and 29.85 years, respectively. The calibration plot in this study for IUI success prediction by machine learning models showed that between feature selection methods, the RF-FS, and among the datasets used to fit the models, the balanced dataset with the Stomek method had well-calibrating predictions than other methods. Finally, the brier scores for the LR, SVC, RF, XGBoost, and Stack models that were fitted utilizing the Stomek dataset and the chosen feature set using the Random Forest technique obtained equal to 0.202, 0.183, 0.158, 0.129, and 0.134, respectively. It showed duration of infertility, male and female age, sperm concentration, and sperm motility grading score as the most predictable factors in IUI success.

Conclusion: The results of this study with the XGBoost prediction model can be used to foretell the individual success of IUI for each couple before initiating therapy.

Keywords: Cumulative live birth; Imbalanced data; Infertility; Intrauterine insemination; Machine learning.

PubMed Disclaimer

Conflict of interest statement

None declared.

Figures

**Fig. 1**
Modeling steps with Python in this study

**Fig. 2**
Boxplot for G-means index, for each model. a: d show plots related to the feature selection methods. Abbreviations: RS method: Resampling method

**Fig. 3**
ROC curve and AUC index of each class by different models. Each row by 1: 4 numbers show graphs for each feature selection method and Columns a: c show plots related to the data used to model training

**Fig. 4**
Reliability and predictive power of each class by different model. Each row by 1: 4 numbers show graphs for each feature selection method; 1) Without feature selection (W_FS), 2) Mutual Information Classification feature selection (MIC-FS), 3) genetic algorithm feature selection (GA-FS), and 4) random forest feature selection (RF-FS), and Columns a: c show plots related to the data used to model training

**Fig. 5**
Boxplot, calibration plot, and ROC curve for trained models with random forest- selected features from the Stomek-balanced dataset

**Fig. 6**
Ranking of features used in XGBoost based on the effect on model learning and prediction

See this image and copyright information in PMC

Cited by

Diagnostic value of oxidation-reduction potential for male infertility: a systematic review and meta-analysis.
Tan Y, Yuan Y, Yang X, Wang Y, Liu L. Tan Y, et al. Transl Androl Urol. 2024 Jul 31;13(7):1228-1238. doi: 10.21037/tau-24-32. Epub 2024 Jul 16. Transl Androl Urol. 2024. PMID: 39100838 Free PMC article.
Criteria for implementing artificial intelligence systems in reproductive medicine.
Güell E. Güell E. Clin Exp Reprod Med. 2024 Mar;51(1):1-12. doi: 10.5653/cerm.2023.06009. Epub 2023 Dec 1. Clin Exp Reprod Med. 2024. PMID: 38035589 Free PMC article.
Artificial Intelligence for Clinical Management of Male Infertility, a Scoping Review.
Naik N, Roth B, Lundy SD. Naik N, et al. Curr Urol Rep. 2024 Nov 9;26(1):17. doi: 10.1007/s11934-024-01239-z. Curr Urol Rep. 2024. PMID: 39520645 Free PMC article.
The impact of adenomyosis on intrauterine insemination success in unexplained infertile women: a retrospective cross-sectional study.
Arlıer S, Kükrer S, Adıgüzel FI, Nessar AZ, Uysal G, Adıgüzel C, Kaplanoğlu DK. Arlıer S, et al. BMC Pregnancy Childbirth. 2025 Jun 3;25(1):650. doi: 10.1186/s12884-025-07769-9. BMC Pregnancy Childbirth. 2025. PMID: 40462001 Free PMC article.
An Algorithm to Predict the Lack of Pregnancy after Intrauterine Insemination in Infertile Patients.
Garcia-Grau E, Oliveira M, Amengual MJ, Rodriguez-Sanchez E, Veraguas-Imbernon A, Costa L, Benet J, Ribas-Maynou J. Garcia-Grau E, et al. J Clin Med. 2023 Apr 30;12(9):3225. doi: 10.3390/jcm12093225. J Clin Med. 2023. PMID: 37176664 Free PMC article.

See all "Cited by" articles

References

1. Pan MM, Hockenberry MS, Kirby EW, Lipshultz LI. Male infertility diagnosis and treatment in the era of in vitro fertilization and intracytoplasmic sperm injection. Med Clin. 2018;102(2):337–347. - PubMed
1. Muthigi A, Jahandideh S, Bishop LA, Naeemi FK, Shipley SK, O’Brien JE, Shin PR, Devine K, Tanrikut C. Clarifying the relationship between total motile sperm counts and intrauterine insemination pregnancy rates. Fertil Steril. 2021;115(6):1454–1460. - PubMed
1. Merviel P, Labarre M, James P, Bouée S, Chabaud J-J, Roche S, Cabry R, Scheffler F, Lourdel E, Benkhalifa M. Should intrauterine inseminations still be proposed in cases of unexplained infertility? Retrospective study and literature review. Arch Gynecol Obstet. 2022;66:1–14. - PubMed
1. Nesbit CB, Blanchette-Porter M, Esfandiari N. Ovulation induction and intrauterine insemination in women of advanced reproductive age: a systematic review of the literature. J Assist Reprod Genet. 2022;66:1–47. - PMC - PubMed
1. Guzick DS, Carson SA, Coutifaris C, Overstreet JW, Factor-Litvak P, Steinkampf MP, Hill JA, Mastroianni L, Jr, Buster JE, Nakajima ST. Efficacy of superovulation and intrauterine insemination in the treatment of infertility. N Engl J Med. 1999;340(3):177–183. - PubMed

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Developing machine learning-based models to predict intrauterine insemination (IUI) success by address modeling challenges in imbalanced data and providing modification solutions for them

Affiliations

Developing machine learning-based models to predict intrauterine insemination (IUI) success by address modeling challenges in imbalanced data and providing modification solutions for them

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

MeSH terms

LinkOut - more resources

Full Text Sources