Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Sep 11;19(1):185.
doi: 10.1186/s12911-019-0899-4.

Risk factor analysis of device-related infections: value of re-sampling method on the real-world imbalanced dataset

Affiliations

Risk factor analysis of device-related infections: value of re-sampling method on the real-world imbalanced dataset

Xiang-Fei Feng et al. BMC Med Inform Decis Mak. .

Abstract

Background: The incidence of cardiac implantable electronic device infection (CIEDI) is low and usually belongs to the typical imbalanced dataset. We sought to describe our experience on the management of the imbalanced CIEDI dataset.

Methods: Database from two centers of patients undergoing device implantation from 2001 to 2016 were reviewed retrospectively. Re-sampling technique was used to improve the classifier accuracy.

Results: CIEDI was identified in 28 out of 4959 procedures (0.56%); a high imbalance existed in the sizes of the patient profiles. In univariate analyses, replacement procedure and male were significantly associated with an increase in CIEDI: (53.6% vs. 23.4, 0.8% vs. 0.3%, P < 0.01). Multivariate logistic regression analysis showed that gender (odds ratio, OR = 3.503), age (OR = 1.032), replacement procedure (OR = 3.503), and use of antibiotics (OR = 0.250) remained as independent predictors of CIEDI (all P < 0.05) after adjustment for diabetes, post-operation fever, and device style, device company. There were 616 under-sampled cases and 123 over-sampled cases in the analyzed cohort after re-sampling. The re-sampling and bootstrap results were robust and largely like the analysis results prior re-sampling method, while use of antibiotics lost the predicting capacity for CIEDI after re-sampling technique (P > 0.05).

Conclusion: The application of re-sampling techniques can generate useful synthetic samples for the classification of imbalanced data and improve the accuracy of predicting efficacy of CIEDI. The peri-operative assessment should be intensified in male and aged patients as well as patients receiving replacement procedures for the risk of CIEDI.

Keywords: Device; Implantation; Infection; Re-sampling technique; Risk factors.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

Fig. 1
Fig. 1
Study design and flow chart
Fig. 2
Fig. 2
Comparison in ROC curves and AUCs between before (left) and after (right) re-sampling in prediction of CIEDI with similar variables. As shown in Figure, whether before or after re-sampling, replacement procedure, gender, and age all had large AUC. This suggests potential roles of them in prediction of patients at risks for CIEDI

References

    1. Traykov V, Bongiorni MG, Boriani G, Burri H, Costa R, Dagres N, Deharo JC, Epstein LM, Erba PA, Snygg-Martin U, et al. Clinical practice and implementation of guidelines for the prevention, diagnosis and management of cardiac implantable electronic device infections: results of a worldwide survey under the auspices of the European heart rhythm association. Europace. 2019;21(8):1270–9. doi: 10.1093/europace/euz137. - DOI - PubMed
    1. Oh S, Lee MS, Zhang BT. Ensemble learning with active example selection for imbalanced biomedical data classification. IEEE/ACM Trans Comput Biol Bioinform. 2011;8(2):316–325. doi: 10.1109/TCBB.2010.96. - DOI - PubMed
    1. King G, Zeng L. Logistic regression in rare events data. Polit Anal. 2001;9(2):137–163. doi: 10.1093/oxfordjournals.pan.a004868. - DOI
    1. Ranjan C, Mustonen M, Paynabar K, Pourak K. Dataset: rare event classification in multivariate time series. arXiv. 2018;1809:10717.
    1. Santos MS, Abreu PH, Garcia-Laencina PJ, Simao A, Carvalho A. A new cluster-based oversampling method for improving survival prediction of hepatocellular carcinoma patients. J Biomed Inform. 2015;58(12):49–59. doi: 10.1016/j.jbi.2015.09.012. - DOI - PubMed

Publication types

LinkOut - more resources