Med Sci Monit. 2024 Jun 8:30:e943666.
doi: 10.12659/MSM.943666.

Machine Learning-Based Prediction of Helicobacter pylori Infection Study in Adults

Min Liu et al. Med Sci Monit. 2024.

Abstract

BACKGROUND Helicobacter pylori has a high infection rate worldwide, and epidemiological study of H. pylori is important. Artificial intelligence has been widely used in medical research and has become a hotspot in recent years. This paper proposes a machine learning-based prediction model for H. pylori infection in adults.

MATERIAL AND METHODS Adult patients were selected as research participants, and information on 30 factors was collected. The chi-square test, mutual information, ReliefF, and information gain were used to screen the feature factors and establish 2 subsets (part A and part B). We analyzed the correlation between features, constructed an H. pylori infection prediction model based on XGBoost, and optimized the model using a grid search. The performance of the model was assessed by comparing its accuracy, recall, precision, F1 score, and AUC with those of 4 other classical machine learning methods.

RESULTS The model performed better on the part B subset than on the part A subset. Compared with the other 4 machine learning methods, the model had the highest accuracy, recall, F1 score, and AUC. SHAP was used to evaluate the importance of features in the model. H. pylori infection of family members, living in rural areas, and poor hand-washing before meals and after using the toilet were found to be risk factors for H. pylori infection.

CONCLUSIONS The model proposed in this paper is superior to other models in predicting H. pylori infection and can provide a scientific basis for identifying the population susceptible to H. pylori and preventing H. pylori infection.
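The abstract names five evaluation metrics (accuracy, recall, precision, F1 score, and AUC) without defining them. The following is a minimal sketch of how these quantities are commonly computed for a binary classifier. It is not the authors' code: the function names, the 0.5 decision threshold, and the toy labels and scores are all assumptions made up for illustration and are not data from the study.

```python
def confusion_counts(y_true, y_pred):
    """Count true/false positives and negatives for binary labels (1 = infected)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return tp, fp, tn, fn


def classification_metrics(y_true, y_pred):
    """Accuracy, recall, precision, and F1 score from hard predictions."""
    tp, fp, tn, fn = confusion_counts(y_true, y_pred)
    accuracy = (tp + tn) / len(y_true)
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return {"accuracy": accuracy, "recall": recall,
            "precision": precision, "f1": f1}


def roc_auc(y_true, scores):
    """AUC as the probability that a random positive case receives a higher
    score than a random negative case (ties count as one half)."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative case")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical toy data (NOT from the study): true labels and model scores.
y_true = [1, 1, 1, 0, 0, 0, 1, 0]
scores = [0.9, 0.8, 0.4, 0.3, 0.6, 0.2, 0.7, 0.1]

# Assumed decision threshold of 0.5 to turn scores into hard predictions.
y_pred = [1 if s >= 0.5 else 0 for s in scores]

metrics = classification_metrics(y_true, y_pred)
auc = roc_auc(y_true, scores)
print(metrics, auc)
```

Accuracy, recall, precision, and F1 depend on the chosen threshold, whereas AUC is computed from the ranking of scores alone, which is one reason studies like this one often report both kinds of metric.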

Conflict of interest statement

Conflict of interest: None declared

Figures

Figure 1. Patient inclusion and model training process.
Figure 2. Heatmap ranking of feature importance. Part A: the top 10 features ranked by the summed importance across the 4 algorithms. Part B: the top 15 features ranked by the summed importance across the 4 algorithms.
Figure 3. AUC curve of the prediction model: (A) part A; (B) part B.
Figure 4. Feature importance histogram.
Figure 5. Feature importance scatter plot. Negative SHAP values indicate a decreased risk of infection, positive SHAP values indicate an increased risk of infection, and the color represents the magnitude of the feature value.
Figure 6. Violin plots of feature importance for living area, hand-washing before meals and after using the toilet, and sharing utensils (SHAP values > 0: increased risk; SHAP values < 0: protective).
