Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Apr;141(4):1354-1364.e9.
doi: 10.1016/j.jaci.2017.11.027. Epub 2017 Dec 19.

An algorithm for the classification of mRNA patterns in eosinophilic esophagitis: Integration of machine learning

Affiliations

An algorithm for the classification of mRNA patterns in eosinophilic esophagitis: Integration of machine learning

Benjamin F Sallis et al. J Allergy Clin Immunol. 2018 Apr.

Abstract

Background: Diagnostic evaluation of eosinophilic esophagitis (EoE) remains difficult, particularly the assessment of the patient's allergic status.

Objective: This study sought to establish an automated medical algorithm to assist in the evaluation of EoE.

Methods: Machine learning techniques were used to establish a diagnostic probability score for EoE, p(EoE), based on esophageal mRNA transcript patterns from biopsies of patients with EoE, gastroesophageal reflux disease and controls. Dimensionality reduction in the training set established weighted factors, which were confirmed by immunohistochemistry. Following weighted factor analysis, p(EoE) was determined by random forest classification. Accuracy was tested in an external test set, and predictive power was assessed with equivocal patients. Esophageal IgE production was quantified with epsilon germ line (IGHE) transcripts and correlated with serum IgE and the Th2-type mRNA profile to establish an IGHE score for tissue allergy.

Results: In the primary analysis, a 3-class statistical model generated a p(EoE) score based on common characteristics of the inflammatory EoE profile. A p(EoE) ≥ 25 successfully identified EoE with high accuracy (sensitivity: 90.9%, specificity: 93.2%, area under the curve: 0.985) and improved diagnosis of equivocal cases by 84.6%. The p(EoE) changed in response to therapy. A secondary analysis loop in EoE patients defined an IGHE score of ≥37.5 for a patient subpopulation with increased esophageal allergic inflammation.

Conclusions: The development of intelligent data analysis from a machine learning perspective provides exciting opportunities to improve diagnostic precision and improve patient care in EoE. The p(EoE) and the IGHE score are steps toward the development of decision trees to define EoE subpopulations and, consequently, will facilitate individualized therapy.

Keywords: Allergy diagnosis; IgE; chronic allergic inflammation; eosinophilic esophagitis; eosinophils; machine learning; medical algorithm.

PubMed Disclaimer

Conflict of interest statement

Disclosure of potential conflict of interest: W. S. Lexmond has received grant funding from Ter Meulen Fund, Royal Netherlands Academy of Sciences and the Banning-de Jong Fund; fees from Kiniksa Pharmaceuticals for consultation; and his institution has received grant funds from Mead Johnson Company. Matthew J. Hamilton’s institution has grants pending with GlaxoSmithKline; and he has received consultancy fees from Pfizer, Takeda, and Protal Instruments. J. D. Goldsmith has received consulting fees from Roche Diagnostics and Takeda Pharmaceuticals; travel support from the College of American Pathologists and the Crohn’s and Colitis Foundation; and fees for expert testimony. The rest of the authors declare that they have no relevant conflicts of interest.

Figures

FIG 1.
FIG 1.
Recruitment and dimensionality reduction of normalized mRNA transcripts. A, Patient selection. B, Determination of gene weights. Volcano plots of normalized mRNA transcripts displayed as fold difference (x-axis) and significance (y-axis) were used for the calculation of the factors differentiating EoE and GERD in the proximal and distal esophagus of the training set (n = 113). C, Transcript weights of the factors differentiating EoE and GERD in the distal biopsy. Red indicates weight > 10.
FIG 2.
FIG 2.
HIF1A protein expression in patient biopsies. A, Immunohistochemistry of esophageal tissue sections from EoE, control, and GERD patients stained for HIF1A (n = 5 per group; representative distal biopsy). Tissue sections are shown at 200× and 400×. B, Isotype control (400×). C, ImageJ quantification; P values as calculated by Dunn multiple comparison test after Kruskal-Wallis test.
FIG 3.
FIG 3.
Establishing diagnostic probability scores. A, Training set analysis strategy. B, Probability scores displayed as p(EoE), p(GERD), and p(Control). C, Cluster analysis based on probability scores.
FIG 4.
FIG 4.
Testing diagnostic and predictive accuracy. A, External test set and equivocal test set analysis strategy. B, ROC analysis of p(EoE) as a diagnostic parameter for EoE, p(Control) as a diagnostic marker for controls, and p(GERD) as a diagnostic marker for GERD. C, Diagnosis of the external test set based on probability scores. D, Probability score-based diagnosis of the equivocal test set.
FIG 5.
FIG 5.
p(EoE) as a composite score to monitor therapy response. A, p(EoE) before and after steroid treatment. B, Examples of alterations in highly weighted EoE transcripts to steroid treatment. C, Heat map of most significantly altered transcripts. *P < .05 and **P < .01, using ratio paired t test.
FIG 6.
FIG 6.
Serum IgE, esophageal IGHE transcript levels and local TH2-type inflammation. A, Patient key. B, Correlation of serum IgE titers and esophageal IGHE transcript levels. Separation of EoE patients based on serum IgE and IGHE. C, Eosinophil counts in IGHE-low (white circles: normal serum IgE; black circles: elevated IgE) and IGHE-high patients (orange circles: normal IgE; red circles: elevated IgE). D, Representative heat map of patients from the IGHE-high/serum-IgE-normal and IGHE-low/serum-IgE-elevated. Only the most enriched genes are shown. E, Representative transcripts: FcεRIβ, CCL5, and IL-13. **P < .01, ***P < .001, as calculated by Dunn multiple comparison test after Kruskal-Wallis test.
FIG 7.
FIG 7.
IGHE score as a readout of esophageal allergic inflammation. A, Primary and secondary analysis loop. B, EoE patient groups stratified base on a composite score to distinguish IGHE-high and IGHE-low groups. C, ROC analysis. D, Correlation of IGHE-score with FcεRIβ mRNA. For patient key, see Fig 6, A.

Similar articles

Cited by

References

    1. Cianferoni A, Spergel JM. Eosinophilic esophagitis and gastroenteritis. Curr Allergy Asthma Rep 2015;15:58. - PubMed
    1. Dehlink E, Fiebiger E. The role of the high-affinity IgE receptor, FcepsilonRI, in eosinophilic gastrointestinal diseases. Immunol Allergy Clin North Am 2009;29: 159–70, xii. - PubMed
    1. Furuta GT, Forbes D, Boey C, Dupont C, Putnam P, Roy S, et al. Eosinophilic gastrointestinal diseases (EGIDs). J Pediatr Gastroenterol Nutr 2008;47:234–8. - PubMed
    1. Hommel KA, Franciosi JP, Hente EA, Ahrens A, Rothenberg ME. Treatment adherence in pediatric eosinophilic gastrointestinal disorders. J Pediatr Psychol 2012;37:533–42. - PMC - PubMed
    1. Fahey LM, Liacouras CA. Eosinophilic gastrointestinal disorders. Pediatr Clin North Am 2017;64:475–85. - PubMed

Publication types