Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Multicenter Study
. 2013 Dec;28(12):1565-72.
doi: 10.1007/s11606-013-2443-z. Epub 2013 May 4.

Using patients like my patient for clinical decision support: institution-specific probability of celiac disease diagnosis using simplified near-neighbor classification

Affiliations
Multicenter Study

Using patients like my patient for clinical decision support: institution-specific probability of celiac disease diagnosis using simplified near-neighbor classification

Brian H Shirts et al. J Gen Intern Med. 2013 Dec.

Abstract

Background: Interpretation of a diagnostic test result requires knowing what proportion of patients with a "similar" result has the condition in question. This information is often not readily available from the medical literature, or may be based on different clinical populations that make it nonapplicable. In certain settings, where correlated screening parameters and diagnostic data are available in electronic medical records, a representation of diagnostic test performance on "patients like my patient" can be obtained.

Objective: We sought to integrate patient demographic and physician practice information using a simplified nearest neighbor algorithm. We used this method to illustrate the relationship between tTG IgA test result and duodenal biopsy for celiac disease in a local diagnostic context.

Participants: We used a data set of 1,461 paired tissue transglutaminase (tTG) IgA and definitive duodenal biopsy results from Intermountain Healthcare with data on patient age and ordering physician specialty. This was split into a discovery set of 1,000 and a validation set of 461 paired results.

Main measures: Accuracy of the local discovery data set in predicting probability of positive duodenal biopsy and confidence intervals around predicted probability in the test data compared to probabilities of positive biopsy implied from published logistic regression and from published sensitivity and specificity studies.

Key results: The near-neighbor method could estimate probability of clinical outcomes with predictive performance equivalent to other methods while adjusting probability estimates and confidence intervals to fit specific clinical situations.

Conclusions: Data from clinical encounters obtained from electronic medical records can yield prediction estimates that are tailored to the individual patient, local population, and healthcare delivery processes. Local analysis of diagnostic probability may be more clinically meaningful than probabilities inferred from published studies. This local utility may come at the expense of external validity and generalizability.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Illustration of the near-neighbor method for two-dimensional analysis to predict probability of duodenal biopsy using only tTG IgA and patient age in a discovery sample of 1,000 Intermountain Healthcare patients. Natural log-transformed tTG IgA values are plotted by age. The orange box corresponds to the near-neighbor space surrounding a 17-year-old patient with a tTG IgA of 3 units. Of the 161 points inside the orange box (some of which are plotted on top of others), six represent positive biopsies, indicating a 3.9 % chance this patient will have a positive biopsy (95 % CI [1.4 % to 8.2 %]). The green box corresponds to the near-neighbor space surrounding a 58-year-old patient with a tTG IgA of 55 units. Of the 16 points in this box, ten represent positive biopsies indicating a 63 % chance (95 % CI [35 % to 85 %]) this patient will have a positive biopsy. (Note that tTG IgA values of 0 were transformed to −1, since ln(0) is undefined).
Figure 2.
Figure 2.
Illustration of the effect of the size of the near-neighbor space on the probability of a positive biopsy as a function of tTG IgA. For large, middle, and small near-neighbor spaces, tTG IgA space length was 0.77, 0.53, and 0.26 ln(tTG IgA) units, respectively. Near-neighbor analysis using very large and very small cubes is shown as dotted black and dotted gray lines, respectively. In near-neighbor analysis, smaller cube lengths created less smooth probability estimate curves and larger cube lengths decreased the predictive power at high tTG IgA levels. This figure illustrates that box size for clinically meaningful probabilities and adequate smoothness can be determined empirically.
Figure 3.
Figure 3.
Comparison of probability estimates of positive duodenal biopsy from near-neighbor analysis with those derived from published sensitivity and specificity and from published logistic regression. Black points—near-neighbor probability estimates for 461 individuals in the validation data set using tTG IgA and age, with near-neighbor estimates derived from the Intermountain Healthcare discovery data set. Red lines probability estimates inferred from sensitivity and specificity reported by meta-analysis of tTG IgA studies. Blue line—probability of positive biopsy using published logistic regression. Orange vertical lines show the tTG IgA manufacturer’s cutoffs for “weak positive” and “moderate to strong positive.”
Figure 4.
Figure 4.
ROC analysis comparing relative accuracy of near-neighbor analysis by using predicted probabilities as risk scores.
Figure 5.
Figure 5.
Bland–Altman analyses. a Near-neighbor probabilities are compared to binary probability estimates derived from reported sensitivity and specificity. b Near-neighbor probabilities are compared to estimates based on published logistic regression. Logistic regression appears to overestimate probability of positive biopsy for most patients.
None

Comment in

  • Medicine based upon data.
    Safran C. Safran C. J Gen Intern Med. 2013 Dec;28(12):1545-6. doi: 10.1007/s11606-013-2549-3. J Gen Intern Med. 2013. PMID: 23838902 Free PMC article. No abstract available.

References

    1. Leffler DA, Schuppan D. Update on serologic testing in celiac disease. Am J Gastroenterol. 2010;105:2520–4. doi: 10.1038/ajg.2010.276. - DOI - PubMed
    1. Giersiepen K, Lelgemann M, Stuhldreher N, Ronfani L, Husby S, Koletzko S, Korponay-Szabo IR. Accuracy of diagnostic antibody tests for coeliac disease in children: summary of an evidence report. J Pediatr Gastroenterol Nutr. 2012;54:229–41. doi: 10.1097/MPG.0b013e318216f2e5. - DOI - PubMed
    1. Parakkal D, Du H, Semer R, Ehrenpreis ED, Guandalini S. Do gastroenterologists adhere to diagnostic and treatment guidelines for celiac disease? J Clin Gastroenterol. 2012;46:e12–20. doi: 10.1097/MCG.0b013e31822f0da0. - DOI - PubMed
    1. Husby S, Koletzko S, Korponay-Szabo IR, Mearin ML, Phillips A, Shamir R, Troncone R, et al. European Society for Pediatric Gastroenterology, Hepatology, and Nutrition guidelines for the diagnosis of coeliac disease. J Pediatr Gastroenterol Nutr. 2012;54:136–60. doi: 10.1097/MPG.0b013e31821a23d0. - DOI - PubMed
    1. Rozenberg O, Lerner A, Pacht A, Grinberg M, Reginashvili D, Henig C, Barak M. A novel algorithm for the diagnosis of celiac disease and a comprehensive review of celiac disease diagnostics. Clin Rev Allergy Immunol. 2012;42:331–41. doi: 10.1007/s12016-010-8250-y. - DOI - PubMed

Publication types

Substances