Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2014 Jan;15(1):129-39.
doi: 10.1093/biostatistics/kxt039. Epub 2013 Oct 4.

A conditional predictive p-value to compare a multinomial with an overdispersed multinomial in the analysis of T-cell populations

Affiliations

A conditional predictive p-value to compare a multinomial with an overdispersed multinomial in the analysis of T-cell populations

Qinglin Pei et al. Biostatistics. 2014 Jan.

Abstract

Immunological experiments that record primary molecular sequences of T-cell receptors produce moderate to high-dimensional categorical data, some of which may be subject to extra-multinomial variation caused by technical constraints of cell-based assays. Motivated by such experiments in melanoma research, we develop a statistical procedure for testing the equality of two discrete populations, where one population delivers multinomial data and the other is subject to a specific form of overdispersion. The procedure computes a conditional-predictive p-value by splitting the data set into two, obtaining a predictive distribution for one piece given the other, and using the observed predictive ordinate to generate a p-value. The procedure has a simple interpretation, requires fewer modeling assumptions than would be required of a fully Bayesian analysis, and has reasonable operating characteristics as evidenced empirically and by asymptotic analysis.

Keywords: Bayesian p-value; Dirichlet multinomial; Double overdispersion; Fisher's exact test; HPRT assay; Mass culture experiments; Molecular sequence data; T-cell receptor.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Summary statistics for J region frequencies from Tables 1 and 2. Colored bars show empirical frequencies of each J region type, from various data sources. Boxplots summarize posterior analysis of the underlying proportions conditional on the MT (not WT) data. Types are arranged from top to bottom by increasing value of the posterior median proportion. Note that the boxplots track the green bars better than the red bars, since the SC data are less variable than the MC data. The hypothesis test asks if the common (vector over types) mean of the MT data differs significantly from the mean of the WT cells (blue).

References

    1. Agresti A. Categorical Data Analysis. New York: Wiley; 1990.
    1. Albertini R. J. HPRT mutations in humans: biomarkers for mechanistic studies. Mutation Research. 2001;489:1–16. - PubMed
    1. Albertini M. R., King D. M., Newton M. A., Vacek P. M. In vivo mutant frequency of thioguanine-resistant T-cells in the peripheral blood and lymph nodes of melanoma patients. Mutation Research. 2001;476:83–97. - PubMed
    1. Albertini M. R., Macklin M. D., Zuleger C. L., Newton M. A., Judice S. A., Albertini R. J. Clonal expansions of 6-thioguanine resistant T lymphocytes in the blood and tumor of melanoma patients. Environmental and Molecular Mutagenesis. 2008;49:676–687. - PMC - PubMed
    1. Bayarri M. J., Berger J. O. Quantifying surprise in the data and model verification. In: Bernardo J. M., Berger J. O., Dawid A. P., Smith A. F. M., editors. Bayesian Statistics 6. London: Oxford University Press; 1999. pp. 53–82.

Publication types

MeSH terms

Substances

LinkOut - more resources