Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2006 Sep;174(1):481-9.
doi: 10.1534/genetics.106.061549. Epub 2006 Jun 18.

Poor performance of bootstrap confidence intervals for the location of a quantitative trait locus

Affiliations

Poor performance of bootstrap confidence intervals for the location of a quantitative trait locus

Ani Manichaikul et al. Genetics. 2006 Sep.

Abstract

The aim of many genetic studies is to locate the genomic regions (called quantitative trait loci, QTL) that contribute to variation in a quantitative trait (such as body weight). Confidence intervals for the locations of QTL are particularly important for the design of further experiments to identify the gene or genes responsible for the effect. Likelihood support intervals are the most widely used method to obtain confidence intervals for QTL location, but the nonparametric bootstrap has also been recommended. Through extensive computer simulation, we show that bootstrap confidence intervals behave poorly and so should not be used in this context. The profile likelihood (or LOD curve) for QTL location has a tendency to peak at genetic markers, and so the distribution of the maximum-likelihood estimate (MLE) of QTL location has the unusual feature of point masses at genetic markers; this contributes to the poor behavior of the bootstrap. Likelihood support intervals and approximate Bayes credible intervals, on the other hand, are shown to behave appropriately.

PubMed Disclaimer

Figures

F<sc>igure</sc> 1.—
Figure 1.—
Results for the chromosome 4 data of Sugiyama et al. (2001). (A) The LOD curve and the 1.5-LOD support interval. Tick marks at the bottom indicate the locations of the genetic markers. (B) A histogram of the estimated QTL locations in 10,000 bootstrap replicates, and the 95% bootstrap confidence interval, calculated by the method of Visscher et al. (1996).
F<sc>igure</sc> 2.—
Figure 2.—
Estimated distribution of the MLE of QTL location, formula image, as a function of the true location of the QTL, θ, for θ varying from 45 to 50. The results are based on 10,000 simulation replicates of a backcross with 200 individuals for a chromosome of length 100 cM and having 11 equally spaced markers and with the heritability due to the QTL at 10%.
F<sc>igure</sc> 3.—
Figure 3.—
Coverage of 95% bootstrap confidence intervals (black), 1-LOD support intervals (red), and 95% Bayes credible intervals (blue) as a function of the true QTL position, θ. The dashed vertical gray lines denote marker positions on the chromosome.
F<sc>igure</sc> 4.—
Figure 4.—
Coverage of 95% bootstrap confidence intervals (black), 1-LOD support intervals (red), and 95% Bayes credible intervals (blue) as a function of the MLE of QTL position, formula image. The dashed vertical gray lines denote marker positions on the chromosome.
F<sc>igure</sc> 5.—
Figure 5.—
Estimated amount to drop in a LOD support interval (A and C) and the nominal Bayes coverage for the approximate Bayes credible interval (B and D) to give 95% coverage, based on 100,000 simulation replicates. Backcross (A and B) and intercross (C and D) experiments with either 200 (black curves) or 500 (red curves) individuals were considered. The line types indicate different possible marker spacings. Values are plotted as a function of the effect of the QTL, scaled according to the power to detect the QTL.
F<sc>igure</sc> 6.—
Figure 6.—
Coverage of the 1.5-LOD support interval in a backcross (A), the 96.5% Bayes interval in a backcross (B), the 1.8-LOD support interval in an intercross (C), and the 97% Bayes interval in an intercross (D), based on 100,000 simulation replicates. The black curves are for 200 individuals; the red curves are for 500 individuals. The line types indicate different possible marker spacings. Values are plotted as a function of the effect of the QTL, scaled according to the power to detect the QTL.

Similar articles

Cited by

References

    1. Beran, R., 2003. The impact of the bootstrap on statistical algorithms and theory. Stat. Sci. 18: 175–184.
    1. Broman, K. W., H. Wu, Ś. Sen and G. A. Churchill, 2003. R/qtl: QTL mapping in experimental crosses. Bioinformatics 19: 889–890. - PubMed
    1. Dempster, A. P., N. M. Laird and D. B. Rubin, 1977. Maximum likelihood from incomplete data via the EM algorithm. J. R. Stat. Soc. B 39: 1–38.
    1. Dupuis, J., and D. Siegmund, 1999. Statistical methods for mapping quantitative trait loci from a dense set of markers. Genetics 151: 373–386. - PMC - PubMed
    1. Haley, C. S., and S. A. Knott, 1992. A simple regression method for mapping quantitative trait loci in line crosses using flanking markers. Heredity 69: 315–324. - PubMed

Publication types