Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Jul 21;112(29):8999-9003.
doi: 10.1073/pnas.1502136112. Epub 2015 Jul 6.

New method to compute Rcomplete enables maximum likelihood refinement for small datasets

Affiliations

New method to compute Rcomplete enables maximum likelihood refinement for small datasets

Jens Luebben et al. Proc Natl Acad Sci U S A. .

Abstract

The crystallographic reliability index [Formula: see text] is based on a method proposed more than two decades ago. Because its calculation is computationally expensive its use did not spread into the crystallographic community in favor of the cross-validation method known as [Formula: see text]. The importance of [Formula: see text] has grown beyond a pure validation tool. However, its application requires a sufficiently large dataset. In this work we assess the reliability of [Formula: see text] and we compare it with k-fold cross-validation, bootstrapping, and jackknifing. As opposed to proper cross-validation as realized with [Formula: see text], [Formula: see text] relies on a method of reducing bias from the structural model. We compare two different methods reducing model bias and question the widely spread notion that random parameter shifts are required for this purpose. We show that [Formula: see text] has as little statistical bias as [Formula: see text] with the benefit of a much smaller variance. Because the calculation of [Formula: see text] is based on the entire dataset instead of a small subset, it allows the estimation of maximum likelihood parameters even for small datasets. [Formula: see text] enables maximum likelihood-based refinement to be extended to virtually all areas of crystallographic structure determination including high-pressure studies, neutron diffraction studies, and datasets from free electron lasers.

Keywords: maximum likelihood refinement; model bias; overfitting; reliability index; structure determination.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

References

    1. Berman HM, et al. The Protein Data Bank. Nucleic Acids Res. 2000;28(1):235–242. - PMC - PubMed
    1. Kennard O. Cambridge Crystallographic Database. Acta Crystallogr A. 1981;37:C343.
    1. Kleywegt GJ, Jones TA. Where freedom is given, liberties are taken. Structure. 1995;3(6):535–540. - PubMed
    1. Kleywegt GJ, Brünger AT. Checking your imagination: Applications of the free R value. Structure. 1996;4(8):897–904. - PubMed
    1. Brünger AT. Free R value: A novel statistical quantity for assessing the accuracy of crystal structures. Nature. 1992;355(6359):472–475. - PubMed

LinkOut - more resources