Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 1997 Jan 1;26(3):10.1080/03610927708831932.
doi: 10.1080/03610927708831932.

THE DISTRIBUTION OF COOK'S D STATISTIC

Affiliations

THE DISTRIBUTION OF COOK'S D STATISTIC

Keith E Muller et al. Commun Stat Theory Methods. .

Abstract

Cook (1977) proposed a diagnostic to quantify the impact of deleting an observation on the estimated regression coefficients of a General Linear Univariate Model (GLUM). Simulations of models with Gaussian response and predictors demonstrate that his suggestion of comparing the diagnostic to the median of the F for overall regression captures an erratically varying proportion of the values. We describe the exact distribution of Cook's statistic for a GLUM with Gaussian predictors and response. We also present computational forms, simple approximations, and asymptotic results. A simulation supports the accuracy of the results. The methods allow accurate evaluation of a single value or the maximum value from a regression analysis. The approximations work well for a single value, but less well for the maximum. In contrast, the cut-point suggested by Cook provides widely varying tail probabilities. As with all diagnostics, the data analyst must use scientific judgment in deciding how to treat highlighted observations.

Keywords: influence; regression diagnostics; residual analysis.

PubMed Disclaimer

References

    1. Atkinson AC. Plots, Transformations, and Regression. Clarendon Press; Oxford: 1985.
    1. Belsley DA, Kuh E, Welsch RE. Regression Diagnostics: Identifying Influential Data alld Sources of Collinearity. Wiley; New York: 1980.
    1. Chatterjee, Sand Hadi AS. Influential Observations, High Leverage Points, and Outliers in Linear Regression. Statistical Science. 1986;1:379–416.
    1. Chen Mok M. Evaluating Cook’s D Statistic in Theory and Practice: A Simulation Study. Department of Biostatistics, University of North Carolina; Chapel Hill: 1993. Unpublished Master’s Paper.
    1. Cook RD. Detection of Influential Observations in Linear Regression. Technometrics. 1977;19:15–18.

LinkOut - more resources