Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Oct 15:7:9.
doi: 10.1186/1742-5573-7-9.

Categorisation of continuous risk factors in epidemiological publications: a survey of current practice

Affiliations

Categorisation of continuous risk factors in epidemiological publications: a survey of current practice

Elizabeth L Turner et al. Epidemiol Perspect Innov. .

Abstract

Background: Reports of observational epidemiological studies often categorise (group) continuous risk factor (exposure) variables. However, there has been little systematic assessment of how categorisation is practiced or reported in the literature and no extended guidelines for the practice have been identified. Thus, we assessed the nature of such practice in the epidemiological literature. Two months (December 2007 and January 2008) of five epidemiological and five general medical journals were reviewed. All articles that examined the relationship between continuous risk factors and health outcomes were surveyed using a standard proforma, with the focus on the primary risk factor. Using the survey results we provide illustrative examples and, combined with ideas from the broader literature and from experience, we offer guidelines for good practice.

Results: Of the 254 articles reviewed, 58 were included in our survey. Categorisation occurred in 50 (86%) of them. Of those, 42% also analysed the variable continuously and 24% considered alternative groupings. Most (78%) used 3 to 5 groups. No articles relied solely on dichotomisation, although it did feature prominently in 3 articles. The choice of group boundaries varied: 34% used quantiles, 18% equally spaced categories, 12% external criteria, 34% other approaches and 2% did not describe the approach used. Categorical risk estimates were most commonly (66%) presented as pairwise comparisons to a reference group, usually the highest or lowest (79%). Reporting of categorical analysis was mostly in tables; only 20% in figures.

Conclusions: Categorical analyses of continuous risk factors are common. Accordingly, we provide recommendations for good practice. Key issues include pre-defining appropriate choice of groupings and analysis strategies, clear presentation of grouped findings in tables and figures, and drawing valid conclusions from categorical analyses, avoiding injudicious use of multiple alternative analyses.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Example from the survey: categorical results displayed as a figure. Relative risk (& 95% CI) for coronary events by quintiles of NO2 (μg/m3) exposure [17]. Reference category is bottom fifth, trend line is fitted. More emphasis given to quantitative analysis. This image is reproduced with permission from Epidemiology.

References

    1. Altman DG, Royston P. The cost of dichotomising continuous variables. Br Med J. 2006;332:1080. doi: 10.1136/bmj.332.7549.1080. - DOI - PMC - PubMed
    1. Royston P, Altman DG, Sauerbrei W. Dichotomizing continuous predictors in multiple regression: a bad idea. Stat Med. 2006;25:127–141. doi: 10.1002/sim.2331. - DOI - PubMed
    1. Cochran WG. The effectiveness of adjustment by subclassification in removing bias in observational studies. Biometrics. 1968;24:295–313. doi: 10.2307/2528036. - DOI - PubMed
    1. Connor RJ. Grouping for testing trends in categorical data. J Am Stat Assoc. 1972;67:601–604. doi: 10.2307/2284446. - DOI
    1. Cox DR. Note on grouping. J Am Stat Assoc. 1957;52:543–547. doi: 10.2307/2281704. - DOI

LinkOut - more resources