Practical considerations in the analysis of complex sample survey data
- PMID: 10587999
Abstract
Large-scale sample surveys often provide fertile ground for analyses by epidemiologists. Recently, survey organizations such as the National Center for Health Statistics and the United States Bureau of the Census have distributed data from large surveys to interested investigators on CD-ROM. Confronted by the richness of such databases and a historical shortage of software that properly accounts for the survey design, researchers have often simply ignored the complexities of the survey and analyzed the data as if they came from a simple random sample. Modern programs such as STATA and SUDAAN now give data analysts the capability to perform design-based analyses whenever appropriate. We used data from NHANES III and the PAQUID study to illustrate the ease of performing design-based analyses, and we compared results under model-based and design-based approaches. When data from complex sample surveys were analyzed using both strategies, the point estimates and standard errors of means, regression coefficients, and odds ratios differed. The differences between the two strategies were smaller for regression coefficients and odds ratios than for means. The potential for such differences, together with the availability of survey analysis software, should encourage researchers to use design-based techniques when analyzing data from complex sample surveys.
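To make the contrast concrete, the following is a minimal Python sketch, not the authors' code and not the internals of STATA or SUDAAN. It compares a mean and standard error computed as if the data were a simple random sample (which is how we read the abstract's model-based scenario) with a weighted mean whose standard error comes from a Taylor-linearized, with-replacement variance over primary sampling units (PSUs) within strata. The function names, the toy values, and the weights, strata, and PSU labels are all hypothetical; they are not drawn from NHANES III or PAQUID.

```python
import math
from collections import defaultdict


def srs_mean_se(y):
    """Mean and standard error treating y as a simple random sample."""
    n = len(y)
    mean = sum(y) / n
    var = sum((v - mean) ** 2 for v in y) / (n - 1)
    return mean, math.sqrt(var / n)


def design_mean_se(y, w, strata, psu):
    """Weighted mean with a linearized SE for a stratified cluster design.

    Assumes PSUs are sampled with replacement within strata (a common
    approximation) and that every stratum contains at least two PSUs.
    """
    total_w = sum(w)
    mean = sum(wi * yi for wi, yi in zip(w, y)) / total_w

    # Linearized score for a ratio mean, summed to the PSU level per stratum.
    psu_totals = defaultdict(lambda: defaultdict(float))
    for yi, wi, h, c in zip(y, w, strata, psu):
        psu_totals[h][c] += wi * (yi - mean) / total_w

    var = 0.0
    for h, clusters in psu_totals.items():
        n_h = len(clusters)
        if n_h < 2:
            raise ValueError(f"stratum {h!r} needs at least two PSUs")
        zbar = sum(clusters.values()) / n_h
        # Between-PSU variability within the stratum.
        var += n_h / (n_h - 1) * sum((z - zbar) ** 2 for z in clusters.values())
    return mean, math.sqrt(var)


# Hypothetical toy data: two strata, two PSUs each, two respondents per PSU.
y = [1, 2, 3, 4, 5, 6, 7, 8]
w = [1, 1, 1, 1, 3, 3, 3, 3]        # stratum 2 respondents represent more people
strata = [1, 1, 1, 1, 2, 2, 2, 2]
psu = [1, 1, 2, 2, 1, 1, 2, 2]      # PSU labels are unique only within a stratum

srs_mean, srs_se = srs_mean_se(y)
dsn_mean, dsn_se = design_mean_se(y, w, strata, psu)
print(f"ignoring design: mean={srs_mean:.3f}, se={srs_se:.3f}")
print(f"design-based:    mean={dsn_mean:.3f}, se={dsn_se:.3f}")
```

In this toy example the unequal weights shift the point estimate, and the clustering changes the standard error. When every observation has equal weight and forms its own PSU in a single stratum, the design-based result collapses to the simple random sample result, which is a useful sanity check on the sketch.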