Analysis of family- and population-based samples in cohort genome-wide association studies
- PMID: 21805149
- PMCID: PMC3369696
- DOI: 10.1007/s00439-011-1071-0
Abstract
Cohort studies typically sample unrelated individuals from a population, although family members of index cases may also be recruited to investigate shared familial risk factors. Recruitment of family members may be incomplete or ancillary to the main cohort, resulting in a mixed sample of independent family units, including unrelated singletons and multiplex families. Multiple methods are available to perform genome-wide association (GWA) analysis of binary or continuous traits in families, but it is unclear whether methods known to perform well on ascertained pedigrees, sibships, or trios are appropriate for the analysis of a mixed sample of unrelated cohort members and families. We present simulation studies based on Multi-Ethnic Study of Atherosclerosis (MESA) pedigree structures to compare the performance of several popular methods of GWA analysis for both quantitative and dichotomous traits in cohort studies. We evaluate approaches suitable for analysis of families and combine the best-performing methods with population-based samples, either by meta-analysis or by pooled analysis of family- and population-based samples (mega-analysis), comparing type I error and power. We further assess practical considerations, such as availability of software and ability to incorporate covariates in statistical modeling, and demonstrate our recommended approaches through quantitative and binary trait analysis of HDL cholesterol (HDL-C) in 2,553 MESA family- and population-based African-American samples. Our results suggest that linear modeling approaches that accommodate family-induced phenotypic correlation (e.g., a variance-component model for quantitative traits or generalized estimating equations for dichotomous traits) perform best in the context of combined family- and population-based cohort GWAS.
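The abstract contrasts meta-analysis with pooled (mega-) analysis but does not say which meta-analysis scheme was used. As a minimal sketch, assuming a fixed-effect inverse-variance scheme (a common choice, not confirmed by the source), per-SNP effect estimates from a family-based analysis and a population-based analysis could be combined as follows. The function name and the numeric inputs are hypothetical and serve only as illustration.

```python
import math


def inverse_variance_meta(estimates):
    """Combine (beta, se) pairs using fixed-effect inverse-variance weights.

    Assumption: the estimates come from independent samples (here, the
    family-based and the population-based parts of a cohort).
    """
    weights = [1.0 / (se ** 2) for _, se in estimates]
    total_weight = sum(weights)
    # Weighted average of the effect estimates
    beta = sum(w * b for w, (b, _) in zip(weights, estimates)) / total_weight
    # Standard error of the combined estimate
    se = math.sqrt(1.0 / total_weight)
    z = beta / se
    # Two-sided p-value under a standard normal reference distribution
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return beta, se, z, p


# Hypothetical per-SNP results: (effect estimate, standard error)
family_result = (0.12, 0.05)      # e.g. from a variance-component model
population_result = (0.08, 0.04)  # e.g. from ordinary linear regression

beta, se, z, p = inverse_variance_meta([family_result, population_result])
print(f"beta={beta:.4f} se={se:.4f} z={z:.2f} p={p:.3g}")
```

A mega-analysis would instead fit one model to the pooled individual-level data while accounting for within-family correlation, which this sketch does not attempt.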
Similar articles
- Modeling the Dependence Structure in Genome Wide Association Studies of Binary Phenotypes in Family Data. Behav Genet. 2020 Nov;50(6):423-439. doi: 10.1007/s10519-020-10010-2. Epub 2020 Aug 17. PMID: 32804302. Free PMC article.
- Single Marker Family-Based Association Analysis Not Conditional on Parental Information. Methods Mol Biol. 2017;1666:409-439. doi: 10.1007/978-1-4939-7274-6_20. PMID: 28980257. Review.
- Fast Genome-Wide QTL Association Mapping on Pedigree and Population Data. Genet Epidemiol. 2017 Apr;41(3):174-186. doi: 10.1002/gepi.21988. Epub 2016 Dec 12. PMID: 27943406. Free PMC article.
- Detecting Familial Aggregation. Methods Mol Biol. 2017;1666:133-169. doi: 10.1007/978-1-4939-7274-6_8. PMID: 28980245.
- Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas. Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217. PMID: 36321557. Free PMC article.
Cited by
- The impact of disregarding family structure on genome-wide association analysis of complex diseases in cohorts with simple pedigrees. J Appl Genet. 2020 Feb;61(1):75-86. doi: 10.1007/s13353-019-00526-7. Epub 2019 Nov 21. PMID: 31755004. Free PMC article.
- Gene-gene interactions in APOL1-associated nephropathy. Nephrol Dial Transplant. 2014 Mar;29(3):587-94. doi: 10.1093/ndt/gft423. Epub 2013 Oct 24. PMID: 24157943. Free PMC article.
- Leveraging local identity-by-descent increases the power of case/control GWAS with related individuals. Ann Appl Stat. 2014 Jun;8(2):974-998. doi: 10.1214/14-aoas715. PMID: 25544865. Free PMC article.
- Integrating SNP data to reveal the adaptive selection features of goat populations in extreme environments. BMC Genomics. 2025 Jun 2;26(1):553. doi: 10.1186/s12864-025-11743-2. PMID: 40457191. Free PMC article.
- Efficient generalized least squares method for mixed population and family-based samples in genome-wide association studies. Genet Epidemiol. 2014 Jul;38(5):430-8. doi: 10.1002/gepi.21811. Epub 2014 May 20. PMID: 24845555. Free PMC article.
Grants and funding
- N02-HL-6-4278/HL/NHLBI NIH HHS/United States
- R01 HL071251/HL/NHLBI NIH HHS/United States
- R01 HL071259/HL/NHLBI NIH HHS/United States
- RR-024156/RR/NCRR NIH HHS/United States
- N01-HC-95159/HC/NHLBI NIH HHS/United States
- UL1 TR000124/TR/NCATS NIH HHS/United States
- R01HL071205/HL/NHLBI NIH HHS/United States
- N01-HC-95169/HC/NHLBI NIH HHS/United States
- R01 HL071205/HL/NHLBI NIH HHS/United States
- R01HL071259/HL/NHLBI NIH HHS/United States
- R01HL071251/HL/NHLBI NIH HHS/United States
- R01HL071258/HL/NHLBI NIH HHS/United States
- R01HL071250/HL/NHLBI NIH HHS/United States
- N01 HC095169/HL/NHLBI NIH HHS/United States
- R01 HL071252/HL/NHLBI NIH HHS/United States
- R01 HL071250/HL/NHLBI NIH HHS/United States
- UL1 RR024156/RR/NCRR NIH HHS/United States
- R01HL071252/HL/NHLBI NIH HHS/United States
- N01 HC095159/HL/NHLBI NIH HHS/United States
- P30 DK063491/DK/NIDDK NIH HHS/United States
- R01 HL071051/HL/NHLBI NIH HHS/United States
- R01HL071051/HL/NHLBI NIH HHS/United States
- R01 HL071258/HL/NHLBI NIH HHS/United States