Identification of multiple rare variants associated with a disease
- PMID: 22373445
- PMCID: PMC3287826
- DOI: 10.1186/1753-6561-5-S9-S103
Identification of multiple rare variants associated with a disease
Abstract
Identifying rare variants that are responsible for complex disease has been promoted by advances in sequencing technologies. However, statistical methods that can handle the vast amount of data generated and that can interpret the complicated relationship between disease and these variants have lagged. We apply a zero-inflated Poisson regression model to take into account the excess of zeros caused by the extremely low frequency of the 24,487 exonic variants in the Genetic Analysis Workshop 17 data. We grouped the 697 subjects in the data set as Europeans, Asians, and Africans based on principal components analysis and found the total number of rare variants per gene for each individual. We then analyzed these collapsed variants based on the assumption that rare variants are enriched in a group of people affected by a disease compared to a group of unaffected people. We also tested the hypothesis with quantitative traits Q1, Q2, and Q4. Analyses performed on the combined 697 individuals and on each ethnic group yielded different results. For the combined population analysis, we found that UGT1A1, which was not part of the simulation model, was associated with disease liability and that FLT1, which was a causal locus in the simulation model, was associated with Q1. Of the causal loci in the simulation models, FLT1 and KDR were associated with Q1 and VNN1 was correlated with Q2. No significant genes were associated with Q4. These results show the feasibility and capability of our new statistical model to detect multiple rare variants influencing disease risk.
Similar articles
-
Identifying rare variants using a Bayesian regression approach.BMC Proc. 2011 Nov 29;5 Suppl 9(Suppl 9):S99. doi: 10.1186/1753-6561-5-S9-S99. BMC Proc. 2011. PMID: 22373362 Free PMC article.
-
Application of collapsing methods for continuous traits to the Genetic Analysis Workshop 17 exome sequence data.BMC Proc. 2011 Nov 29;5 Suppl 9(Suppl 9):S121. doi: 10.1186/1753-6561-5-S9-S121. BMC Proc. 2011. PMID: 22373425 Free PMC article.
-
Detection of rare variant effects in association studies: extreme values, iterative regression, and a hybrid approach.BMC Proc. 2011 Nov 29;5 Suppl 9(Suppl 9):S112. doi: 10.1186/1753-6561-5-S9-S112. BMC Proc. 2011. PMID: 22373188 Free PMC article.
-
Comparison of results from tests of association in unrelated individuals with uncollapsed and collapsed sequence variants using tiled regression.BMC Proc. 2011 Nov 29;5 Suppl 9(Suppl 9):S15. doi: 10.1186/1753-6561-5-S9-S15. BMC Proc. 2011. PMID: 22373501 Free PMC article.
-
Identifying causal rare variants of disease through family-based analysis of Genetics Analysis Workshop 17 data set.BMC Proc. 2011 Nov 29;5 Suppl 9(Suppl 9):S21. doi: 10.1186/1753-6561-5-S9-S21. BMC Proc. 2011. PMID: 22373204 Free PMC article.
Cited by
-
Using LASSO regression to detect predictive aggregate effects in genetic studies.BMC Proc. 2011 Nov 29;5 Suppl 9(Suppl 9):S69. doi: 10.1186/1753-6561-5-S9-S69. BMC Proc. 2011. PMID: 22373537 Free PMC article.
-
Regression and data mining methods for analyses of multiple rare variants in the Genetic Analysis Workshop 17 mini-exome data.Genet Epidemiol. 2011;35 Suppl 1(Suppl 1):S92-100. doi: 10.1002/gepi.20657. Genet Epidemiol. 2011. PMID: 22128066 Free PMC article.
-
Precisely modeling zero-inflated count phenotype for rare variants.Genet Epidemiol. 2022 Feb;46(1):73-86. doi: 10.1002/gepi.22438. Epub 2021 Nov 15. Genet Epidemiol. 2022. PMID: 34779034 Free PMC article.
References
LinkOut - more resources
Full Text Sources
Miscellaneous