Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Nov;207(3):1147-1155.
doi: 10.1534/genetics.117.300283. Epub 2017 Sep 12.

Improving Disease Prediction by Incorporating Family Disease History in Risk Prediction Models with Large-Scale Genetic Data

Affiliations

Improving Disease Prediction by Incorporating Family Disease History in Risk Prediction Models with Large-Scale Genetic Data

Jungsoo Gim et al. Genetics. 2017 Nov.

Abstract

Despite the many successes of genome-wide association studies (GWAS), the known susceptibility variants identified by GWAS have modest effect sizes, leading to notable skepticism about the effectiveness of building a risk prediction model from large-scale genetic data. However, in contrast to genetic variants, the family history of diseases has been largely accepted as an important risk factor in clinical diagnosis and risk prediction. Nevertheless, the complicated structures of the family history of diseases have limited their application in clinical practice. Here, we developed a new method that enables incorporation of the general family history of diseases with a liability threshold model, and propose a new analysis strategy for risk prediction with penalized regression analysis that incorporates both large numbers of genetic variants and clinical risk factors. Application of our model to type 2 diabetes in the Korean population (1846 cases and 1846 controls) demonstrated that single-nucleotide polymorphisms accounted for 32.5% of the variation explained by the predicted risk scores in the test data set, and incorporation of family history led to an additional 6.3% improvement in prediction. Our results illustrate that family medical history provides valuable information on the variation of complex diseases and improves prediction performance.

Keywords: Genetic variability in complex binary traits; Liability threshold model; family history; penalized prediction model; risk prediction in complex disease.

PubMed Disclaimer

Figures

Figure 1
Figure 1
A schematic overview of the analysis. Individuals in the main data set include genotyped SNPs, while the external data set of those individuals includes relative’s relationship and disease status. A 10-fold cross-validation scheme was applied to build and test the performance of the prediction models
Figure 2
Figure 2
Clinical variables between cases and controls. Conditional mean (CM) (A), age (B), sex (C), body mass index (BMI) (D), systolic blood pressure (SBP) (E), and diastolic blood pressure (DBP) (F) are shown in boxplots. Two-sample t-test was performed to obtain P-values (* < 0.05; *** < 0.001). For sex, a χ2 test was conducted.
Figure 3
Figure 3
Model comparison with different family history measures. Prediction performance (AUC) is depicted without family history (red), with weighted mean (blue), and with conditional mean (green). AUC, area under the curve; EN, Elastic-Net; SCAD, smoothly clipped absolute deviation; T.Ridge, Truncated Ridge.
Figure 4
Figure 4
Proportion of variation explained by each variable in the final model. For six clinical variables [age, sex, body mass index (BMI), systolic blood pressure (SBP), diastolic blood pressure (DBP), and conditional mean (CM)], the individual proportions of variation are shown, whereas variation explained by the 5000 SNPs is shown according to their summed proportion.

References

    1. Aekplakorn W., Bunnag P., Woodward M., Sritara P., Cheepudomwit S., et al. , 2006. A risk score for predicting incident diabetes in the Thai population. Diabetes Care 29: 1872–1877. - PubMed
    1. Chatterjee A., Lahiri S. N., 2011. Bootstrapping Lasso estimators. J. Am. Stat. Assoc. 106: 608–625.
    1. Cheng H., Treglown L., Montgomery S., Furnham A., 2015. Associations between familial factor, trait conscientiousness, gender and the occurrence of type 2 diabetes in adulthood: evidence from a British cohort. PLoS One 10: e0122701. - PMC - PubMed
    1. Clayton D. G., 2009. Prediction and interaction in complex disease genetics: experience in type 1 diabetes. PLoS Genet. 5: e1000540. - PMC - PubMed
    1. Committee on Diabetic Twins, Japan Diabetes Society , 1988. Diabetes mellitus in twins: a cooperative study in Japan. Diabetes Res. Clin. Pract. 5: 271–280. - PubMed

Publication types

MeSH terms