Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2009 May 7;258(1):103-11.
doi: 10.1016/j.jtbi.2009.01.023. Epub 2009 Feb 4.

Genetic model for longitudinal studies of aging, health, and longevity and its potential application to incomplete data

Affiliations

Genetic model for longitudinal studies of aging, health, and longevity and its potential application to incomplete data

Konstantin G Arbeev et al. J Theor Biol. .

Abstract

Many longitudinal studies of aging collect genetic information only for a sub-sample of participants of the study. These data also do not include recent findings, new ideas and methodological concepts developed by distinct groups of researchers. The formal statistical analyses of genetic data ignore this additional information and therefore cannot utilize the entire research potential of the data. In this paper, we present a stochastic model for studying such longitudinal data in joint analyses of genetic and non-genetic sub-samples. The model incorporates several major concepts of aging known to date and usually studied independently. These include age-specific physiological norms, allostasis and allostatic load, stochasticity, and decline in stress resistance and adaptive capacity with age. The approach allows for studying all these concepts in their mutual connection, even if respective mechanisms are not directly measured in data (which is typical for longitudinal data available to date). The model takes into account dependence of longitudinal indices and hazard rates on genetic markers and permits evaluation of all these characteristics for carriers of different alleles (genotypes) to address questions concerning genetic influence on aging-related characteristics. The method is based on extracting genetic information from the entire sample of longitudinal data consisting of genetic and non-genetic sub-samples. Thus it results in a substantial increase in the accuracy of statistical estimates of genetic parameters compared to methods that use only information from a genetic sub-sample. Such an increase is achieved without collecting additional genetic data. Simulation studies illustrate the increase in the accuracy in different scenarios for datasets structurally similar to the Framingham Heart Study. Possible applications of the model and its further generalizations are discussed.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
Simulation study: Comparison of estimates for carriers of a hypothetical allele (genotype) obtained in 100 simulated data sets by three methods. Left column: Estimates of age trajectories (solid grey lines) of logarithm of baseline hazard (ln μ0 (1,t)), quadratic hazard (QH) terms (Q(1, t)), and age-dependent norms (f(1, t)) calculated using only sub-samples with genetic information (500 individuals). Middle column: Similar estimates when the entire sample (2500 individuals) contains genetic information. Right column: The estimates calculated using the joint analysis of genetic (500 individuals) and non-genetic (2000 individuals) data. Respective “true” trajectories used for simulation of data are shown as dashed black lines; t denotes age.
Fig. 2
Fig. 2
Simulation study: Comparison of estimates for non-carriers of a hypothetical allele (genotype) obtained in 100 simulated data sets by three methods. Left column: Estimates of age trajectories (solid grey lines) of logarithm of baseline hazard (ln μ0 (0,t) ), quadratic hazard (QH) terms (Q(0, t)), and age-dependent norms (f(0, t)) calculated using only sub-samples with genetic information (500 individuals). Middle column: Similar estimates when the entire sample (2500 individuals) contains genetic information. Right column: The estimates calculated using the joint analysis of genetic (500 individuals) and non-genetic (2000 individuals) data. Respective “true” trajectories used for simulation of data are shown as dashed black lines; t denotes age.

Similar articles

Cited by

References

    1. Allison DB, Faith MS, Heo M, Kotler DP. Hypothesis concerning the U-shaped relation between body mass index and mortality. Am J Epidemiol. 1997;146:339–349. - PubMed
    1. Boutitie F, Gueyffier F, Pocock S, Fagard R, Boissel JP. J-shaped relationship between blood pressure and mortality in hypertensive patients: New insights from a meta-analysis of individual-patient data. Ann Intern Med. 2002;136:438–448. - PubMed
    1. Breslow NE, Cain KC. Logistic regression for two-stage case-control data. Biometrika. 1988;75:11–20.
    1. Breslow NE, Holubkov R. Weighted likelihood, pseudo-likelihood and maximum likelihood methods for logistic regression analysis of two-stage data. Stat Med. 1997a;16:103–116. - PubMed
    1. Breslow NE, Holubkov R. Maximum likelihood estimation of logistic regression parameters under two-phase, outcome-dependent sampling. J Royal Stat Soc B. 1997b;59:447–461.

Publication types