Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Feb 21:13:781740.
doi: 10.3389/fgene.2022.781740. eCollection 2022.

Gene Region Association Analysis of Longitudinal Quantitative Traits Based on a Function-On-Function Regression Model

Affiliations

Gene Region Association Analysis of Longitudinal Quantitative Traits Based on a Function-On-Function Regression Model

Shijing Li et al. Front Genet. .

Abstract

In the process of growth and development in life, gene expressions that control quantitative traits will turn on or off with time. Studies of longitudinal traits are of great significance in revealing the genetic mechanism of biological development. With the development of ultra-high-density sequencing technology, the associated analysis has tremendous challenges to statistical methods. In this paper, a longitudinal functional data association test (LFDAT) method is proposed based on the function-on-function regression model. LFDAT can simultaneously treat phenotypic traits and marker information as continuum variables and analyze the association of longitudinal quantitative traits and gene regions. Simulation studies showed that: 1) LFDAT performs well for both linkage equilibrium simulation and linkage disequilibrium simulation, 2) LFDAT has better performance for gene regions (include common variants, low-frequency variants, rare variants and mixture), and 3) LFDAT can accurately identify gene switching in the growth and development stage. The longitudinal data of the Oryza sativa projected shoot area is analyzed by LFDAT. It showed that there is the advantage of quick calculations. Further, an association analysis was conducted between longitudinal traits and gene regions by integrating the micro effects of multiple related variants and using the information of the entire gene region. LFDAT provides a feasible method for studying the formation and expression of longitudinal traits.

Keywords: association testing; functional data analysis; gene region; longitudinal traits; rare variants.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

FIGURE 1
FIGURE 1
Research status on quantitative traits and genetic variants.
FIGURE 2
FIGURE 2
Power of linkage equilibrium’s case one and case two based on LFDAT for the five gene regions when c is 3, and sample size is 2000. The (A–C) denotes the power results of case one. The (D–F) denotes the power results of case two. The time effect function is θ(t)=2+2sin(πt/12) for case one, and θ(t)=2+2sin(πt/2) for case two. (A) Proportion of causal variants is 1% (B) Proportion of causal variants is 2% (C) Proportion of causal variants is 4%. (D) Proportion of causal variants is 1% (E) Proportion of causal variants is 2% (F) Proportion of causal variants is 4%. Note: Common region denotes gene regions only with common variants, Rare region denotes gene regions only with rare variants, Low region denotes gene regions only with low-frequency variants, Mixture region one denotes gene regions with 20% of common variants and 80% of rare variants, and the Mixture region two denotes gene regions with 80% of common variants and 20% of rare variants.
FIGURE 3
FIGURE 3
Power of linkage disequilibrium’s case one and case two based on LFDAT for the five gene regions when c is 3, and sample size is 2000. The (A–C) denotes the power results of case one. The (D–F) denotes the power results of case two. The time effect function is θ(t)=2+2sin(πt/12) for case one, and θ(t)=2+2sin(πt/2) for case two. Case one: (A) Proportion of causal variants is 1% (B) Proportion of causal variants is 2% (C) Proportion of causal variants is 4%. Case two: (D) Proportion of causal variants is 1% (E) Proportion of causal variants is 2% (F) Proportion of causal variants is 4%. Note: The r 2 measure of linkage disequilibrium is 0.25 to 0.64; Common region denotes gene regions only with common variants, Rare region denotes gene regions only with rare variants, Low region denotes gene regions only with low-frequency variants, Mixture region one denotes gene regions with 20% of common variants and 80% of rare variants, and the Mixture region two denotes gene regions with 80% of common variants and 20% of rare variants.
FIGURE 4
FIGURE 4
The shoot biomass development trajectory of 350 samples. The solid gray line represents the trajectory curve of 350 samples, and the solid black line represents the average trajectory curve.

Similar articles

Cited by

References

    1. Belonogova N. M., Svishcheva G. R., Wilson J. F., Campbell H., Axenovich T. I. (2018). Weighted Functional Linear Regression Models for Gene-Based Association Analysis. PLoS ONE 13, e0190486. 10.1371/journal.pone.0190486 - DOI - PMC - PubMed
    1. Beyene J., Hamid J. S. (2014). Longitudinal Data Analysis in Genome-Wide Association Studies. Genet. Epidemiol. 38, S68–S73. 10.1002/gepi.21828 - DOI - PubMed
    1. Campbell M. T., Knecht A. C., Berger B., Brien C. J., Wang D., Walia H. (2015). Integrating Image-Based Phenomics and Association Analysis to Dissect the Genetic Architecture of Temporal Salinity Responses in rice. Plant Physiol. 168, 1476–1489. 10.1104/pp.15.00450 - DOI - PMC - PubMed
    1. Campbell M., Walia H., Morota G. (2018). Utilizing Random Regression Models for Genomic Prediction of a Longitudinal Trait Derived from High‐Throughput Phenotyping. Plant Direct 2, 1–11. 10.1002/pld3.80 - DOI - PMC - PubMed
    1. Cao H., Li Z., Yang H., Cui Y., Zhang Y. (2017). Longitudinal Data Analysis for Rare Variants Detection with Penalized Quadratic Inference Function. Sci. Rep. 7, 650. 10.1038/s41598-017-00712-9 - DOI - PMC - PubMed

LinkOut - more resources