Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012 Jul:43 Suppl 1:19-35.
doi: 10.1111/j.1365-2052.2012.02326.x.

On measures of association among genetic variables

Affiliations
Free PMC article

On measures of association among genetic variables

Daniel Gianola et al. Anim Genet. 2012 Jul.
Free PMC article

Abstract

Systems involving many variables are important in population and quantitative genetics, for example, in multi-trait prediction of breeding values and in exploration of multi-locus associations. We studied departures of the joint distribution of sets of genetic variables from independence. New measures of association based on notions of statistical distance between distributions are presented. These are more general than correlations, which are pairwise measures, and lack a clear interpretation beyond the bivariate normal distribution. Our measures are based on logarithmic (Kullback-Leibler) and on relative 'distances' between distributions. Indexes of association are developed and illustrated for quantitative genetics settings in which the joint distribution of the variables is either multivariate normal or multivariate-t, and we show how the indexes can be used to study linkage disequilibrium in a two-locus system with multiple alleles and present applications to systems of correlated beta distributions. Two multivariate beta and multivariate beta-binomial processes are examined, and new distributions are introduced: the GMS-Sarmanov multivariate beta and its beta-binomial counterpart.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Measures of association of two bivariate Gaussian variables as a function of their correlation (ρ). The straight lines give the strength of the association as measured by the absolute value of ρ. The dotted (‘holds water’: θ) and dashed (‘spills water’: 1 − θ) lines depict the relative contributions to the Kullback-Leibler distance due to discrepancies under independence and dependence models, respectively. Values of the association measure γ=2θ − 1 are represented by the dark solid line.
Figure 2
Figure 2
Indexes of association as a function of the coefficient of correlation in an equi-correlated trivariate normal distribution. The thick solid line gives the relative Kullback-Leibler discrepancy between distributions when independence holds (θ), and the dashed line gives the relative discrepancy when association is true (1 − θ). The thin line gives the trajectory of γ=2θ − 1.
Figure 3
Figure 3
Discrepancies from independence to association (θ, solid line), and from association to independence (1 − θ, dashed line) as a fraction of the Kullback-Leibler distance between two tetravariate normal distributions. The straight lines give the absolute values of the correlation ρ.
Figure 4
Figure 4
Scatter diagrams of 5000 samples from each of four Olkin-Liu bivariate beta distributions. Plot 1 illustrates a strong association with essentially no correlation. Plot 2 (‘meteorite’) depicts a limitation of the correlation as a parameter for describing association. Plot 3 suggests association clearly. Plot 4 shows a bivariate distribution that is not trivial: the true correlation (0.46) arises primarily due to weaker association in the ‘middle’ of the bivariate sampling space.
Figure 5
Figure 5
Plots of the densities of four Lee-Sarmanov bivariate beta distributions having the same Beta (2,2) marginal distributions but differing in the strength of association: as the correlation increases multi-modality emerges.

Similar articles

Cited by

References

    1. Agresti A. Categorical Data Analysis. New York: Wiley; 2002.
    1. Akey JM. Constructing genomic maps of positive selection in humans: where do we go from here? Genome Research. 2009;19:711–22. - PMC - PubMed
    1. Akey JM, Zhang G, Zhang K, Jin L, Shriver MD. Interrogating a high-density SNP map for signatures of natural selection. Genome Research. 2002;12:1805–14. - PMC - PubMed
    1. Balding DJ. Likelihood-based inference for genetic correlations. Theoretical Population Biology. 2003;63:221–30. - PubMed
    1. Beaumont MA, Balding DJ. Identifying adaptive genetic divergence among populations from genome scans. Molecular Ecology. 2004;13:969–80. - PubMed

Publication types