Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2009 Nov;33(7):628-36.
doi: 10.1002/gepi.20414.

The sumLINK statistic for genetic linkage analysis in the presence of heterogeneity

Affiliations

The sumLINK statistic for genetic linkage analysis in the presence of heterogeneity

G B Christensen et al. Genet Epidemiol. 2009 Nov.

Abstract

We present the "sumLINK" statistic--the sum of multipoint LOD scores for the subset of pedigrees with nominally significant linkage evidence at a given locus--as an alternative to common methods to identify susceptibility loci in the presence of heterogeneity. We also suggest the "sumLOD" statistic (the sum of positive multipoint LOD scores) as a companion to the sumLINK. sumLINK analysis identifies genetic regions of extreme consistency across pedigrees without regard to negative evidence from unlinked or uninformative pedigrees. Significance is determined by an innovative permutation procedure based on genome shuffling that randomizes linkage information across pedigrees. This procedure for generating the empirical null distribution may be useful for other linkage-based statistics as well. Using 500 genome-wide analyses of simulated null data, we show that the genome shuffling procedure results in the correct type 1 error rates for both the sumLINK and sumLOD. The power of the statistics was tested using 100 sets of simulated genome-wide data from the alternative hypothesis from GAW13. Finally, we illustrate the statistics in an analysis of 190 aggressive prostate cancer pedigrees from the International Consortium for Prostate Cancer Genetics, where we identified a new susceptibility locus. We propose that the sumLINK and sumLOD are ideal for collaborative projects and meta-analyses, as they do not require any sharing of identifiable data between contributing institutions. Further, loci identified with the sumLINK have good potential for gene localization via statistical recombinant mapping, as, by definition, several linked pedigrees contribute to each peak.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Shuffling procedure for creating null distribution. A) Raw test statistic is calculated across un-shuffled data at regular intervals throughout the genome. Figure shows five pedigrees (rows) and four chromosomes (columns). B) Chromosomes are connected end-to-end in random order within each pedigree, and the resulting loop is broken at a random location to create a new starting point. C) The new starting points are aligned, and null statistics can be calculated along the shuffled genome.
Figure 2
Figure 2
Simplistic illustration of the calculation of the sumLINK statistic. The linkage evidence for three pedigrees (broken lines) are shown across 6 loci. The sumLINK calculated from these three pedigrees in shown by the heavy black line. Pedigree LOD scores that are marked with a diamond meet the threshold for inclusion and these are summed to produce the sumLINK. Pedigree LOD scores marked with a cross do not meet the threshold for inclusion and do not contribute to the sumLINK.
Figure 3
Figure 3
Genome-wide multipoint sumLINK results (dominant model) and sumLOD results (recessive model) for the ICPCG aggressive prostate cancer data.
Figure 4
Figure 4
LOD traces for each pedigree contributing to the linkage results on chromosomes A) 20-dominant, B) 11-recessive, and C) 2-dominant. Black bars indicate the two-recombinant localization regions.

Similar articles

Cited by

References

    1. Abecasis G, Cherny S, Cookson W, Cardon L. Merlin-rapid analysis of dense genetic maps using sparse gene flow trees. Nat Genet. 2002;30:97–101. - PubMed
    1. Almasy L, Amos CI, Bailey-Wilson JE, Cantor RM, Jaquish CE, Martinez M, Neuman RJ, Olson JM, Palmer LJ, Rich SM, et al. Genetic Anlaysis Workshop 13: Analysis of Longitudinal Family Data for Complex Diseases and Related Risk Factors. BMC Genetics. 2003;4(Suppl 1):S1. - PMC - PubMed
    1. Camp NJ, Farnham JM, Allen-Brady K, Cannon-Albright LA. Statistical recombinant mapping in extended high-risk Utah pedigrees narrows the 8q24 prostate cancer locus to 2.0 Mb. Prostate. 2007;67(13):1456–64. - PubMed
    1. Camp NJ, Farnham JM, Cannon-Albright LA. Localization of a prostate cancer predisposition gene to an 880-kb region on chromosome 22q12.3 in Utah high-risk pedigrees. Cancer Research. 2006;66(20):10205–12. - PubMed
    1. Camp NJ, Hopkins PN, Hasstedt SJ, Coon H, Malhotra A, Cawthon RM, Hunt SC. Genome-Wide Multipoint Parametric Linkage Analysis of Pulse Pressure in Large, Extended Utah Pedigrees. Hypertension. 2003;43:322–328. - PubMed

Publication types