Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Jun 23:8:15927.
doi: 10.1038/ncomms15927.

Enrichment of low-frequency functional variants revealed by whole-genome sequencing of multiple isolated European populations

Affiliations

Enrichment of low-frequency functional variants revealed by whole-genome sequencing of multiple isolated European populations

Yali Xue et al. Nat Commun. .

Abstract

The genetic features of isolated populations can boost power in complex-trait association studies, and an in-depth understanding of how their genetic variation has been shaped by their demographic history can help leverage these advantageous characteristics. Here, we perform a comprehensive investigation using 3,059 newly generated low-depth whole-genome sequences from eight European isolates and two matched general populations, together with published data from the 1000 Genomes Project and UK10K. Sequencing data give deeper and richer insights into population demography and genetic characteristics than genotype-chip data, distinguishing related populations more effectively and allowing their functional variants to be studied more fully. We demonstrate relaxation of purifying selection in the isolates, leading to enrichment of rare and low-frequency functional variants, using novel statistics, DVxy and SVxy. We also develop an isolation-index (Isx) that predicts the overall level of such key genetic characteristics and can thus help guide population choice in future complex-trait association studies.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing financial interests.

Figures

Figure 1
Figure 1. General characteristics and demographic history of isolated and matched general populations.
(a) Geographical locations of samples. The base map was plotted in R using the mapdata package and circles were added using Photoshop. (b) PCA using common variants. (c) PCA using low-frequency variants. (d) Sharing of rare variants within and between populations. Upper left triangle: f2 variants; lower right triangle f3f10 variants. (e) Effective population size (Ne) inferred from IBDNe for UKO and UKG during the past nine KY. (f) The lowest Ne inferred by IBDNe for all populations for the past three KY, plotted as a function of the time at which it occurred.
Figure 2
Figure 2. Isolation index (Isx) and its correlation with other genetic measures.
(a) Information summarized in Isx. (b) Example of the correlation between Isx and other statistics, here DVxy-coding. (c) Summary of the correlations between Isx and other population-genetic statistics. All the correlation coefficients are high and statistically significant.
Figure 3
Figure 3. Purifying selection in the isolates and general populations.
(a) Rxy-missense statistic in each isolate, showing no evidence for increased genetic load in the isolates. The mean and s.d. for each Rxy value from 100 bootstraps are shown. (b) DVxy-wg (DVxy-whole genome) statistic in isolates and general populations, stratified by CADD score, showing enrichment of highly functional low-frequency variants. (c) DVxy-coding statistic in isolates and general populations, showing enrichment of low-frequency missense variants in isolates. (d) SVxy-missense statistic in each isolate, showing relaxation of purifying selection in isolates in singletons. The s.e.'s for both DVxy and SVxy were calculated by randomly sampling data from 20 chromosomes 100 times. All of these analyses are based on the minimum-sample-size data set (36 individuals from each population).

References

    1. Zeggini E. Using genetically isolated populations to understand the genomic basis of disease. Genome Med. 6, 83 (2014). - PMC - PubMed
    1. Hatzikotoulas K., Gilly A. & Zeggini E. Using population isolates in genetic association studies. Brief Funct. Genomics 13, 371–377 (2014). - PMC - PubMed
    1. Zuk O. et al. Searching for missing heritability: designing rare variant association studies. Proc. Natl Acad. Sci. USA 111, E455–E464 (2014). - PMC - PubMed
    1. Pollin T. I. et al. A null mutation in human APOC3 confers a favorable plasma lipid profile and apparent cardioprotection. Science 322, 1702–1705 (2008). - PMC - PubMed
    1. Gudmundsson J. et al. A study based on whole-genome sequencing yields a rare variant at 8q24 associated with prostate cancer. Nat. Genet. 44, 1326–1329 (2012). - PMC - PubMed

Publication types