Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Sep 21:4:170115.
doi: 10.1038/sdata.2017.115.

Whole genome characterization of sequence diversity of 15,220 Icelanders

Affiliations

Whole genome characterization of sequence diversity of 15,220 Icelanders

Hákon Jónsson et al. Sci Data. .

Abstract

Understanding of sequence diversity is the cornerstone of analysis of genetic disorders, population genetics, and evolutionary biology. Here, we present an update of our sequencing set to 15,220 Icelanders who we sequenced to an average genome-wide coverage of 34X. We identified 39,020,168 autosomal variants passing GATK filters: 31,079,378 SNPs and 7,940,790 indels. Calling de novo mutations (DNMs) is a formidable challenge given the high false positive rate in sequencing datasets relative to the mutation rate. Here we addressed this issue by using segregation of alleles in three-generation families. Using this transmission assay, we controlled the false positive rate and identified 108,778 high quality DNMs. Furthermore, we used our extended family structure and read pair tracing of DNMs to a panel of phased SNPs, to determine the parent of origin of 42,961 DNMs.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing financial interests.

Figures

Figure 1
Figure 1. Schematic overview of the DNM characterization.
Figure 2
Figure 2. The GAM model predicted response for all DNM candidates.
The red line corresponds to the 0.8 GAM response requirement for the high quality DNMs.
Figure 3
Figure 3. The fraction of discordant DNMs between MZ twins.
There were used 91 monozygotic twin pairs for the discordance calculation. The discordance fraction was calculated as the fraction of the proband’s high quality DNMs not found in the MZ twin.

Dataset use reported in

  • doi: 10.1038/nature24018

Similar articles

Cited by

  • The genetic architecture of age-related hearing impairment revealed by genome-wide association analysis.
    Ivarsdottir EV, Holm H, Benonisdottir S, Olafsdottir T, Sveinbjornsson G, Thorleifsson G, Eggertsson HP, Halldorsson GH, Hjorleifsson KE, Melsted P, Gylfason A, Arnadottir GA, Oddsson A, Jensson BO, Jonasdottir A, Jonasdottir A, Juliusdottir T, Stefansdottir L, Tragante V, Halldorsson BV, Petersen H, Thorgeirsson G, Thorsteinsdottir U, Sulem P, Hinriksdottir I, Jonsdottir I, Gudbjartsson DF, Stefansson K. Ivarsdottir EV, et al. Commun Biol. 2021 Jun 9;4(1):706. doi: 10.1038/s42003-021-02224-9. Commun Biol. 2021. PMID: 34108613 Free PMC article.
  • GraphTyper2 enables population-scale genotyping of structural variation using pangenome graphs.
    Eggertsson HP, Kristmundsdottir S, Beyter D, Jonsson H, Skuladottir A, Hardarson MT, Gudbjartsson DF, Stefansson K, Halldorsson BV, Melsted P. Eggertsson HP, et al. Nat Commun. 2019 Nov 27;10(1):5402. doi: 10.1038/s41467-019-13341-9. Nat Commun. 2019. PMID: 31776332 Free PMC article.
  • Rare SLC13A1 variants associate with intervertebral disc disorder highlighting role of sulfate in disc pathology.
    Bjornsdottir G, Stefansdottir L, Thorleifsson G, Sulem P, Norland K, Ferkingstad E, Oddsson A, Zink F, Lund SH, Nawaz MS, Bragi Walters G, Skuladottir AT, Gudjonsson SA, Einarsson G, Halldorsson GH, Bjarnadottir V, Sveinbjornsson G, Helgadottir A, Styrkarsdottir U, Gudmundsson LJ, Pedersen OB, Hansen TF, Werge T, Banasik K, Troelsen A, Skou ST, Thørner LW, Erikstrup C, Nielsen KR, Mikkelsen S; DBDS Genetic Consortium; GO Consortium; Jonsdottir I, Bjornsson A, Olafsson IH, Ulfarsson E, Blondal J, Vikingsson A, Brunak S, Ostrowski SR, Ullum H, Thorsteinsdottir U, Stefansson H, Gudbjartsson DF, Thorgeirsson TE, Stefansson K. Bjornsdottir G, et al. Nat Commun. 2022 Feb 2;13(1):634. doi: 10.1038/s41467-022-28167-1. Nat Commun. 2022. PMID: 35110524 Free PMC article.
  • DanMAC5: a browser of aggregated sequence variants from 8,671 whole genome sequenced Danish individuals.
    Banasik K, Møller PL, Techlo TR, Holm PC, Walters GB, Ingason A, Rosengren A, Rohde PD, Kogelman LJA, Westergaard D, Siggaard T, Chmura PJ, Chalmer MA, Magnússon ÓÞ, Þórisson GÁ, Stefánsson H, Guðbjartsson DF, Stefánsson K, Olesen J, Winther S, Bøttcher M, Brunak S, Werge T, Nyegaard M, Hansen TF. Banasik K, et al. BMC Genom Data. 2023 May 27;24(1):30. doi: 10.1186/s12863-023-01132-7. BMC Genom Data. 2023. PMID: 37244984 Free PMC article.
  • GWAS of bone size yields twelve loci that also affect height, BMD, osteoarthritis or fractures.
    Styrkarsdottir U, Stefansson OA, Gunnarsdottir K, Thorleifsson G, Lund SH, Stefansdottir L, Juliusson K, Agustsdottir AB, Zink F, Halldorsson GH, Ivarsdottir EV, Benonisdottir S, Jonsson H, Gylfason A, Norland K, Trajanoska K, Boer CG, Southam L, Leung JCS, Tang NLS, Kwok TCY, Lee JSW, Ho SC, Byrjalsen I, Center JR, Lee SH, Koh JM, Lohmander LS, Ho-Pham LT, Nguyen TV, Eisman JA, Woo J, Leung PC, Loughlin J, Zeggini E, Christiansen C, Rivadeneira F, van Meurs J, Uitterlinden AG, Mogensen B, Jonsson H, Ingvarsson T, Sigurdsson G, Benediktsson R, Sulem P, Jonsdottir I, Masson G, Holm H, Norddahl GL, Thorsteinsdottir U, Gudbjartsson DF, Stefansson K. Styrkarsdottir U, et al. Nat Commun. 2019 May 3;10(1):2054. doi: 10.1038/s41467-019-09860-0. Nat Commun. 2019. PMID: 31053729 Free PMC article.

References

Data Citations

    1. 2017. European Variation Archive. PRJEB15197
    1. 2017. European Variation Archive. PRJEB21300

References

    1. Gudbjartsson D. F. et al. Large-scale whole-genome sequencing of the Icelandic population. Nat. Genet. 47, 435–444 (2015). - PubMed
    1. The 1000 Genomes Project Consortium. A global reference for human genetic variation. Nature 526, 68–74 (2015). - PMC - PubMed
    1. The UK10K Consortium. The UK10K project identifies rare variants in health and disease. Nature 526, 82–90 (2015). - PMC - PubMed
    1. Genome T. Whole-genome sequence variation, population structure and demographic history of the Dutch population. Nat. Genet. 46, 818–825 (2014). - PubMed
    1. Gurdasani D. et al. The African Genome Variation Project shapes medical genetics in Africa. Nature 517, 327–332 (2014). - PMC - PubMed

LinkOut - more resources