Sci Data. 2017 Sep 21;4:170115.
doi: 10.1038/sdata.2017.115.

Whole genome characterization of sequence diversity of 15,220 Icelanders

Hákon Jónsson et al. Sci Data.

Abstract

Understanding sequence diversity is the cornerstone of the analysis of genetic disorders, population genetics, and evolutionary biology. Here, we present an update of our sequencing set to 15,220 Icelanders, whom we sequenced to an average genome-wide coverage of 34X. We identified 39,020,168 autosomal variants passing GATK filters: 31,079,378 SNPs and 7,940,790 indels. Calling de novo mutations (DNMs) is a formidable challenge because the false positive rate in sequencing datasets is high relative to the mutation rate. We addressed this issue by using the segregation of alleles in three-generation families. Using this transmission assay, we controlled the false positive rate and identified 108,778 high quality DNMs. Furthermore, we used our extended family structure and read pair tracing of DNMs to a panel of phased SNPs to determine the parent of origin of 42,961 DNMs.
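The idea behind a transmission check in three-generation families can be illustrated with a small sketch. This is a simplified illustration under stated assumptions, not the authors' pipeline: genotypes are coded as alternate-allele counts, a candidate is a variant that is heterozygous in the proband and absent from both parents, and it counts as supported when at least one of the proband's children carries it. The `Candidate` class, the function names, and the example variants are all hypothetical.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Candidate:
    """One variant observed in a three-generation family (hypothetical layout).

    Genotypes are coded as the number of alternate alleles: 0, 1, or 2.
    """
    variant_id: str
    proband: int
    father: int
    mother: int
    children: List[int]  # genotypes of the proband's own children


def is_de_novo_candidate(c: Candidate) -> bool:
    # Heterozygous in the proband, alternate allele absent from both parents.
    return c.proband == 1 and c.father == 0 and c.mother == 0


def is_transmitted(c: Candidate) -> bool:
    # At least one child of the proband carries the alternate allele.
    return any(g >= 1 for g in c.children)


def transmission_supported(candidates: List[Candidate]) -> List[str]:
    # Keep only candidates that look de novo AND segregate to the next generation.
    return [
        c.variant_id
        for c in candidates
        if is_de_novo_candidate(c) and is_transmitted(c)
    ]


# Made-up example data, for illustration only.
candidates = [
    Candidate("dnm1", proband=1, father=0, mother=0, children=[0, 1]),
    Candidate("dnm2", proband=1, father=0, mother=0, children=[0, 0]),
    Candidate("inh1", proband=1, father=1, mother=0, children=[1]),
]
supported = transmission_supported(candidates)
print(supported)
```

A true germline DNM is expected to be passed on to roughly half of the proband's children, whereas a sequencing artifact should not segregate, which is why transmission is informative about false positives. A non-transmitted candidate such as `dnm2` is not necessarily false; it simply lacks this kind of support.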

Conflict of interest statement

The authors declare no competing financial interests.

Figures

Figure 1. Schematic overview of the DNM characterization.
Figure 2. The GAM model predicted response for all DNM candidates.
The red line corresponds to the 0.8 GAM response requirement for the high quality DNMs.
Figure 3. The fraction of discordant DNMs between MZ twins.
We used 91 monozygotic (MZ) twin pairs for the discordance calculation. The discordance fraction was calculated as the fraction of the proband’s high quality DNMs not found in the MZ twin.
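The two quantities named in the captions above, the 0.8 GAM response requirement and the MZ twin discordance fraction, can be sketched for a single twin pair as follows. This is a minimal sketch with assumptions: the scores and variant identifiers are made up, the function names are hypothetical, and treating the requirement as "greater than or equal to 0.8" is a guess, since the caption does not say whether the boundary is inclusive.

```python
GAM_THRESHOLD = 0.8  # response requirement from the Figure 2 caption


def high_quality_dnms(gam_scores, threshold=GAM_THRESHOLD):
    """Return the IDs of candidates whose predicted response meets the threshold.

    Assumption: the boundary is inclusive (>=).
    """
    return {vid for vid, score in gam_scores.items() if score >= threshold}


def discordance_fraction(proband_dnms, twin_variants):
    """Fraction of the proband's high quality DNMs not found in the MZ twin."""
    proband_dnms = set(proband_dnms)
    if not proband_dnms:
        return None  # undefined when the proband has no high quality DNMs
    missing = proband_dnms - set(twin_variants)
    return len(missing) / len(proband_dnms)


# Made-up scores for one proband's DNM candidates.
scores = {"v1": 0.95, "v2": 0.85, "v3": 0.40, "v4": 0.80, "v5": 0.99}
# Made-up set of variants observed in the proband's MZ twin.
twin = {"v1", "v2", "v3", "v5"}

hq = high_quality_dnms(scores)
fraction = discordance_fraction(hq, twin)
print(sorted(hq), fraction)
```

In this toy example four candidates pass the threshold and one of them (`v4`) is absent from the twin, so the discordance fraction is 0.25.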

Dataset use reported in

  • doi: 10.1038/nature24018

Cited by

  • Start codon variant in LAG3 is associated with decreased LAG-3 expression and increased risk of autoimmune thyroid disease.
    Saevarsdottir S, Bjarnadottir K, Markusson T, Berglund J, Olafsdottir TA, Halldorsson GH, Rutsdottir G, Gunnarsdottir K, Arnthorsson AO, Lund SH, Stefansdottir L, Gudmundsson J, Johannesson AJ, Sturluson A, Oddsson A, Halldorsson B, Ludviksson BR, Ferkingstad E, Ivarsdottir EV, Sveinbjornsson G, Grondal G, Masson G, Eldjarn GH, Thorisson GA, Kristjansdottir K, Knowlton KU, Moore KHS, Gudjonsson SA, Rognvaldsson S, Knight S, Nadauld LD, Holm H, Magnusson OT, Sulem P, Gudbjartsson DF, Rafnar T, Thorleifsson G, Melsted P, Norddahl GL, Jonsdottir I, Stefansson K. Nat Commun. 2024 Jul 9;15(1):5748. doi: 10.1038/s41467-024-50007-7. PMID: 38982041 Free PMC article.
  • Loss-of-function variants in ITSN1 confer high risk of Parkinson's disease.
    Skuladottir AT, Tragante V, Sveinbjornsson G, Helgason H, Sturluson A, Bjornsdottir A, Jonsson P, Palmadottir V, Sveinsson OA, Jensson BO, Gudjonsson SA, Ivarsdottir EV, Gisladottir RS, Gunnarsson AF, Walters GB, Jonsdottir GA, Thorgeirsson TE, Bjornsdottir G, Holm H, Gudbjartsson DF, Sulem P, Stefansson H, Stefansson K. NPJ Parkinsons Dis. 2024 Aug 15;10(1):140. doi: 10.1038/s41531-024-00752-9. PMID: 39147844 Free PMC article.
  • Genetics and epidemiology of mutational barcode-defined clonal hematopoiesis.
    Stacey SN, Zink F, Halldorsson GH, Stefansdottir L, Gudjonsson SA, Einarsson G, Hjörleifsson G, Eiriksdottir T, Helgadottir A, Björnsdottir G, Thorgeirsson TE, Olafsdottir TA, Jonsdottir I, Gretarsdottir S, Tragante V, Magnusson MK, Jonsson H, Gudmundsson J, Olafsson S, Holm H, Gudbjartsson DF, Sulem P, Helgason A, Thorsteinsdottir U, Tryggvadottir L, Rafnar T, Melsted P, Ulfarsson MÖ, Vidarsson B, Thorleifsson G, Stefansson K. Nat Genet. 2023 Dec;55(12):2149-2159. doi: 10.1038/s41588-023-01555-z. Epub 2023 Nov 6. PMID: 37932435 Free PMC article.
  • PopDel identifies medium-size deletions simultaneously in tens of thousands of genomes.
    Niehus S, Jónsson H, Schönberger J, Björnsson E, Beyter D, Eggertsson HP, Sulem P, Stefánsson K, Halldórsson BV, Kehr B. Nat Commun. 2021 Feb 1;12(1):730. doi: 10.1038/s41467-020-20850-5. PMID: 33526789 Free PMC article.
  • A genome-wide meta-analysis yields 46 new loci associating with biomarkers of iron homeostasis.
    Bell S, Rigas AS, Magnusson MK, Ferkingstad E, Allara E, Bjornsdottir G, Ramond A, Sørensen E, Halldorsson GH, Paul DS, Burgdorf KS, Eggertsson HP, Howson JMM, Thørner LW, Kristmundsdottir S, Astle WJ, Erikstrup C, Sigurdsson JK, Vuckovic D, Dinh KM, Tragante V, Surendran P, Pedersen OB, Vidarsson B, Jiang T, Paarup HM, Onundarson PT, Akbari P, Nielsen KR, Lund SH, Juliusson K, Magnusson MI, Frigge ML, Oddsson A, Olafsson I, Kaptoge S, Hjalgrim H, Runarsson G, Wood AM, Jonsdottir I, Hansen TF, Sigurdardottir O, Stefansson H, Rye D; DBDS Genomic Consortium; Peters JE, Westergaard D, Holm H, Soranzo N, Banasik K, Thorleifsson G, Ouwehand WH, Thorsteinsdottir U, Roberts DJ, Sulem P, Butterworth AS, Gudbjartsson DF, Danesh J, Brunak S, Di Angelantonio E, Ullum H, Stefansson K. Commun Biol. 2021 Feb 3;4(1):156. doi: 10.1038/s42003-020-01575-z. PMID: 33536631 Free PMC article. Review.

References

Data Citations

    1. 2017. European Variation Archive. PRJEB15197
    2. 2017. European Variation Archive. PRJEB21300
