Whole genome characterization of sequence diversity of 15,220 Icelanders
- PMID: 28933420
- PMCID: PMC5607473
- DOI: 10.1038/sdata.2017.115
Whole genome characterization of sequence diversity of 15,220 Icelanders
Abstract
Understanding of sequence diversity is the cornerstone of analysis of genetic disorders, population genetics, and evolutionary biology. Here, we present an update of our sequencing set to 15,220 Icelanders who we sequenced to an average genome-wide coverage of 34X. We identified 39,020,168 autosomal variants passing GATK filters: 31,079,378 SNPs and 7,940,790 indels. Calling de novo mutations (DNMs) is a formidable challenge given the high false positive rate in sequencing datasets relative to the mutation rate. Here we addressed this issue by using segregation of alleles in three-generation families. Using this transmission assay, we controlled the false positive rate and identified 108,778 high quality DNMs. Furthermore, we used our extended family structure and read pair tracing of DNMs to a panel of phased SNPs, to determine the parent of origin of 42,961 DNMs.
Conflict of interest statement
The authors declare no competing financial interests.
Figures



Dataset use reported in
- doi: 10.1038/nature24018
Similar articles
-
Parental influence on human germline de novo mutations in 1,548 trios from Iceland.Nature. 2017 Sep 28;549(7673):519-522. doi: 10.1038/nature24018. Epub 2017 Sep 20. Nature. 2017. PMID: 28959963
-
Sequence variants from whole genome sequencing a large group of Icelanders.Sci Data. 2015 Mar 25;2:150011. doi: 10.1038/sdata.2015.11. eCollection 2015. Sci Data. 2015. PMID: 25977816 Free PMC article.
-
Comprehensive de novo mutation discovery with HiFi long-read sequencing.Genome Med. 2023 May 8;15(1):34. doi: 10.1186/s13073-023-01183-6. Genome Med. 2023. PMID: 37158973 Free PMC article.
-
mirTrios: an integrated pipeline for detection of de novo and rare inherited mutations from trios-based next-generation sequencing.J Med Genet. 2015 Apr;52(4):275-81. doi: 10.1136/jmedgenet-2014-102656. Epub 2015 Jan 16. J Med Genet. 2015. PMID: 25596308
-
Complete genome phasing of family quartet by combination of genetic, physical and population-based phasing analysis.PLoS One. 2013 May 31;8(5):e64571. doi: 10.1371/journal.pone.0064571. Print 2013. PLoS One. 2013. PMID: 23741343 Free PMC article.
Cited by
-
Start codon variant in LAG3 is associated with decreased LAG-3 expression and increased risk of autoimmune thyroid disease.Nat Commun. 2024 Jul 9;15(1):5748. doi: 10.1038/s41467-024-50007-7. Nat Commun. 2024. PMID: 38982041 Free PMC article.
-
Loss-of-function variants in ITSN1 confer high risk of Parkinson's disease.NPJ Parkinsons Dis. 2024 Aug 15;10(1):140. doi: 10.1038/s41531-024-00752-9. NPJ Parkinsons Dis. 2024. PMID: 39147844 Free PMC article.
-
Genetics and epidemiology of mutational barcode-defined clonal hematopoiesis.Nat Genet. 2023 Dec;55(12):2149-2159. doi: 10.1038/s41588-023-01555-z. Epub 2023 Nov 6. Nat Genet. 2023. PMID: 37932435 Free PMC article.
-
PopDel identifies medium-size deletions simultaneously in tens of thousands of genomes.Nat Commun. 2021 Feb 1;12(1):730. doi: 10.1038/s41467-020-20850-5. Nat Commun. 2021. PMID: 33526789 Free PMC article.
-
A genome-wide meta-analysis yields 46 new loci associating with biomarkers of iron homeostasis.Commun Biol. 2021 Feb 3;4(1):156. doi: 10.1038/s42003-020-01575-z. Commun Biol. 2021. PMID: 33536631 Free PMC article. Review.
References
Data Citations
-
- 2017. European Variation Archive. PRJEB15197
-
- 2017. European Variation Archive. PRJEB21300
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources