Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Aug 3;548(7665):87-91.
doi: 10.1038/nature23264. Epub 2017 Jul 26.

Sequencing and de novo assembly of 150 genomes from Denmark as a population reference

Lasse Maretty  1 Jacob Malte Jensen  2   3 Bent Petersen  4 Jonas Andreas Sibbesen  1 Siyang Liu  1   5 Palle Villesen  2   3   6 Laurits Skov  2   3 Kirstine Belling  4 Christian Theil Have  7 Jose M G Izarzugaza  4 Marie Grosjean  4 Jette Bork-Jensen  7 Jakob Grove  3   8   9 Thomas D Als  3   8   9 Shujia Huang  10   11 Yuqi Chang  10 Ruiqi Xu  5 Weijian Ye  5 Junhua Rao  5 Xiaosen Guo  10   12 Jihua Sun  5   7 Hongzhi Cao  10 Chen Ye  10 Johan van Beusekom  4 Thomas Espeseth  13   14 Esben Flindt  12 Rune M Friborg  2   3 Anders E Halager  2   3 Stephanie Le Hellard  14   15 Christina M Hultman  16 Francesco Lescai  3   8   9 Shengting Li  3   8   9 Ole Lund  4 Peter Løngren  4 Thomas Mailund  2   3 Maria Luisa Matey-Hernandez  4 Ole Mors  3   6   9 Christian N S Pedersen  2   3 Thomas Sicheritz-Pontén  4 Patrick Sullivan  16   17 Ali Syed  4 David Westergaard  4 Rachita Yadav  4 Ning Li  5 Xun Xu  10 Torben Hansen  7 Anders Krogh  1 Lars Bolund  8   10 Thorkild I A Sørensen  7   18   19 Oluf Pedersen  7 Ramneek Gupta  4 Simon Rasmussen  4 Søren Besenbacher  2   6 Anders D Børglum  3   8   9 Jun Wang  3   10   12 Hans Eiberg  20 Karsten Kristiansen  10   12 Søren Brunak  4   21 Mikkel Heide Schierup  2   3   22
Affiliations
Free article

Sequencing and de novo assembly of 150 genomes from Denmark as a population reference

Lasse Maretty et al. Nature. .
Free article

Abstract

Hundreds of thousands of human genomes are now being sequenced to characterize genetic variation and use this information to augment association mapping studies of complex disorders and other phenotypic traits. Genetic variation is identified mainly by mapping short reads to the reference genome or by performing local assembly. However, these approaches are biased against discovery of structural variants and variation in the more complex parts of the genome. Hence, large-scale de novo assembly is needed. Here we show that it is possible to construct excellent de novo assemblies from high-coverage sequencing with mate-pair libraries extending up to 20 kilobases. We report de novo assemblies of 150 individuals (50 trios) from the GenomeDenmark project. The quality of these assemblies is similar to those obtained using the more expensive long-read technology. We use the assemblies to identify a rich set of structural variants including many novel insertions and demonstrate how this variant catalogue enables further deciphering of known association mapping signals. We leverage the assemblies to provide 100 completely resolved major histocompatibility complex haplotypes and to resolve major parts of the Y chromosome. Our study provides a regional reference genome that we expect will improve the power of future association mapping studies and hence pave the way for precision medicine initiatives, which now are being launched in many countries including Denmark.

PubMed Disclaimer

References

    1. Genome Res. 2014 Apr;24(4):688-96 - PubMed
    1. Bioinformatics. 2009 Jul 15;25(14):1754-60 - PubMed
    1. Clin Genet. 1989 May;35(5):313-21 - PubMed
    1. Nature. 2015 Oct 1;526(7571):75-81 - PubMed
    1. Nat Genet. 2015 May;47(5):435-44 - PubMed

Publication types