Nat Methods. 2016 Jul;13(7):587-90.
doi: 10.1038/nmeth.3865. Epub 2016 May 9.

A hybrid approach for de novo human genome sequence assembly and phasing

Yulia Mostovoy et al. Nat Methods. 2016 Jul.

Abstract

Despite tremendous progress in genome sequencing, the basic goal of producing a phased (haplotype-resolved) genome sequence with end-to-end contiguity for each chromosome at reasonable cost and effort is still unrealized. In this study, we describe an approach to performing de novo genome assembly and experimental phasing by integrating the data from Illumina short-read sequencing, 10X Genomics linked-read sequencing, and BioNano Genomics genome mapping to yield a high-quality, phased, de novo assembled human genome.

Conflict of interest statement

Competing Financial Interests Statement

E.T.L., A.R.H., J. Lee, Ž. DŽ., and H.C. are employees of BioNano Genomics. P.M., K.G., and M.S.L. are employees of 10X Genomics, and P.Y.K. is on the scientific advisory board of BioNano Genomics.

Figures

Figure 1. Flowchart depicting genome sequence assembly strategy.
Figure 2. Schematic from the UCSC Genome Browser showing the relative sizes of scaffolds produced during each step of the assembly process, as well as haplotype blocks, for the hybrid scaffold (64 Mb) aligned to the q arm of reference chromosome X.
(a) Assembly based on short-read Illumina data filtered for scaffolds longer than 3 kb; (b) the short-read assembly scaffolded together using barcode information from 10XG data; (c) assembled BNG genome maps; (d) hybrid scaffold produced by merging b and c; (e) barcode-based haplotype blocks for this region; (f) dot plot of the region against reference genome hg38.
Figure 3. Alignment and phasing of the hybrid assembly.
(a) Ideograms of the hybrid scaffold assembly aligned to the reference genome hg38, with each colored block representing an assembled scaffold. (b) A 23-Mb phase block (super scaffold 259, aligned to Chr 3, 50-73 Mb) at increasing resolution showing the alleles on the two haplotypes (green vertical line: assembly allele; grey vertical line: alternate allele). Where a green or grey vertical line is not matched with a corresponding mark, the allele is indeterminate on that haplotype.

