Ultraaccurate genome sequencing and haplotyping of single human cells
- PMID: 29078313
- PMCID: PMC5703283
- DOI: 10.1073/pnas.1707609114
Ultraaccurate genome sequencing and haplotyping of single human cells
Abstract
Accurate detection of variants and long-range haplotypes in genomes of single human cells remains very challenging. Common approaches require extensive in vitro amplification of genomes of individual cells using DNA polymerases and high-throughput short-read DNA sequencing. These approaches have two notable drawbacks. First, polymerase replication errors could generate tens of thousands of false-positive calls per genome. Second, relatively short sequence reads contain little to no haplotype information. Here we report a method, which is dubbed SISSOR (single-stranded sequencing using microfluidic reactors), for accurate single-cell genome sequencing and haplotyping. A microfluidic processor is used to separate the Watson and Crick strands of the double-stranded chromosomal DNA in a single cell and to randomly partition megabase-size DNA strands into multiple nanoliter compartments for amplification and construction of barcoded libraries for sequencing. The separation and partitioning of large single-stranded DNA fragments of the homologous chromosome pairs allows for the independent sequencing of each of the complementary and homologous strands. This enables the assembly of long haplotypes and reduction of sequence errors by using the redundant sequence information and haplotype-based error removal. We demonstrated the ability to sequence single-cell genomes with error rates as low as 10-8 and average 500-kb-long DNA fragments that can be assembled into haplotype contigs with N50 greater than 7 Mb. The performance could be further improved with more uniform amplification and more accurate sequence alignment. The ability to obtain accurate genome sequences and haplotype information from single cells will enable applications of genome sequencing for diverse clinical needs.
Keywords: haplotyping; microfluidics; mutation detection; single-cell sequencing.
Copyright © 2017 the Author(s). Published by PNAS.
Conflict of interest statement
Conflict of interest statement: X.H. and K.Z. are listed as inventors for a patent application related to the method disclosed in this manuscript. K.Z. is a cofounder and equity holder of Singlera Genomics Inc. V. Bafna is a cofounder, has an equity interest, and receives income from Digital Proteomics, LLC. The terms of this arrangement have been reviewed and approved by the University of California, San Diego in accordance with its conflict of interest policies. Digital Proteomics, LLC was not involved in the research presented here.
Figures




Comment in
-
Haplotype resolution at the single-cell level.Proc Natl Acad Sci U S A. 2017 Nov 21;114(47):12362-12364. doi: 10.1073/pnas.1717798114. Epub 2017 Nov 7. Proc Natl Acad Sci U S A. 2017. PMID: 29114045 Free PMC article. No abstract available.
References
-
- Ke X, Taylor MS, Cardon LR. Singleton SNPs in the human genome and implications for genome-wide association studies. Eur J Hum Genet. 2008;16:506–515. - PubMed
-
- Collins FS, Brooks LD, Chakravarti A. A DNA polymorphism discovery resource for research on human genetic variation. Genome Res. 1998;8:1229–1231. - PubMed
-
- Wheeler DA, et al. The complete genome of an individual by massively parallel DNA sequencing. Nature. 2008;452:872–876. - PubMed
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous