High-Fidelity Nanopore Sequencing of Ultra-Short DNA Targets

Anal Chem. 2019 May 21;91(10):6783-6789. doi: 10.1021/acs.analchem.9b00856. Epub 2019 May 10.

Brandon D Wilson¹, Michael Eisenstein²,³, H Tom Soh²,³,⁴

Affiliations

¹ Department of Chemical Engineering, Stanford University, Stanford, California 94305, United States.
² Department of Electrical Engineering, Stanford University, Stanford, California 94305, United States.
³ Department of Radiology, Stanford University, Stanford, California 94305, United States.
⁴ Chan Zuckerberg Biohub, San Francisco, California 94158, United States.

PMID: 31038923
PMCID: PMC6533607
DOI: 10.1021/acs.analchem.9b00856

Brandon D Wilson et al. Anal Chem. 2019.

Abstract

Nanopore sequencing offers a portable and affordable alternative to sequencing-by-synthesis methods but suffers from lower accuracy and cannot sequence ultrashort DNA. This puts applications such as molecular diagnostics based on the analysis of cell-free DNA or single-nucleotide variants (SNVs) out of reach. To overcome these limitations, we report a nanopore-based sequencing strategy in which short target sequences are first circularized and then amplified via rolling-circle amplification to produce long stretches of concatemeric repeats. After sequencing on the Oxford Nanopore Technologies MinION platform, the resulting repeat sequences can be aligned to produce a highly accurate consensus that reduces the high error rate present in the individual repeats. Using this approach, we demonstrate for the first time the ability to obtain unbiased and accurate nanopore data for target DNA sequences <100 bp. Critically, this approach is sensitive enough to achieve SNV discrimination in mixtures of sequences and even enables quantitative detection of specific variants present at ratios of <10%. Our method is simple, cost-effective, and requires only well-established processes. It therefore expands the utility of nanopore sequencing for molecular diagnostics and other applications, especially in resource-limited settings.

Conflict of interest statement

The authors declare no competing financial interest.

Figures

**Figure 1**
Sequencing ultrashort reads on the MinION. (a) (1) Molecular inversion probes (MIPs) anneal adjacent to the target sequence (blue) at anchor site 1 (AS1, orange) and anchor site 2 (AS2, green). Phusion polymerase copies the target sequence into the MIP; the lack of 5′ → 3′ exonuclease activity ensures that extension halts when the polymerase reaches AS2. (2) Ampligase ligates the extended template to the phosphorylated 5′ end of the MIP, generating circular ssDNA. Linear ss- or dsDNA fragments are degraded by a combination of exonuclease I and exonuclease III. (3) The circular DNA is subjected to RCA to generate tandem repeats of the original target, yielding ultralong, concatemerized ssDNA. (4) The RCA product is converted to dsDNA with Taq polymerase and subjected to ONT library preparation. (5) Sequencing reads are collected from a new MinION R9.4 flow-cell run for 24 h. (b) The raw sequences are compiled and analyzed. The identified repeats have poor accuracy in isolation, but since the sequencing errors vary across repeats, they can be aligned together to produce a high-fidelity consensus sequence.

**Figure 2**
Circularization can be performed on target sequences as short as a single nucleotide with reaction efficiency that is independent of target sequence length. After five rounds of temperature cycling and subsequent exonuclease treatment, we achieve consistently efficient circularization for target sequences ranging in length from 1 to 120 nt (lanes 2–8). In this denaturing gel, lane 1 contains a mixture of all the linear ssDNA target sequences. The lengths listed are the lengths of the target region; the full lengths in lane 1 have additional flanking 28- and 23-nt anchor sites, and the full lengths in lanes 2–8 have an additional 102 nt from the MIP. Lane 9 illustrates that no circular DNA is produced in the absence of the target sequence.
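The length bookkeeping in this caption can be written out as a small worked example. This is only a sketch of the arithmetic stated above (28- and 23-nt anchor sites on the linear targets, 102 nt from the MIP on the circles); the function names are hypothetical and not from the paper.

```python
# Lengths taken from the Figure 2 caption.
ANCHOR_SITES_NT = (28, 23)  # flanking anchor sites on the linear ssDNA targets
MIP_NT = 102                # nucleotides the MIP adds to each circular product


def linear_length(target_nt: int) -> int:
    """Full length of a linear target (lane 1): target plus both anchor sites."""
    return target_nt + sum(ANCHOR_SITES_NT)


def circular_length(target_nt: int) -> int:
    """Full length of a circularized product (lanes 2-8): target plus MIP."""
    return target_nt + MIP_NT


# The caption gives 1 nt and 120 nt as the ends of the tested range.
for target in (1, 120):
    print(target, linear_length(target), circular_length(target))
```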

**Figure 3**
Improving read accuracy through repeat-based consensus. (a) Representative consensus sequence generation. A single, base-called read is split into its individual repeats. These repeats are aligned with each other to generate a consensus sequence via a winner-take-all base-calling strategy. Gaps are removed and the consensus sequence is then compared back to the original sequence to assess the postalignment accuracy. (b) Histogram of alignment scores before (gray) and after (red) consensus sequence generation. The “before” alignment score is an average over the alignment scores of all the repeats found within a single raw read. Data includes all reads with more than three identified repeats, regardless of the quality score or pass/fail designation of the MinION software.
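The winner-take-all step described in panel (a) can be illustrated with a minimal sketch. It assumes the repeats have already been split out of the read and aligned to a common width with `-` as the gap character; how the authors split and align repeats is not specified here, and the function name and toy sequences below are hypothetical.

```python
from collections import Counter


def winner_take_all_consensus(aligned_repeats, gap="-"):
    """Majority vote per alignment column over pre-aligned, equal-length repeats."""
    if not aligned_repeats:
        raise ValueError("need at least one aligned repeat")
    width = len(aligned_repeats[0])
    if any(len(r) != width for r in aligned_repeats):
        raise ValueError("aligned repeats must all have the same length")

    consensus = []
    for column in zip(*aligned_repeats):
        # Pick the most frequent symbol in this column. Ties go to the symbol
        # seen first, which is an arbitrary choice made for this sketch.
        symbol, _ = Counter(column).most_common(1)[0]
        consensus.append(symbol)

    # Columns where the gap wins are dropped, mirroring "gaps are removed".
    return "".join(s for s in consensus if s != gap)


# Hypothetical toy alignment: each repeat carries a different error.
repeats = [
    "ACGTTAGC-A",
    "ACGATAGC-A",  # substitution at position 4
    "ACGTTAGCTA",  # insertion near the end
    "AC-TTAGC-A",  # deletion at position 3
]
print(winner_take_all_consensus(repeats))
```

Because the errors fall in different columns, each one is outvoted by the other repeats, which is the effect the caption attributes to consensus generation.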

**Figure 4**
Increased accuracy from alignment of tandem repeats. Plots show normalized Smith-Waterman alignment scores as a function of the number of repeats before (gray) and after (red) alignment. Before consensus sequence generation, alignment score exhibits no dependence on repeat count. Since each “before point” represents an average over all repeats in that read, the observed narrowing arises solely because the increased number of repeats decreases the standard deviation of the average alignment score. After the consensus sequence is generated, the alignment accuracy exhibits a strong dependence on the number of repeats used.
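A normalized Smith-Waterman score of the kind plotted here can be sketched as follows. The scoring parameters (match +1, mismatch -1, linear gap -1) and the normalization by the reference's perfect self-alignment score are assumptions made for illustration; the caption does not state which values or normalization the authors used.

```python
def smith_waterman_score(a, b, match=1, mismatch=-1, gap=-1):
    """Best local alignment score between a and b with a linear gap penalty."""
    prev = [0] * (len(b) + 1)
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # Local alignment: a cell never drops below zero.
            curr[j] = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best


def normalized_score(query, reference, match=1, mismatch=-1, gap=-1):
    """Score divided by the score of the reference aligned to itself (assumed)."""
    if not reference:
        raise ValueError("reference must be non-empty")
    perfect = match * len(reference)
    return smith_waterman_score(query, reference, match, mismatch, gap) / perfect


# Hypothetical sequences: a reference and a repeat with one substitution.
reference = "ACGTTAGCA"
print(normalized_score(reference, reference))
print(normalized_score("ACGATAGCA", reference))
```

Under this normalization a perfect read scores 1.0, so the "before" value for a read would be the mean of this quantity over its repeats and the "after" value would be the same quantity computed on the consensus.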

**Figure 5**
Quantitative analysis with HiFRe. (a) Counting relative molecular abundance for two sequences present in mixtures at different ratios. A linear fit to y = mx + b yielded m = 0.97 ± 0.04 and b = 0.02 ± 0.02 with R² = 0.993. This strong linear relationship results in a limit of detection of 3.3 ± 2.1%. (b) Discrimination and quantitation of SNVs in short DNA sequences. Two sequences differing by three SNVs were mixed together in different ratios, and the plot shows the output ratios recovered after HiFRe analysis. Green bars represent the fraction of the original sequence and yellow bars show the sequence with three SNVs. In both panels, the error bars represent the standard deviation for the mean of two multiplexed sequencing runs.
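The fit to y = mx + b in panel (a) is an ordinary least-squares line, which can be sketched in plain Python. The input and measured fractions below are invented purely for illustration and are not the paper's data. The limit-of-detection formula (3.3 × residual standard deviation / slope) is one common convention and is an assumption here; the caption does not say how its limit of detection was derived.

```python
def fit_line(xs, ys):
    """Least-squares fit of y = m*x + b; returns (m, b, r_squared, residual_sd)."""
    n = len(xs)
    if n != len(ys) or n < 3:
        raise ValueError("need at least three paired points")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    ss_tot = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or ss_tot == 0:
        raise ValueError("x and y must each vary")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    m = sxy / sxx
    b = mean_y - m * mean_x
    ss_res = sum((y - (m * x + b)) ** 2 for x, y in zip(xs, ys))
    r_squared = 1 - ss_res / ss_tot
    residual_sd = (ss_res / (n - 2)) ** 0.5  # two fitted parameters
    return m, b, r_squared, residual_sd


def limit_of_detection(residual_sd, slope, k=3.3):
    """Assumed convention: k * residual SD / slope, in the units of x."""
    return k * residual_sd / slope


# Hypothetical mixing experiment: known input fraction vs. measured fraction.
input_fraction = [0.0, 0.1, 0.25, 0.5, 0.75, 0.9, 1.0]
measured_fraction = [0.02, 0.11, 0.27, 0.49, 0.76, 0.88, 0.99]

m, b, r2, sd = fit_line(input_fraction, measured_fraction)
print(f"m={m:.3f} b={b:.3f} R^2={r2:.4f} LoD={limit_of_detection(sd, m):.3f}")
```

A slope near 1 and an intercept near 0 mean the measured fraction tracks the input fraction directly, which is what makes the read counts usable for quantitation.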

Cited by

Rapid in situ identification of biological specimens via DNA amplicon sequencing using miniaturized laboratory equipment.
Pomerantz A, Sahlin K, Vasiljevic N, Seah A, Lim M, Humble E, Kennedy S, Krehenwinkel H, Winter S, Ogden R, Prost S. Nat Protoc. 2022 Jun;17(6):1415-1443. doi: 10.1038/s41596-022-00682-x. Epub 2022 Apr 11. PMID: 35411044. Review.
Opportunities and challenges in long-read sequencing data analysis.
Amarasinghe SL, Su S, Dong X, Zappia L, Ritchie ME, Gouil Q. Genome Biol. 2020 Feb 7;21(1):30. doi: 10.1186/s13059-020-1935-5. PMID: 32033565. Free PMC article. Review.
In vivo hypermutation and continuous evolution.
Molina RS, Rix G, Mengiste AA, Alvarez B, Seo D, Chen H, Hurtado J, Zhang Q, Donato García-García J, Heins ZJ, Almhjell PJ, Arnold FH, Khalil AS, Hanson AD, Dueber JE, Schaffer DV, Chen F, Kim S, Ángel Fernández L, Shoulders MD, Liu CC. Nat Rev Methods Primers. 2022;2:37. doi: 10.1038/s43586-022-00130-w. Epub 2022 May 19. PMID: 37073402. Free PMC article. No abstract available.
Long-read human genome sequencing and its applications.
Logsdon GA, Vollger MR, Eichler EE. Nat Rev Genet. 2020 Oct;21(10):597-614. doi: 10.1038/s41576-020-0236-x. Epub 2020 Jun 5. PMID: 32504078. Free PMC article. Review.
Genetic Biomonitoring and Biodiversity Assessment Using Portable Sequencing Technologies: Current Uses and Future Directions.
Krehenwinkel H, Pomerantz A, Prost S. Genes (Basel). 2019 Oct 29;10(11):858. doi: 10.3390/genes10110858. PMID: 31671909. Free PMC article. Review.
