Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Apr 20;23(1):142.
doi: 10.1186/s12859-022-04686-y.

RODAN: a fully convolutional architecture for basecalling nanopore RNA sequencing data

Affiliations

RODAN: a fully convolutional architecture for basecalling nanopore RNA sequencing data

Don Neumann et al. BMC Bioinformatics. .

Abstract

Background: Despite recent progress in basecalling of Oxford nanopore DNA sequencing data, its wide adoption is still being hampered by its relatively low accuracy compared to short read technologies. Furthermore, very little of the recent research was focused on basecalling of RNA data, which has different characteristics than its DNA counterpart.

Results: We fill this gap by benchmarking a fully convolutional deep learning basecalling architecture with improved performance compared to Oxford nanopore's RNA basecallers.

Availability: The source code for our basecaller is available at: https://github.com/biodlab/RODAN .

Keywords: Convolutional networks; Long read sequencing; Oxford nanopore; RNA basecalling.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

Fig. 1
Fig. 1
The RODAN architecture. The normalized signal is passed through a succession of convolutional blocks which gradually incorporate surrounding information. Each block is composed of several processing steps (convolution, activation, batch normalization etc.), which are standard building blocks in the construction of deep neural networks. The final output is passed through a fully connected layer to produce the decoded sequence of nucleotides
Fig. 2
Fig. 2
Read statistics. For each of the five datasets we show histograms of read length in (a), and basecalling calling accuracy as a function of read length

Similar articles

Cited by

References

    1. Amarasinghe SL, Su S, Dong X, Zappia L, Ritchie ME, Gouil Q. Opportunities and challenges in long-read sequencing data analysis. Genome Biol. 2020;21(1):1–16. doi: 10.1186/s13059-020-1935-5. - DOI - PMC - PubMed
    1. Garalde DR, Snell EA, Jachimowicz D, Sipos B, Lloyd JH, Bruce M, Pantic N, Admassu T, James P, Warland A, Jordan M, Ciccone J, Serra S, Keenan J, Martin S, McNeill LE, Wallace J, Jayasinghe L, Wright C, Blasco J, Young S, Brocklebank D, Juul S, Clarke J, Turner DJ. Highly parallel direct RNA sequencing on an array of nanopores. Nat Methods. 2018;15(3):201. doi: 10.1038/nmeth.4577. - DOI - PubMed
    1. Wick RR, Judd LM, Holt KE. Performance of neural network basecalling tools for oxford nanopore sequencing. Genome Biol. 2019;20(1):1–10. doi: 10.1186/s13059-019-1727-y. - DOI - PMC - PubMed
    1. Teng H, Cao MD, Hall MB, Duarte T, Wang S, Coin LJ. Chiron: translating nanopore raw signal directly into nucleotide sequence using deep learning. GigaScience. 2018;7(5):037. doi: 10.1093/gigascience/giy037. - DOI - PMC - PubMed
    1. Boža V, Brejová B, Vinař T. DeepNano: deep recurrent neural networks for base calling in minion nanopore reads. PloS One. 2017;12(6):0178751. doi: 10.1371/journal.pone.0178751. - DOI - PMC - PubMed

LinkOut - more resources