Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Dec 11;37(24):4661-4667.
doi: 10.1093/bioinformatics/btab528.

Nanopore base calling on the edge

Affiliations

Nanopore base calling on the edge

Peter Perešíni et al. Bioinformatics. .

Abstract

Motivation: MinION is a portable nanopore sequencing device that can be easily operated in the field with features including monitoring of run progress and selective sequencing. To fully exploit these features, real-time base calling is required. Up to date, this has only been achieved at the cost of high computing requirements that pose limitations in terms of hardware availability in common laptops and energy consumption.

Results: We developed a new base caller DeepNano-coral for nanopore sequencing, which is optimized to run on the Coral Edge Tensor Processing Unit, a small USB-attached hardware accelerator. To achieve this goal, we have designed new versions of two key components used in convolutional neural networks for speech recognition and base calling. In our components, we propose a new way of factorization of a full convolution into smaller operations, which decreases memory access operations, memory access being a bottleneck on this device. DeepNano-coral achieves real-time base calling during sequencing with the accuracy slightly better than the fast mode of the Guppy base caller and is extremely energy efficient, using only 10 W of power.

Availability and implementation: https://github.com/fmfi-compbio/coral-basecaller.

Supplementary information: Supplementary data are available at Bioinformatics online.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Bonito CNN-based architecture. The architecture is composed of high-level blocks depicted as colored rectangles, where the output of the previous block serves as an input of the following block. The neural network is composed of three blocks of type C, five blocks of type B and a Decoder block. The construction of these block types is depicted on the right; each block type is composed of standard building blocks used in deep learning
Fig. 2.
Fig. 2.
Receptive fields for basic types of convolutions. For each convolution type, the two rows represent input and output data of the convolution. Multiple channels are stacked. The colored value in the output tensor is computed from the values of the same color in the input tensor
Fig. 3.
Fig. 3.
Comparison of different convolution factorizations. Individual filled squares represent input values convolved with a weight tensor to a single output or intermediate value. Our k-blueprint-separable convolutions introduce a novel combination of basic blocks. (a) Cin=Cout=128 and tensor size (4, 1668, 128) and (b) Cin=Cout=256 and tensor size (4, 556, 256)
Fig. 4.
Fig. 4.
Inference time of different convolutions on the Coral device. Note that pointwise corresponds to a full convolution with depth = 1
Fig. 5.
Fig. 5.
Residual block with depth-to-space compression (right) is a drop-in replacement for a regular Bonito B-type block (left). The first separable convolution is replaced by a compression block, which compresses the input tensor dimensions from (T, C) to (T/x,Cy). Subsequent separable convolutions use depth reduced by a factor of x and channels increased by a factor of y. The final separable convolution is replaced by a decompression block
Fig. 6.
Fig. 6.
Speed-vers us-accuracy frontier for various architectures and depthwise kernel sizes. Each point represents the tradeoff for a given kernel size

References

    1. Alser M. et al. (2021) SneakySnake: a fast and accurate universal genome pre-alignment filter for CPUs, GPUs and FPGAs. Bioinformatics, 36, 5282–5290. - PubMed
    1. Boza V. et al. (2020) DeepNano-blitz: a fast base caller for MinION nanopore sequencers. Bioinformatics, 36, 4191–4192. - PubMed
    1. Cali D.S. et al. (2020) GenASM: a high-performance, low-power approximate string matching acceleration framework for genome sequence analysis. In: 2020 53rd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO), pp. 951–966. IEEE, Athens, Greece.
    1. Fujiki D. et al. (2018) Genax: a genome sequencing accelerator. In: 2018 ACM/IEEE 45th Annual International Symposium on Computer Architecture (ISCA), pp. 69–82. IEEE, Los Angeles, CA, USA.
    1. Glorot X., Bengio Y. (2010) Understanding the difficulty of training deep feedforward neural networks. In: Teh Y.W., Titterington D.M. (eds.) Proceedings of the 13th International Conference on Artificial Intelligence and Statistics (AISTATS), Vol. 9 of JMLR Proceedings, Sardinia, Italy, JMLR.org, pp. 249–256.

Publication types