Review

Deep learning in next-generation sequencing

Bertil Schmidt et al. Drug Discov Today. 2021 Jan;26(1):173-180.
doi: 10.1016/j.drudis.2020.10.002. Epub 2020 Oct 12.

Abstract

Next-generation sequencing (NGS) methods lie at the heart of large parts of biological and medical research. Their fundamental importance has created a continuously increasing demand for methods to process and analyze the data sets produced, addressing questions such as variant calling, metagenomic classification and quantification, genomic feature detection, or downstream analysis in larger biological or medical contexts. In addition to classical algorithmic approaches, machine-learning (ML) techniques are often used for such tasks. In particular, deep learning (DL) methods that use multilayered artificial neural networks (ANNs) for supervised, semisupervised, and unsupervised learning have gained significant traction for such applications. Here, we highlight important network architectures, application areas, and DL frameworks in an NGS context.


Figures

Figure 1
Overview of ANN architectures: (a) An artificial neuron maps an input vector x_i, 0 ≤ i ≤ n, to a scalar output y by applying a nonlinear activation function φ to a weighted sum s := ∑_{i=0}^{n} w_i x_i = wᵀx. (b) A multilayer perceptron (MLP) comprising an input layer, a fully connected hidden layer, and an output layer. (c) A single layer of a convolutional neural network (CNN), where matrix multiplication is replaced by a convolution with a small filter kernel matrix, the entries of which are learned during training, followed by a ReLU activation function and (max) pooling. (d) Recurrent neural networks (RNNs) feature feedback connections to earlier layers and can be trained to learn time-dependent relations. (e) Autoencoders (AEs) are designed to identify useful data encodings in an unsupervised setting. (f) Generative adversarial networks (GANs) train two networks simultaneously. The generator produces new data points, whereas the discriminator classifies data points as either genuine or fake.
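The artificial neuron in panel (a) and the fully connected layer in panel (b) can be sketched directly from the caption's formula y = φ(wᵀx). The following is a minimal illustration only (not from the article); the function names `relu`, `neuron`, and `dense_layer` are our own, and ReLU stands in for the generic activation φ:

```python
import numpy as np

def relu(s):
    # ReLU activation: phi(s) = max(0, s), as used in the CNN panel (c)
    return np.maximum(0.0, s)

def neuron(x, w):
    # Artificial neuron, panel (a): weighted sum s := sum_i w_i * x_i = w^T x,
    # passed through a nonlinear activation phi (here ReLU)
    s = np.dot(w, x)
    return relu(s)

def dense_layer(x, W):
    # Fully connected MLP layer, panel (b): each row of W is one neuron's
    # weight vector, so the layer computes phi(W x) elementwise
    return relu(W @ x)

x = np.array([1.0, 2.0, 3.0])
w = np.array([0.5, -0.25, 0.1])
print(neuron(x, w))  # 0.5*1 - 0.25*2 + 0.1*3 = 0.3
```

In practice a layer also adds a bias term and the weights are learned by gradient descent; this sketch only mirrors the forward computation described in the caption.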

