Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2021 Jan 6;22(1):19.
doi: 10.1186/s12864-020-07319-x.

A review of deep learning applications for genomic selection

Affiliations
Review

A review of deep learning applications for genomic selection

Osval Antonio Montesinos-López et al. BMC Genomics. .

Abstract

Background: Several conventional genomic Bayesian (or no Bayesian) prediction methods have been proposed including the standard additive genetic effect model for which the variance components are estimated with mixed model equations. In recent years, deep learning (DL) methods have been considered in the context of genomic prediction. The DL methods are nonparametric models providing flexibility to adapt to complicated associations between data and output with the ability to adapt to very complex patterns.

Main body: We review the applications of deep learning (DL) methods in genomic selection (GS) to obtain a meta-picture of GS performance and highlight how these tools can help solve challenging plant breeding problems. We also provide general guidance for the effective use of DL methods including the fundamentals of DL and the requirements for its appropriate use. We discuss the pros and cons of this technique compared to traditional genomic prediction approaches as well as the current trends in DL applications.

Conclusions: The main requirement for using DL is the quality and sufficiently large training data. Although, based on current literature GS in plant and animal breeding we did not find clear superiority of DL in terms of prediction power compared to conventional genome based prediction models. Nevertheless, there are clear evidences that DL algorithms capture nonlinear patterns more efficiently than conventional genome based. Deep learning algorithms are able to integrate data from different sources as is usually needed in GS assisted breeding and it shows the ability for improving prediction accuracy for large plant breeding data. It is important to apply DL to large training-testing data sets.

Keywords: Deep learning; Genomic selection; Genomic trends; Plant breeding.

PubMed Disclaimer

Conflict of interest statement

Not applicable.

Figures

Fig. 1
Fig. 1
A five-layer feedforward deep neural network with one input layer, four hidden layers and one output layer. There are eight neurons in the input layer that corresponds to the input information, four neurons in the first three hidden layers, three neurons in the fourth hidden layer and three neurons in the output layer that corresponds to the traits that will be predicted
Fig. 2
Fig. 2
A simple two-layer recurrent artificial neural network with univariate outcome (a). Max pooling with 2 × 2 filters and stride 1 (b)
Fig. 3
Fig. 3
Convolutional neural network
Fig. 4
Fig. 4
Training set, tuning set and testing set (adapted from Singh et al., 2018)
Fig. 5
Fig. 5
Histograms of the AUC criterion and their standard deviation (error bars) for the wheat (a) and maize (b) datasets. a: grain yield (GY) in seven environments (1–7) of classifiers MLP and PNN of the upper 15 and 30% classes; b: grain yield (GY) under optimal conditions (HI and WW) and stress conditions (LO and SS) of classifiers MLP and PNN in the upper 15 and 30% classes
Fig. 6
Fig. 6
Pearson’s correlation across environments for the GBLUP and the DL model. The first vertical sub-panel corresponds to the model with genotype × environment interaction (I), and the second vertical sub-panel corresponds to the same model but without genotype × environment interaction (WI) (Montesinos-López et al., 2018a)

Similar articles

Cited by

References

    1. He T, Li C. Harness the power of genomic selection and the potential of germplasm in crop breeding for global food security in the era with rapid climate change. Crop J. 2020;8(5):688–700.
    1. Qaim M. Role of new plant breeding Technologies for Food Security and Sustainable Agricultural Development. Appl Economic Perspect Policy. 2020;42(2):129–150. doi: 10.1002/aepp.13044. - DOI
    1. Bernardo R. Prediction of maize single-cross performance using RFLPs and information from related hybrids. Crop Sci. 1994;34:20–25. doi: 10.2135/cropsci1994.0011183X003400010003x. - DOI
    1. Meuwissen THE, Hayes BJ, Goddard ME. Prediction of total genetic value using genome-wide dense marker maps. Genetics. 2001;157:1819–1829. - PMC - PubMed
    1. Bernardo R. Molecular markers and selection for complex traits in plants: learning from the last 20 years. Crop Sci. 2008;48:1649–1664. doi: 10.2135/cropsci2008.03.0131. - DOI

LinkOut - more resources