Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 May 18;10(1):8195.
doi: 10.1038/s41598-020-65011-2.

Regularized selection indices for breeding value prediction using hyper-spectral image data

Affiliations

Regularized selection indices for breeding value prediction using hyper-spectral image data

Marco Lopez-Cruz et al. Sci Rep. .

Abstract

High-throughput phenotyping (HTP) technologies can produce data on thousands of phenotypes per unit being monitored. These data can be used to breed for economically and environmentally relevant traits (e.g., drought tolerance); however, incorporating high-dimensional phenotypes in genetic analyses and in breeding schemes poses important statistical and computational challenges. To address this problem, we developed regularized selection indices; the methodology integrates techniques commonly used in high-dimensional phenotypic regressions (including penalization and rank-reduction approaches) into the selection index (SI) framework. Using extensive data from CIMMYT's (International Maize and Wheat Improvement Center) wheat breeding program we show that regularized SIs derived from hyper-spectral data offer consistently higher accuracy for grain yield than those achieved by standard SIs, and by vegetation indices commonly used to predict agronomic traits. Regularized SIs offer an effective approach to leverage HTP data that is routinely generated in agriculture; the methodology can also be used to conduct genetic studies using high-dimensional phenotypes that are often collected in humans and model organisms including body images and whole-genome gene expression profiles.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Figure 1
Figure 1
Prediction of the genetic merit for grain yield using hyper-spectral crop image data. (A) Data consists of hyper-spectral reflectance data (xi) and phenotypic measurements of the target trait (yi, e.g., grain yield). (B) A subset of the data (the training set) is used to derive the coefficients (β) of a selection index. (C) These coefficients are then applied to image data of individuals in the testing set to derive the index (Ii) for each individual. The predictive ability of the index is assessed by calculating the accuracy of indirect selection (Acc(I)) in the testing set.
Figure 2
Figure 2
Accuracy of indirect selection of regularized SIs and its components. Square root heritability (green), genetic correlation (orange), and accuracy of indirect selection (purple, all averaged over 100 training-testing partitions), versus the number of predictors used to build the index: (A) number of active bands in the case of the L1-PSI, or (B) number of PCs in the PC-SI. Each panel represents one environment (latest time-point).
Figure 3
Figure 3
Accuracy of indirect selection achieved by a standard (SI) and by regularized (PC-SI and L1-PSI) selection indices. The lines provide the average accuracy over 100 training-testing partitions. Vertical lines represent a 95% confidence interval for the average. The horizontal axis gives the time-point at which images were collected and are expressed in both days after sowing (DAS) and stages (VEG = vegetative, GF = grain filling, MAT = maturity).
Figure 4
Figure 4
Heatmap of regression coefficients for L1-penalized selection indices. Separate indices were derived for each environment using multi time-point data. DAS = days after sowing, VEG, GF, MAT represent vegetative, grain-filling and maturity stages, respectively. The bottom color-bar shows the light color associated with each waveband in the visible spectrum (≤750  m); black was used to represent the near-infrared spectrum (wavelength > 750 nm).

References

    1. Nagel KA, et al. GROWSCREEN-Rhizo is a novel phenotyping robot enabling simultaneous measurements of root and shoot growth for plants grown in soil-filled rhizotrons. Funct. Plant Biol. 2012;39:891–904. doi: 10.1071/FP12023. - DOI - PubMed
    1. Araus L, Cairns JE. Field high-throughput phenotyping: the new crop breeding frontier. Trends Plant Sci. 2014;19:52–61. doi: 10.1016/j.tplants.2013.09.008. - DOI - PubMed
    1. Montes JM, et al. Near-infrared spectroscopy on combine harvesters to measure maize grain dry matter content and quality parameters. Plant Breed. 2006;125:591–595. doi: 10.1111/j.1439-0523.2006.01298.x. - DOI
    1. White JW, et al. Field Crops Research Field-based phenomics for plant genetics research. F. Crop. Res. 2012;133:101–112. doi: 10.1016/j.fcr.2012.04.003. - DOI
    1. Ferrio JP, et al. Assessment of durum wheat yield using visible and near-infrared reflectance spectra of canopies. F. Crop. Res. 2005;94:126–148. doi: 10.1016/j.fcr.2004.12.002. - DOI

Publication types

LinkOut - more resources