Partial least squares regression, support vector machine regression, and transcriptome-based distances for prediction of maize hybrid performance with gene expression data
- PMID: 22101908
- DOI: 10.1007/s00122-011-1747-9
Abstract
The performance of hybrids can be predicted with gene expression data from their parental inbred lines. Implementing such prediction approaches in breeding programs promises to increase the efficiency of hybrid breeding. The objectives of our study were to compare the accuracy of prediction models employing multiple linear regression (MLR), partial least squares regression (PLS), support vector machine regression (SVM), and transcriptome-based distances (D(B)). For a factorial of 7 flint and 14 dent maize lines, the grain yield of the hybrids was assessed and the gene expression of the parental lines was profiled with a 56k microarray. The accuracy of the prediction models was measured by the correlation between predicted and observed yield employing two cross-validation schemes. The first modeled the prediction of hybrids when testcross data are available for both parental lines (type 2 hybrids), and the second modeled the prediction of hybrids when no testcross data for the parental lines were available (type 0 hybrids). MLR, SVM, and PLS resulted in a high correlation between predicted and observed yield for type 2 hybrids, whereas for type 0 hybrids D(B) had greater prediction accuracy. The regression methods were robust to the choice of the set of profiled genes and required only a few hundred genes. In contrast, for an accurate hybrid prediction with D(B), 1,000-1,500 genes were required, and the prediction accuracy depended strongly on the set of profiled genes. We conclude that for prediction within one set of genetic material MLR is a promising approach, and for transferring prediction models from one set of genetic material to a related one, the transcriptome-based distance D(B) is most promising.
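The two cross-validation schemes described in the abstract can be illustrated with a minimal sketch. It assumes a hybrid's type is the number of its parents (0, 1, or 2) that appear in at least one training hybrid, and that accuracy is the Pearson correlation between predicted and observed yield. The abstract does not give the study's sampling procedure, so the line names, split sizes, and the helpers `pearson` and `classify_test_hybrids` are hypothetical.

```python
import random
from itertools import product


def pearson(x, y):
    """Pearson correlation between two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def classify_test_hybrids(flints, dents, train_hybrids):
    """Label each non-training hybrid by how many of its parents (0, 1, or 2)
    appear in at least one training hybrid, i.e. have testcross data."""
    tested_flints = {f for f, _ in train_hybrids}
    tested_dents = {d for _, d in train_hybrids}
    return {
        (f, d): int(f in tested_flints) + int(d in tested_dents)
        for f, d in product(flints, dents)
        if (f, d) not in train_hybrids
    }


# 7 flint x 14 dent factorial, as in the abstract; line names are placeholders.
flints = [f"F{i}" for i in range(1, 8)]
dents = [f"D{j}" for j in range(1, 15)]

rng = random.Random(0)  # fixed seed so the split is reproducible
# Assumed split sizes: 4 flint and 8 dent lines act as tested parents.
train_flints = rng.sample(flints, 4)
train_dents = rng.sample(dents, 8)
candidates = list(product(train_flints, train_dents))
# Keep a random 20 of the 32 crosses between tested parents as training hybrids.
train_hybrids = set(rng.sample(candidates, 20))

hybrid_type = classify_test_hybrids(flints, dents, train_hybrids)
type2 = [h for h, t in hybrid_type.items() if t == 2]  # both parents tested
type0 = [h for h, t in hybrid_type.items() if t == 0]  # neither parent tested
```

With predictions from any of the models in hand, the accuracy for each scheme would then be `pearson(predicted, observed)` computed separately over the `type2` and `type0` hybrids.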