Prediction of maize single-cross hybrid performance: support vector machine regression versus best linear prediction
- PMID: 19904522
- DOI: 10.1007/s00122-009-1200-5
Prediction of maize single-cross hybrid performance: support vector machine regression versus best linear prediction
Abstract
Accurate prediction of the phenotypic performance of a hybrid plant based on the molecular fingerprints of its parents should lead to a more cost-effective breeding programme as it allows to reduce the number of expensive field evaluations. The construction of a reliable prediction model requires a representative sample of hybrids for which both molecular and phenotypic information are accessible. This phenotypic information is usually readily available as typical breeding programmes test numerous new hybrids in multi-location field trials on a yearly basis. Earlier studies indicated that a linear mixed model analysis of this typically unbalanced phenotypic data allows to construct epsilon-insensitive support vector machine regression and best linear prediction models for predicting the performance of single-cross maize hybrids. We compare these prediction methods using different subsets of the phenotypic and marker data of a commercial maize breeding programme and evaluate the resulting prediction accuracies by means of a specifically designed field experiment. This balanced field trial allows to assess the reliability of the cross-validation prediction accuracies reported here and in earlier studies. The limits of the predictive capabilities of both prediction methods are further examined by reducing the number of training hybrids and the size of the molecular fingerprints. The results indicate a considerable discrepancy between prediction accuracies obtained by cross-validation procedures and those obtained by correlating the predictions with the results of a validation field trial. The prediction accuracy of best linear prediction was less sensitive to a reduction of the number of training examples compared with that of support vector machine regression. The latter was, however, better at predicting hybrid performance when the size of the molecular fingerprints was reduced, especially if the initial set of markers had a low information content.
Similar articles
-
Support vector machine regression for the prediction of maize hybrid performance.Theor Appl Genet. 2007 Nov;115(7):1003-13. doi: 10.1007/s00122-007-0627-9. Epub 2007 Sep 6. Theor Appl Genet. 2007. PMID: 17849095
-
Genome properties and prospects of genomic prediction of hybrid performance in a breeding program of maize.Genetics. 2014 Aug;197(4):1343-55. doi: 10.1534/genetics.114.165860. Epub 2014 May 21. Genetics. 2014. PMID: 24850820 Free PMC article.
-
Novel strategies for genomic prediction of untested single-cross maize hybrids using unbalanced historical data.Theor Appl Genet. 2020 Feb;133(2):443-455. doi: 10.1007/s00122-019-03475-1. Epub 2019 Nov 22. Theor Appl Genet. 2020. PMID: 31758202
-
Prediction of hybrid performance in maize using molecular markers and joint analyses of hybrids and parental inbreds.Theor Appl Genet. 2010 Jan;120(2):451-61. doi: 10.1007/s00122-009-1208-x. Epub 2009 Nov 15. Theor Appl Genet. 2010. PMID: 19916002
-
Prediction of hybrid performance in maize with a ridge regression model employed to DNA markers and mRNA transcription profiles.BMC Genomics. 2016 Mar 29;17:262. doi: 10.1186/s12864-016-2580-y. BMC Genomics. 2016. PMID: 27025377 Free PMC article.
Cited by
-
Performance prediction of F1 hybrids between recombinant inbred lines derived from two elite maize inbred lines.Theor Appl Genet. 2013 Jan;126(1):189-201. doi: 10.1007/s00122-012-1973-9. Epub 2012 Sep 13. Theor Appl Genet. 2013. PMID: 22972201
-
Identification of heterotic loci associated with grain yield and its components using two CSSL test populations in maize.Sci Rep. 2016 Dec 5;6:38205. doi: 10.1038/srep38205. Sci Rep. 2016. PMID: 27917917 Free PMC article.
-
Genomic prediction of hybrid performance in maize with models incorporating dominance and population specific marker effects.Theor Appl Genet. 2012 Oct;125(6):1181-94. doi: 10.1007/s00122-012-1905-8. Epub 2012 Jun 26. Theor Appl Genet. 2012. PMID: 22733443
-
Heterotic loci identified for maize kernel traits in two chromosome segment substitution line test populations.Sci Rep. 2018 Jul 23;8(1):11101. doi: 10.1038/s41598-018-29338-1. Sci Rep. 2018. PMID: 30038303 Free PMC article.
-
Prediction of hybrid biomass in Arabidopsis thaliana by selected parental SNP and metabolic markers.Theor Appl Genet. 2010 Jan;120(2):239-47. doi: 10.1007/s00122-009-1191-2. Epub 2009 Nov 13. Theor Appl Genet. 2010. PMID: 19911163 Free PMC article.
References
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources