Genomic prediction with machine learning in sugarcane, a complex highly polyploid clonally propagated crop with substantial non-additive variation for key traits
- PMID: 37728221
- DOI: 10.1002/tpg2.20390
Abstract
Sugarcane has a complex, highly polyploid genome with multi-species ancestry. Additive models for genomic prediction of clonal performance might not capture interactions between genes and alleles from different ploidies and ancestral species. Genomic prediction in sugarcane therefore presents an interesting test case for machine learning (ML) methods, which are purportedly able to handle high levels of complexity in prediction. Here, we investigated deep learning (DL) neural networks, including multilayer perceptrons (MLP) and convolutional neural networks (CNN), and an ensemble ML approach, random forest (RF), for genomic prediction in sugarcane. The data set comprised 2912 sugarcane clones, genotyped for 26,086 genome-wide single nucleotide polymorphism markers, with final assessment trial data for total cane harvested (TCH), commercial cane sugar (CCS), and fiber content (Fiber). The clones in the latest trial (2017) were used as a validation set. We compared the prediction accuracy of these methods with that of genomic best linear unbiased prediction (GBLUP) extended to include dominance and epistatic effects. Prediction accuracies from GBLUP models were up to 0.37 for TCH, 0.43 for CCS, and 0.48 for Fiber, while the optimized ML models had prediction accuracies of 0.35 for TCH, 0.38 for CCS, and 0.48 for Fiber. Both the RF and DL neural network models had predictive ability comparable to that of the additive GBLUP model but were less accurate than the extended GBLUP model.
© 2023 The Authors. The Plant Genome published by Wiley Periodicals LLC on behalf of Crop Science Society of America.
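The validation scheme described in the abstract (train on earlier trials, predict the clones in the latest trial, report a prediction accuracy) can be illustrated with a minimal additive GBLUP sketch. This is not the authors' pipeline; everything in it is an assumption made for illustration. The genotypes and phenotypes are simulated, markers use a diploid-style 0/1/2 dosage coding that does not reflect sugarcane polyploidy, the genomic relationship matrix follows VanRaden's first method, the variance ratio `lam` is fixed from the simulated heritability rather than estimated (in practice it would be estimated, for example by REML), and accuracy is taken to be the Pearson correlation between predicted and observed values in the held-out set. The function names `vanraden_grm` and `gblup_predict` are hypothetical.

```python
import numpy as np


def vanraden_grm(M):
    """Additive genomic relationship matrix from an (n x m) 0/1/2 dosage matrix."""
    p = M.mean(axis=0) / 2.0                     # allele frequency per marker
    Z = M - 2.0 * p                              # centre each marker column
    return Z @ Z.T / (2.0 * np.sum(p * (1.0 - p)))


def gblup_predict(K, y_train, train_idx, test_idx, lam):
    """Predict held-out phenotypes from relationship matrix K.

    lam is the assumed ratio of residual to genetic variance.
    """
    mu = y_train.mean()                          # intercept: training mean
    K_tt = K[np.ix_(train_idx, train_idx)]       # train-by-train relationships
    K_vt = K[np.ix_(test_idx, train_idx)]        # validation-by-train relationships
    alpha = np.linalg.solve(K_tt + lam * np.eye(len(train_idx)), y_train - mu)
    return mu + K_vt @ alpha


# Simulated stand-in for the real data (all values here are assumptions).
rng = np.random.default_rng(0)
n, m, h2 = 500, 200, 0.6
freqs = rng.uniform(0.1, 0.9, size=m)
M = rng.binomial(2, freqs, size=(n, m)).astype(float)
g = (M - M.mean(axis=0)) @ rng.normal(size=m)    # purely additive genetic values
e = rng.normal(scale=np.sqrt(g.var() * (1.0 - h2) / h2), size=n)
y = g + e

# The last 100 clones play the role of the "latest trial" validation set.
train_idx = np.arange(0, 400)
test_idx = np.arange(400, n)

G = vanraden_grm(M)
y_hat = gblup_predict(G, y[train_idx], train_idx, test_idx, lam=(1.0 - h2) / h2)

# Prediction accuracy as the correlation of predicted and observed phenotypes.
accuracy = np.corrcoef(y_hat, y[test_idx])[0, 1]
print(f"prediction accuracy (simulated data): {accuracy:.2f}")
```

Because the simulated trait is purely additive, this sketch says nothing about the gain from dominance or epistatic terms reported in the abstract; it only shows the mechanics of hold-out validation with a relationship-matrix model.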
Similar articles
- Improved genomic prediction of clonal performance in sugarcane by exploiting non-additive genetic effects. Theor Appl Genet. 2021 Jul;134(7):2235-2252. doi: 10.1007/s00122-021-03822-1. Epub 2021 Apr 26. PMID: 33903985. Free PMC article.
- Machine learning for genomic and pedigree prediction in sugarcane. Plant Genome. 2024 Sep;17(3):e20486. doi: 10.1002/tpg2.20486. Epub 2024 Jun 26. PMID: 38923818.
- Accuracy of genomic prediction of complex traits in sugarcane. Theor Appl Genet. 2021 May;134(5):1455-1462. doi: 10.1007/s00122-021-03782-6. Epub 2021 Feb 15. PMID: 33590303.
- Genomic Selection in Sugarcane: Current Status and Future Prospects. Front Plant Sci. 2021 Sep 27;12:708233. doi: 10.3389/fpls.2021.708233. eCollection 2021. PMID: 34646284. Free PMC article. Review.
- Sugarcane improvement: how far can we go? Curr Opin Biotechnol. 2012 Apr;23(2):265-70. doi: 10.1016/j.copbio.2011.09.002. Epub 2011 Oct 7. PMID: 21983270. Review.
Cited by
- Genomic prediction for sugarcane diseases including hybrid Bayesian-machine learning approaches. Front Plant Sci. 2024 May 1;15:1398903. doi: 10.3389/fpls.2024.1398903. eCollection 2024. PMID: 38751840. Free PMC article.