Integration of machine learning models with microsatellite markers: New avenue in world grapevine germplasm characterization
- PMID: 38495412
- PMCID: PMC10940787
- DOI: 10.1016/j.bbrep.2024.101678
Integration of machine learning models with microsatellite markers: New avenue in world grapevine germplasm characterization
Abstract
Development of efficient analytical techniques is required for effective interpretation of biological data to take novel hypotheses and finding the critical predictive patterns. Machine Learning algorithms provide a novel opportunity for development of low-cost and practical solutions in biology. In this study, we proposed a new integrated analytical approach using supervised machine learning algorithms and microsatellites data of worldwide vitis populations. A total of 1378 wild (V. vinifera spp. sylvestris) and cultivated (V. vinifera spp. sativa) accessions of grapevine were investigated using 20 microsatellite markers. Data cleaning, feature selection, and supervised machine learning classification models vis, Naive Bayes, Support Vector Machine (SVM) and Tree Induction methods were implied to find most indicative and diagnostic alleles to represent wild/cultivated and originated geography of each population. Our combined approaches showed microsatellite markers with the highest differentiating capacity and proved efficiency for our pipeline of classification and prediction of vitis accessions. Moreover, our study proposed the best combination of markers for better distinguishing of populations, which can be exploited in future germplasm conservation and breeding programs.
Keywords: Feature selection; Machine learning; Microsatellites; Vitis.
© 2024 The Authors.
Conflict of interest statement
The authors declare there is not any conflict of interest.
Figures



Similar articles
-
Machine Learning Based Classification of Microsatellite Variation: An Effective Approach for Phylogeographic Characterization of Olive Populations.PLoS One. 2015 Nov 24;10(11):e0143465. doi: 10.1371/journal.pone.0143465. eCollection 2015. PLoS One. 2015. PMID: 26599001 Free PMC article.
-
Genetic diversity of wild and cultivated grapevine accessions from southeast Turkey.Hereditas. 2014 Oct;151(4-5):73-80. doi: 10.1111/hrd2.00039. Hereditas. 2014. PMID: 25363274
-
Genetic diversity and population structure assessed by SSR and SNP markers in a large germplasm collection of grape.BMC Plant Biol. 2013 Mar 7;13:39. doi: 10.1186/1471-2229-13-39. BMC Plant Biol. 2013. PMID: 23497049 Free PMC article.
-
Recent advances in biotechnological studies on wild grapevines as valuable resistance sources for smart viticulture.Mol Biol Rep. 2020 Apr;47(4):3141-3153. doi: 10.1007/s11033-020-05363-0. Epub 2020 Mar 4. Mol Biol Rep. 2020. PMID: 32130616 Review.
-
Georgian Grapevine Cultivars: Ancient Biodiversity for Future Viticulture.Front Plant Sci. 2021 Feb 5;12:630122. doi: 10.3389/fpls.2021.630122. eCollection 2021. Front Plant Sci. 2021. PMID: 33613611 Free PMC article. Review.
Cited by
-
Assessment of potato varieties of Lithuanian breeding resistance potato wart causative agents and late blight.Sci Rep. 2025 Feb 18;15(1):5915. doi: 10.1038/s41598-025-85526-w. Sci Rep. 2025. PMID: 39966506 Free PMC article.
References
-
- Panahi B., Afzal R., Ghorbanzadeh Neghab M., Mahmoodnia M., Paymard B. Relationship among AFLP, RAPD marker diversity and Agromorphological traits in safflower (Carthamus tinctorius L.) Prog. Biol. Sci. 2013;3(1):90–99.
-
- Mahmoudi B., Panahi B., Mohammadi S.A., Daliri M., Babayev M.S. Microsatellite based phylogeny and bottleneck studies of Iranian indigenous goat populations. Anim. Biotechnol. 2014;25(3):210–222. - PubMed
-
- Ghorbanzadeh Neghab M., Panahi B. Molecular characterization of Iranian black cumin (Nigella sativa L.) accessions using RAPD marker. Biotechnologia. 2017;98(2):97–102.
-
- Abbasi Holasou H., Mohammadzadeh Jalaly H., Mohammadi R., Panahi B. Genetic diversity and structure of superior spring frost tolerant genotypes of Persian walnut (Juglans regia L.) in East Azerbaijan province of Iran, characterized using inter simple sequence repeat (ISSR) markers. Genet. Resour. Crop Evol. 2023;70(2):539–548.
LinkOut - more resources
Full Text Sources
Miscellaneous