Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Mar 10:38:101678.
doi: 10.1016/j.bbrep.2024.101678. eCollection 2024 Jul.

Integration of machine learning models with microsatellite markers: New avenue in world grapevine germplasm characterization

Affiliations

Integration of machine learning models with microsatellite markers: New avenue in world grapevine germplasm characterization

Hossein Abbasi Holasou et al. Biochem Biophys Rep. .

Abstract

Development of efficient analytical techniques is required for effective interpretation of biological data to take novel hypotheses and finding the critical predictive patterns. Machine Learning algorithms provide a novel opportunity for development of low-cost and practical solutions in biology. In this study, we proposed a new integrated analytical approach using supervised machine learning algorithms and microsatellites data of worldwide vitis populations. A total of 1378 wild (V. vinifera spp. sylvestris) and cultivated (V. vinifera spp. sativa) accessions of grapevine were investigated using 20 microsatellite markers. Data cleaning, feature selection, and supervised machine learning classification models vis, Naive Bayes, Support Vector Machine (SVM) and Tree Induction methods were implied to find most indicative and diagnostic alleles to represent wild/cultivated and originated geography of each population. Our combined approaches showed microsatellite markers with the highest differentiating capacity and proved efficiency for our pipeline of classification and prediction of vitis accessions. Moreover, our study proposed the best combination of markers for better distinguishing of populations, which can be exploited in future germplasm conservation and breeding programs.

Keywords: Feature selection; Machine learning; Microsatellites; Vitis.

PubMed Disclaimer

Conflict of interest statement

The authors declare there is not any conflict of interest.

Figures

Fig. 1
Fig. 1
Flowchart of the data analysis, which shows the structure of the analytical approach to the investigation of microsatellite (SSR) markers in this study.
Fig. 2
Fig. 2
Decision Tree generated model showing separation of wild and cultivated grape populations in the 2-targeted (2-t) experiment.
Fig. 3
Fig. 3
Decision Tree generated model showing separation of grape populations in the 9-targeted (9-t) experiment.

Similar articles

Cited by

References

    1. Panahi B., Afzal R., Ghorbanzadeh Neghab M., Mahmoodnia M., Paymard B. Relationship among AFLP, RAPD marker diversity and Agromorphological traits in safflower (Carthamus tinctorius L.) Prog. Biol. Sci. 2013;3(1):90–99.
    1. Panahi B., Neghab M.G. Genetic characterization of Iranian safflower (Carthamus tinctorius) using inter simple sequence repeats (ISSR) markers. Physiol. Mol. Biol. Plants. 2013;19(2):239–243. - PMC - PubMed
    1. Mahmoudi B., Panahi B., Mohammadi S.A., Daliri M., Babayev M.S. Microsatellite based phylogeny and bottleneck studies of Iranian indigenous goat populations. Anim. Biotechnol. 2014;25(3):210–222. - PubMed
    1. Ghorbanzadeh Neghab M., Panahi B. Molecular characterization of Iranian black cumin (Nigella sativa L.) accessions using RAPD marker. Biotechnologia. 2017;98(2):97–102.
    1. Abbasi Holasou H., Mohammadzadeh Jalaly H., Mohammadi R., Panahi B. Genetic diversity and structure of superior spring frost tolerant genotypes of Persian walnut (Juglans regia L.) in East Azerbaijan province of Iran, characterized using inter simple sequence repeat (ISSR) markers. Genet. Resour. Crop Evol. 2023;70(2):539–548.

LinkOut - more resources