Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 May 16;21(1):62.
doi: 10.1186/s13007-025-01383-8.

Machine Learning-Based identification of resistance genes associated with sunflower broomrape

Affiliations

Machine Learning-Based identification of resistance genes associated with sunflower broomrape

Yingxue Che et al. Plant Methods. .

Abstract

Background: Sunflowers (Helianthus annuus L.), a vital oil crop, are facing a severe challenge from broomrape (Orobanche cumana), a parasitic plant that seriously jeopardizes the growth and development of sunflowers, limits global production and leads to substantial economic losses, which urges the development of resistant sunflower varieties.

Results: This study aims to identify resistance genes from a comprehensive transcriptomic profile of 103 sunflower varieties based on gene expression data and then constructs predictive models with the key resistant genes. The least absolute shrinkage and selection operator (LASSO) regression and random forest feature importance ranking method were used to identify resistance genes. These genes were considered as biomarkers in constructing machine learning models with Support Vector Machine (SVM), K-Nearest Neighbours (KNN), Logistic Regression (LR), and Gaussian Naive Bayes (GaussianNB). The SVM model constructed with the 24 key genes selected by the LASSO method demonstrated high classification accuracy (0.9514) and a robust AUC value (0.9865), effectively distinguishing between resistant and susceptible varieties based on gene expression data. Furthermore, we discovered a correlation between key genes and differential metabolites, particularly jasmonic acid (JA).

Conclusion: Our study highlights a novel perspective on screening sunflower varieties for broomrape resistance, which is anticipated to guide future biological research and breeding strategies.

Keywords: Feature selection; Machine learning; Resistance genes; Sunflower broomrape.

PubMed Disclaimer

Conflict of interest statement

Declarations. Ethics approval and consent to participate: Not applicable. Competing interests: The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Sunflower phenotypes and results of WGCNA analyses. A Schematic diagram of the phenotypes of resistant and susceptible varieties after infestation with broomrape and pictures of roots parasitized by broomrape. B Bar plot showing the number of susceptible and resistant varieties. C Plot of phenotypes of different classifications. D Volcano plot of differential genes of edgeR analysis. Blue represents the down-regulation of gene expression, and red represents the up-regulation of gene expression. E The scale-free fit index for various soft-thresholding powers (β) and the mean connectivity for various soft-thresholding powers. F Dendrogram of genes clustered via the dissimilarity measure. G Heatmap of the correlation between module eigengenes and traits. Grey module indicates genes that do not match any module. H Scatter plot between GS and MM in the brown module
Fig. 2
Fig. 2
Performance of models constructed using shared genes. A A total of 32 shared genes were identified between key module genes and DEGs. B The GO enrichment results from shared genes, GO terms with False Discovery Rate (FDR) < 0.05 were considered significant. C Model prediction performance of train dataset. D Model prediction performance of test dataset
Fig. 3
Fig. 3
Prediction model was constructed using key genes identified through the LASSO method. A LASSO regression algorithm. B Number of features for different lambda values. C Model prediction performance of training dataset. D Model prediction performance of test dataset. E Expression of 24 key genes in the two groups. *, **, and *** represent statistical significance at p < 0.05, p < 0.01, and p < 0.001, respectively
Fig. 4
Fig. 4
Prediction model constructed by Random Forest importance rank. A The top 30 feature importance ranking. B Model prediction performance of training dataset. C Model prediction performance of test dataset. D, E Partial dependency graph of the top 2 features of the selected features. F The contour plot visually demonstrates the influence of the top 2 genes on the predicted values of LR
Fig. 5
Fig. 5
The performance of the machine learning model. A The performance of the machine learning model on the training datasets. B The performance of the machine learning model on the test datasets
Fig. 6
Fig. 6
The results of metabolic analysis. A PCA plot. B Volcano plot of differential metabolites. C Heatmap showed the results of the clustering analysis of differential metabolites. D Correlation between genes from LASSO and differential metabolites. Diamonds represented key genes from LASSO; red lines represented positive correlation; green lines represented negative correlation

Similar articles

References

    1. Mutuku JM, Cui S, Yoshida S, Shirasu K. Orobanchaceae parasite-host interactions. New Phytol. 2021;230:46–59. - PubMed
    1. Louarn J, Boniface MC, Pouilly N, Velasco L, Pérez-Vich B, Vincourt P, Muños S. Sunflower resistance to Broomrape (Orobanche cumana) is controlled by specific QTLs for different parasitism stages. Front Plant Sci. 2016;7:590. - PMC - PubMed
    1. Xu Y, Zhang J, Ma C, Lei Y, Shen G, Jin J, Eaton DAR, Wu J. Comparative genomics of orobanchaceous species with different parasitic lifestyles reveals the origin and Stepwise evolution of plant parasitism. Mol Plant. 2022;15:1384–99. - PubMed
    1. Chabaud M, Auriac M-C, Boniface M-C, Delgrange S, Folletti T, Jardinaud M-F, Legendre A, Pérez-Vich B, Pouvreau J-B, Velasco L et al. Wild Helianthus species: A reservoir of resistance genes for sustainable pyramidal resistance to Broomrape in sunflower. Front Plant Sci. 2022;13:1038684. - PMC - PubMed
    1. Saabna N, Keasar T. Parasitoids for biological control in dryland agroecosystems. Curr Opin Insect Sci. 2024;64:101226. - PubMed

LinkOut - more resources