Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jul-Aug;22(4):1564-1573.
doi: 10.1109/TCBBIO.2025.3562082.

DualF-PBR: Dual-Extracting Protein Sequence Features for Predicting Plant Resistance Proteins

DualF-PBR: Dual-Extracting Protein Sequence Features for Predicting Plant Resistance Proteins

Hui Fang et al. IEEE Trans Comput Biol Bioinform. 2025 Jul-Aug.

Abstract

Plant resistance proteins are evolved during growth and development to cope with complex environmental changes and infection of pathogens. Predicting plant resistance proteins is of great significance for further exploring plant disease resistance mechanism against viruses. In this paper, we propose a method for predicting plant resistance protein by dual-extracting features. The dual-extracted features are composed of the features extracted by modeling self-attention neural network and detecting sequence structure information respectively to obtain 2381-dimensional protein sequence features. We utilize the Least Absolute Shrinkage and Selection Operator (LASSO) algorithm to eliminate redundant features from the extracted 2381-dimensional features to form 53 key features. These 53 key features are inputted into the Lightweight Gradient Boosting Machine (LightGBM) model to predict plant resistance proteins. Experimental results of five-fold cross-validation on real datasets demonstrate that our proposed prediction method outperforms existing methods overall in accuracy, sensitivity, specificity, Matthews correlation coefficient, F1 score, and area under the curve (AUC) in the case of slightly imbalanced datasets. This research work will aid in filtrating plant resistance genes and proteins, and promote disease-resistant breeding for plants.

PubMed Disclaimer