Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Feb 7:14:1277121.
doi: 10.3389/fmicb.2023.1277121. eCollection 2023.

Oral_voting_transfer: classification of oral microorganisms' function proteins with voting transfer model

Affiliations

Oral_voting_transfer: classification of oral microorganisms' function proteins with voting transfer model

Wenzheng Bao et al. Front Microbiol. .

Abstract

Introduction: The oral microbial group typically represents the human body's highly complex microbial group ecosystem. Oral microorganisms take part in human diseases, including Oral cavity inflammation, mucosal disease, periodontal disease, tooth decay, and oral cancer. On the other hand, oral microbes can also cause endocrine disorders, digestive function, and nerve function disorders, such as diabetes, digestive system diseases, and Alzheimer's disease. It was noted that the proteins of oral microbes play significant roles in these serious diseases. Having a good knowledge of oral microbes can be helpful in analyzing the procession of related diseases. Moreover, the high-dimensional features and imbalanced data lead to the complexity of oral microbial issues, which can hardly be solved with traditional experimental methods.

Methods: To deal with these challenges, we proposed a novel method, which is oral_voting_transfer, to deal with such classification issues in the field of oral microorganisms. Such a method employed three features to classify the five oral microorganisms, including Streptococcus mutans, Staphylococcus aureus, abiotrophy adjacent, bifidobacterial, and Capnocytophaga. Firstly, we utilized the highly effective model, which successfully classifies the organelle's proteins and transfers to deal with the oral microorganisms. And then, some classification methods can be treated as the local classifiers in this work. Finally, the results are voting from the transfer classifiers and the voting ones.

Results and discussion: The proposed method achieved the well performances in the five oral microorganisms. The oral_voting_transfer is a standalone tool, and all its source codes are publicly available at https://github.com/baowz12345/voting_transfer.

Keywords: bioinformatics; classification; machine learning; oral microorganisms proteins; voting transfer model.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

FIGURE 1
FIGURE 1
The flowchart of Oral_voting_transfer model
FIGURE 2
FIGURE 2
The ROC of Streptococcus mutans active sites. (A) Is the ROC curve of 10-fold cross validation with TAGPPI. (B) Is the ROC curve of 10-fold cross validation with SeqVec. (C) Is the ROC curve of 10-fold cross validation with ProSE.
FIGURE 3
FIGURE 3
The ROC of Streptococcus mutans binding sites. (A) Is the ROC curve of 10-fold cross validation with TAGPPI. (B) Is the ROC curve of 10-fold cross validation with SeqVec. (C) Is the ROC curve of 10-fold cross validation with ProSE.
FIGURE 4
FIGURE 4
The ROC of Staphylococcus aureus active sites. (A) Is the ROC curve of 10-fold cross validation with TAGPPI. (B) Is the ROC curve of 10-fold cross validation with SeqVec. (C) Is the ROC curve of 10-fold cross validation with ProSE.
FIGURE 5
FIGURE 5
The ROC of Staphylococcus aureus binding sites. (A) Is the ROC curve of 10-fold cross validation with TAGPPI. (B) Is the ROC curve of 10-fold cross validation with SeqVec. (C) Is the ROC curve of 10-fold cross validation with ProSE.
FIGURE 6
FIGURE 6
The ROC of Abiotrophia adjacens active sites. (A) Is the ROC curve of 10-fold cross validation with TAGPPI. (B) Is the ROC curve of 10-fold cross validation with SeqVec. (C) Is the ROC curve of 10-fold cross validation with ProSE.
FIGURE 7
FIGURE 7
The ROC of Abiotrophia adiacens binding sites. (A) Is the ROC curve of 10-fold cross validation with TAGPPI. (B) Is the ROC curve of 10-fold cross validation with SeqVec. (C) Is the ROC curve of 10-fold cross validation with ProSE.
FIGURE 8
FIGURE 8
The ROC of Bifidobacterial active sites. (A) Is the ROC curve of 10-fold cross validation with TAGPPI. (B) Is the ROC curve of 10-fold cross validation with SeqVec. (C) Is the ROC curve of 10-fold cross validation with ProSE.
FIGURE 9
FIGURE 9
The ROC of Bifidobacterial binding sites. (A) Is the ROC curve of 10-fold cross validation with TAGPPI. (B) Is the ROC curve of 10-fold cross validation with SeqVec. (C) Is the ROC curve of 10-fold cross validation with ProSE.
FIGURE 10
FIGURE 10
The ROC of Capnocytophaga active sites. (A) Is the ROC curve of 10-fold cross validation with TAGPPI. (B) Is the ROC curve of 10-fold cross validation with SeqVec. (C) Is the ROC curve of 10-fold cross validation with ProSE.
FIGURE 11
FIGURE 11
The ROC of Capnocytophaga binding sites. (A) Is the ROC curve of 10-fold cross validation with TAGPPI. (B) Is the ROC curve of 10-fold cross validation with SeqVec. (C) Is the ROC curve of 10-fold cross validation with ProSE.

References

    1. Arlot S., Genuer R. (2016). Comments on: A random forest guided tour. Test 25, 228–238. 10.1007/s11749-016-0484-4 - DOI
    1. Awais M., Hussain W., Khan Y., Rasool N., Khan S., Chou K. (2019). iPhosH-PseAAC: identify phosphohistidine sites in proteins by blending statistical moments and position relative features according to the Chou’s 5-step rule and general pseudo amino acid composition. IEEE/ACM Trans. Comput. Biol. Bioinform. 18 596–610. 10.1109/TCBB.2019.2919025 - DOI - PubMed
    1. Bradford J. R., Westhead D. R. (2005). Improved prediction of protein–protein binding sites using a support vector machines approach. Bioinformatics 21 1487–1494. 10.1093/bioinformatics/bti242 - DOI - PubMed
    1. Brohee S., van Helden J. (2006). Evaluation of clustering algorithms for protein-protein interaction networks. BMC Bioinformatics 7:488. 10.1186/1471-2105-7-488 - DOI - PMC - PubMed
    1. Chatterjee P., Basu S., Kundu M., Nasipuri M., Plewczynski D. (2011). PPI_SVM: prediction of protein-protein interactions using machine learning, domain-domain affinities and frequency tables. Cell. Mol. Biol. Lett. 16 264–278. 10.2478/s11658-011-0008-x - DOI - PMC - PubMed

LinkOut - more resources