Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Oct 6;17(10):e0275195.
doi: 10.1371/journal.pone.0275195. eCollection 2022.

A novel deep learning-assisted hybrid network for plasmodium falciparum parasite mitochondrial proteins classification

Affiliations

A novel deep learning-assisted hybrid network for plasmodium falciparum parasite mitochondrial proteins classification

Wafa Alameen Alsanousi et al. PLoS One. .

Abstract

Plasmodium falciparum is a parasitic protozoan that can cause malaria, which is a deadly disease. Therefore, the accurate identification of malaria parasite mitochondrial proteins is essential for understanding their functions and identifying novel drug targets. For classifying protein sequences, several adaptive statistical techniques have been devised. Despite significant gains, prediction performance is still constrained by the lack of appropriate feature descriptors and learning strategies in current systems. Moreover, good ground truth data is important for Artificial Intelligence (AI)-based models but there is a lack of that data in the literature. Therefore, in this work, we propose a novel hybrid network that combines 1D Convolutional Neural Network (CNN) and Bidirectional Gated Recurrent Unit (BGRU) to classify the malaria parasite mitochondrial proteins. Furthermore, we curate a sequential data that are collected from National Center for Biotechnology Information (NCBI) and UniProtKB/Swiss-Prot proteins databanks to prepare a dataset that can be used by the research community for AI-based algorithms evaluation. We obtain 4204 cases after preprocessing of the collected data and denote this set of proteins as PF4204. Finally, we conduct an ablation study on several conventional and deep models using PF4204 and the benchmark PF2095 datasets. The proposed model 'CNN-BGRU' obtains the accuracy values of 0.9096 and 0.9857 on PF4204 and PF2095 datasets, respectively. In addition, the CNN-BGRU is compared with state-of-the-arts, where the results illustrate that it can extract robust features and identify proteins accurately.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. Overview of the proposed hybrid architecture for plasmodium mitochondrial proteins classification.
Fig 2
Fig 2. (a) represents unit structure of GRU while (b) shows the working flow of BGRU.
Fig 3
Fig 3. Comparative confusion matrices of different models using PF4204 dataset and hold-out validation method.
Fig 4
Fig 4. Comparative confusion matrices of different models using PF2095 dataset and hold-out validation method.
Fig 5
Fig 5. (a) statistics of the proteins datasets, (b) comparative analysis with state-of-the-art model.

References

    1. Fernández‐Moya S. M. and Estévez A. M., “Posttranscriptional control and the role of RNA‐binding proteins in gene regulation in trypanosomatid protozoan parasites,” Wiley Interdisciplinary Reviews: RNA, vol. 1, pp. 34–46, 2010. doi: 10.1002/wrna.6 - DOI - PubMed
    1. Cibulskis R. E., Alonso P., Aponte J., Aregawi M., Barrette A., Bergeron L., et al.., “Malaria: global progress 2000–2015 and future challenges,” Infectious diseases of poverty, vol. 5, pp. 1–8, 2016. - PMC - PubMed
    1. Gardner M. J., Hall N., Fung E., White O., Berriman M., Hyman R. W., et al.., “Genome sequence of the human malaria parasite Plasmodium falciparum,” Nature, vol. 419, pp. 498–511, 2002. doi: 10.1038/nature01097 - DOI - PMC - PubMed
    1. Cox F. E., “History of the discovery of the malaria parasites and their vectors,” Parasites & vectors, vol. 3, pp. 1–9, 2010. doi: 10.1186/1756-3305-3-5 - DOI - PMC - PubMed
    1. Reid A. J., Talman A. M., Bennett H. M., Gomes A. R., Sanders M. J., Illingworth C. J., et al.., “Single-cell RNA-seq reveals hidden transcriptional variation in malaria parasites,” Elife, vol. 7, p. e33105, 2018. doi: 10.7554/eLife.33105 - DOI - PMC - PubMed

Publication types

Substances