3CAC: improving the classification of phages and plasmids in metagenomic assemblies using assembly graphs
- PMID: 36124804
- DOI: 10.1093/bioinformatics/btac468
3CAC: improving the classification of phages and plasmids in metagenomic assemblies using assembly graphs
Abstract
Motivation: Bacteriophages and plasmids usually coexist with their host bacteria in microbial communities and play important roles in microbial evolution. Accurately identifying sequence contigs as phages, plasmids and bacterial chromosomes in mixed metagenomic assemblies is critical for further unraveling their functions. Many classification tools have been developed for identifying either phages or plasmids in metagenomic assemblies. However, only two classifiers, PPR-Meta and viralVerify, were proposed to simultaneously identify phages and plasmids in mixed metagenomic assemblies. Due to the very high fraction of chromosome contigs in the assemblies, both tools achieve high precision in the classification of chromosomes but perform poorly in classifying phages and plasmids. Short contigs in these assemblies are often wrongly classified or classified as uncertain.
Results: Here we present 3CAC, a new three-class classifier that improves the precision of phage and plasmid classification. 3CAC starts with an initial three-class classification generated by existing classifiers and improves the classification of short contigs and contigs with low confidence classification by using proximity in the assembly graph. Evaluation on simulated metagenomes and on real human gut microbiome samples showed that 3CAC outperformed PPR-Meta and viralVerify in both precision and recall, and increased F1-score by 10-60 percentage points.
Availability and implementation: The 3CAC software is available on https://github.com/Shamir-Lab/3CAC.
Supplementary information: Supplementary data are available at Bioinformatics online.
© The Author(s) 2022. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Similar articles
-
PPR-Meta: a tool for identifying phages and plasmids from metagenomic fragments using deep learning.Gigascience. 2019 Jun 1;8(6):giz066. doi: 10.1093/gigascience/giz066. Gigascience. 2019. PMID: 31220250 Free PMC article.
-
4CAC: 4-class classifier of metagenome contigs using machine learning and assembly graphs.Nucleic Acids Res. 2024 Oct 28;52(19):e94. doi: 10.1093/nar/gkae799. Nucleic Acids Res. 2024. PMID: 39287139 Free PMC article.
-
SCAPP: an algorithm for improved plasmid assembly in metagenomes.Microbiome. 2021 Jun 25;9(1):144. doi: 10.1186/s40168-021-01068-z. Microbiome. 2021. PMID: 34172093 Free PMC article.
-
Fishing for phages in metagenomes: what do we catch, what do we miss?Curr Opin Virol. 2021 Aug;49:142-150. doi: 10.1016/j.coviro.2021.05.008. Epub 2021 Jun 15. Curr Opin Virol. 2021. PMID: 34139668 Review.
-
Phage family classification under Caudoviricetes: A review of current tools using the latest ICTV classification framework.Front Microbiol. 2022 Dec 16;13:1032186. doi: 10.3389/fmicb.2022.1032186. eCollection 2022. Front Microbiol. 2022. PMID: 36590402 Free PMC article. Review.
Cited by
-
Vertebral Column Pathology Diagnosis Using Ensemble Strategies Based on Supervised Machine Learning Techniques.Healthcare (Basel). 2024 Jul 2;12(13):1324. doi: 10.3390/healthcare12131324. Healthcare (Basel). 2024. PMID: 38998860 Free PMC article.
-
Classification of bacterial plasmid and chromosome derived sequences using machine learning.PLoS One. 2022 Dec 16;17(12):e0279280. doi: 10.1371/journal.pone.0279280. eCollection 2022. PLoS One. 2022. PMID: 36525447 Free PMC article.
-
Exploring microbial functional biodiversity at the protein family level-From metagenomic sequence reads to annotated protein clusters.Front Bioinform. 2023 Mar 3;3:1157956. doi: 10.3389/fbinf.2023.1157956. eCollection 2023. Front Bioinform. 2023. PMID: 36959975 Free PMC article. Review.
-
Evaluation of computational phage detection tools for metagenomic datasets.Front Microbiol. 2023 Jan 25;14:1078760. doi: 10.3389/fmicb.2023.1078760. eCollection 2023. Front Microbiol. 2023. PMID: 36760501 Free PMC article.
-
plASgraph2: using graph neural networks to detect plasmid contigs from an assembly graph.Front Microbiol. 2023 Oct 6;14:1267695. doi: 10.3389/fmicb.2023.1267695. eCollection 2023. Front Microbiol. 2023. PMID: 37869681 Free PMC article.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Miscellaneous