POCP-nf: an automatic Nextflow pipeline for calculating the percentage of conserved proteins in bacterial taxonomy
- PMID: 38561180
- PMCID: PMC11256958
- DOI: 10.1093/bioinformatics/btae175
POCP-nf: an automatic Nextflow pipeline for calculating the percentage of conserved proteins in bacterial taxonomy
Abstract
Summary: Sequence technology advancements have led to an exponential increase in bacterial genomes, necessitating robust taxonomic classification methods. The Percentage Of Conserved Proteins (POCP), proposed initially by Qin et al. (2014), is a valuable metric for assessing prokaryote genus boundaries. Here, I introduce a computational pipeline for automated POCP calculation, aiming to enhance reproducibility and ease of use in taxonomic studies.
Availability and implementation: The POCP-nf pipeline uses DIAMOND for faster protein alignments, achieving similar sensitivity to BLASTP. The pipeline is implemented in Nextflow with Conda and Docker support and is freely available on GitHub under https://github.com/hoelzer/pocp. The open-source code can be easily adapted for various prokaryotic genome and protein datasets. Detailed documentation and usage instructions are provided in the repository.
© The Author(s) 2024. Published by Oxford University Press.
Conflict of interest statement
None declared.
Figures

References
-
- Aliyu H, Lebre P, Blom J et al. Phylogenomic re-assessment of the thermophilic genus Geobacillus. Syst Appl Microbiol 2016;39:527–33. - PubMed
-
- Amulyasai B, Anusha R, Sasikala C et al. Phylogenomic analysis of a metagenome-assembled genome indicates a new taxon of an anoxygenic phototroph bacterium in the family Chromatiaceae and the proposal of “Candidatus thioaporhodococcus” gen. nov. Arch Microbiol 2022;204:688. - PubMed
-
- Azpiazu-Muniozguren M, García M, Laorden L et al. Anianabacter salinae gen. nov., sp. nov. ASV31T, a facultative alkaliphilic and extremely halotolerant bacterium isolated from brine of a millennial continental saltern. Diversity 2022;14:1009.
MeSH terms
LinkOut - more resources
Full Text Sources