BHap: a novel approach for bacterial haplotype reconstruction
- PMID: 31004480
- PMCID: PMC6931272
- DOI: 10.1093/bioinformatics/btz280
BHap: a novel approach for bacterial haplotype reconstruction
Abstract
Motivation: The bacterial haplotype reconstruction is critical for selecting proper treatments for diseases caused by unknown haplotypes. Existing methods and tools do not work well on this task, because they are usually developed for viral instead of bacterial populations.
Results: In this study, we developed BHap, a novel algorithm based on fuzzy flow networks, for reconstructing bacterial haplotypes from next generation sequencing data. Tested on simulated and experimental datasets, we showed that BHap was capable of reconstructing haplotypes of bacterial populations with an average F1 score of 0.87, an average precision of 0.87 and an average recall of 0.88. We also demonstrated that BHap had a low susceptibility to sequencing errors, was capable of reconstructing haplotypes with low coverage and could handle a wide range of mutation rates. Compared with existing approaches, BHap outperformed them in terms of higher F1 scores, better precision, better recall and more accurate estimation of the number of haplotypes.
Availability and implementation: The BHap tool is available at http://www.cs.ucf.edu/∼xiaoman/BHap/.
Supplementary information: Supplementary data are available at Bioinformatics online.
© The Author(s) 2019. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Figures



Similar articles
-
Reconstructing viral haplotypes using long reads.Bioinformatics. 2022 Apr 12;38(8):2127-2134. doi: 10.1093/bioinformatics/btac089. Bioinformatics. 2022. PMID: 35157018
-
Evaluation of haplotype callers for next-generation sequencing of viruses.Infect Genet Evol. 2020 Aug;82:104277. doi: 10.1016/j.meegid.2020.104277. Epub 2020 Mar 6. Infect Genet Evol. 2020. PMID: 32151775 Free PMC article.
-
De novo haplotype reconstruction in viral quasispecies using paired-end read guided path finding.Bioinformatics. 2018 Sep 1;34(17):2927-2935. doi: 10.1093/bioinformatics/bty202. Bioinformatics. 2018. PMID: 29617936
-
mixtureS: a novel tool for bacterial strain genome reconstruction from reads.Bioinformatics. 2021 May 1;37(4):575-577. doi: 10.1093/bioinformatics/btaa728. Bioinformatics. 2021. PMID: 32805048 Free PMC article.
-
Benchmarking of viral haplotype reconstruction programmes: an overview of the capacities and limitations of currently available programmes.Brief Bioinform. 2014 May;15(3):431-42. doi: 10.1093/bib/bbs081. Epub 2012 Dec 19. Brief Bioinform. 2014. PMID: 23257116 Review.
Cited by
-
Scalable Microbial Strain Inference in Metagenomic Data Using StrainFacts.Front Bioinform. 2022 May 16;2:867386. doi: 10.3389/fbinf.2022.867386. eCollection 2022. Front Bioinform. 2022. PMID: 36304283 Free PMC article.
-
Sequencing-based analysis of microbiomes.Nat Rev Genet. 2024 Dec;25(12):829-845. doi: 10.1038/s41576-024-00746-6. Epub 2024 Jun 25. Nat Rev Genet. 2024. PMID: 38918544 Review.
-
Reconstruction of evolving gene variants and fitness from short sequencing reads.Nat Chem Biol. 2021 Nov;17(11):1188-1198. doi: 10.1038/s41589-021-00876-6. Epub 2021 Oct 11. Nat Chem Biol. 2021. PMID: 34635842 Free PMC article.
-
A revisit to universal single-copy genes in bacterial genomes.Sci Rep. 2022 Aug 25;12(1):14550. doi: 10.1038/s41598-022-18762-z. Sci Rep. 2022. PMID: 36008577 Free PMC article.
-
Floria: fast and accurate strain haplotyping in metagenomes.Bioinformatics. 2024 Jun 28;40(Suppl 1):i30-i38. doi: 10.1093/bioinformatics/btae252. Bioinformatics. 2024. PMID: 38940183 Free PMC article.
References
-
- Glenn T.C. (2011) Field guide to next‐generation DNA sequencers. Mol. Ecol. Resour., 11, 759–769. - PubMed