Full-length haplotype reconstruction to infer the structure of heterogeneous virus populations
- PMID: 24972832
- PMCID: PMC4132706
- DOI: 10.1093/nar/gku537
Full-length haplotype reconstruction to infer the structure of heterogeneous virus populations
Abstract
Next-generation sequencing (NGS) technologies enable new insights into the diversity of virus populations within their hosts. Diversity estimation is currently restricted to single-nucleotide variants or to local fragments of no more than a few hundred nucleotides defined by the length of sequence reads. To study complex heterogeneous virus populations comprehensively, novel methods are required that allow for complete reconstruction of the individual viral haplotypes. Here, we show that assembly of whole viral genomes of ∼8600 nucleotides length is feasible from mixtures of heterogeneous HIV-1 strains derived from defined combinations of cloned virus strains and from clinical samples of an HIV-1 superinfected individual. Haplotype reconstruction was achieved using optimized experimental protocols and computational methods for amplification, sequencing and assembly. We comparatively assessed the performance of the three NGS platforms 454 Life Sciences/Roche, Illumina and Pacific Biosciences for this task. Our results prove and delineate the feasibility of NGS-based full-length viral haplotype reconstruction and provide new tools for studying evolution and pathogenesis of viruses.
© The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Figures
Similar articles
-
Evaluation of haplotype callers for next-generation sequencing of viruses.Infect Genet Evol. 2020 Aug;82:104277. doi: 10.1016/j.meegid.2020.104277. Epub 2020 Mar 6. Infect Genet Evol. 2020. PMID: 32151775 Free PMC article.
-
A penalized regression approach to haplotype reconstruction of viral populations arising in early HIV/SIV infection.Bioinformatics. 2017 Aug 15;33(16):2455-2463. doi: 10.1093/bioinformatics/btx187. Bioinformatics. 2017. PMID: 28379346 Free PMC article.
-
Next-generation sequencing of HIV-1 RNA genomes: determination of error rates and minimizing artificial recombination.PLoS One. 2013 Sep 18;8(9):e74249. doi: 10.1371/journal.pone.0074249. eCollection 2013. PLoS One. 2013. PMID: 24058534 Free PMC article.
-
Benchmarking of viral haplotype reconstruction programmes: an overview of the capacities and limitations of currently available programmes.Brief Bioinform. 2014 May;15(3):431-42. doi: 10.1093/bib/bbs081. Epub 2012 Dec 19. Brief Bioinform. 2014. PMID: 23257116 Review.
-
Recent advances in inferring viral diversity from high-throughput sequencing data.Virus Res. 2017 Jul 15;239:17-32. doi: 10.1016/j.virusres.2016.09.016. Epub 2016 Sep 28. Virus Res. 2017. PMID: 27693290 Review.
Cited by
-
Improving the sensitivity of long read overlap detection using grouped short k-mer matches.BMC Genomics. 2019 Apr 4;20(Suppl 2):190. doi: 10.1186/s12864-019-5475-x. BMC Genomics. 2019. PMID: 30967123 Free PMC article.
-
Mapping the Evolutionary Potential of RNA Viruses.Cell Host Microbe. 2018 Apr 11;23(4):435-446. doi: 10.1016/j.chom.2018.03.012. Cell Host Microbe. 2018. PMID: 29649440 Free PMC article. Review.
-
Mutational signatures and heterogeneous host response revealed via large-scale characterization of SARS-CoV-2 genomic diversity.iScience. 2021 Feb 19;24(2):102116. doi: 10.1016/j.isci.2021.102116. Epub 2021 Jan 28. iScience. 2021. PMID: 33532709 Free PMC article.
-
VILOCA: sequencing quality-aware viral haplotype reconstruction and mutation calling for short-read and long-read data.NAR Genom Bioinform. 2024 Nov 28;6(4):lqae152. doi: 10.1093/nargab/lqae152. eCollection 2024 Dec. NAR Genom Bioinform. 2024. PMID: 39633724 Free PMC article.
-
Self-reported neurocognitive complaints in the Swiss HIV Cohort Study: a viral genome-wide association study.Brain Commun. 2024 May 31;6(4):fcae188. doi: 10.1093/braincomms/fcae188. eCollection 2024. Brain Commun. 2024. PMID: 38961872 Free PMC article.
References
-
- Nowak M.A. What is a quasispecies? Trends Ecol. Evol. 1992;7:118–121. - PubMed
-
- Metzker M.L. Sequencing technologies—the next generation. Nat. Rev. Genet. 2010;11:31–46. - PubMed
-
- Macalalad A.R., Zody M.C., Charlebois P., Lennon N.J., Newman R.M., Malboeuf C.M., Ryan E.M., Boutwell C.L., Power K.A., Brackney D.E., et al. Highly sensitive and specific detection of rare variants in mixed viral populations from massively parallel sequence data. PLoS Comput. Biol. 2012;8:e1002417. - PMC - PubMed
-
- Henn M.R., Boutwell C.L., Charlebois P., Lennon N.J., Power K.A., Macalalad A.R., Berlin A.M., Malboeuf C.M., Ryan E.M., Gnerre S., et al. Whole genome deep sequencing of HIV-1 reveals the impact of early minor variants upon immune recognition during acute infection. PLoS Pathog. 2012;8:e1002529. - PMC - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
