Large disclosing the nature of computational tools for the analysis of next generation sequencing data
- PMID: 22690679
- DOI: 10.2174/156802612801319007
Large disclosing the nature of computational tools for the analysis of next generation sequencing data
Abstract
Next-generation sequencing (NGS) technologies are rapidly changing the approach to complex genomic studies, opening the way to personalized drugs development and personalized medicine. NGS technologies are characterized by a massive throughput for relatively short-sequences (30-100), and they are currently the most reliable and accurate method for grouping individuals on the basis of their genetic profiles. The first and crucial step in sequence analysis is the conversion of millions of short sequences (reads) into valuable genetic information by their mapping to a known (reference) genome. New computational methods, specifically designed for the type and the amount of data generated by NGS technologies, are replacing earlier widespread genome alignment algorithms which are unable to cope with such massive amount of data. This review provides an overview of the bioinformatics techniques that have been developed for the mapping of NGS data onto a reference genome, with a special focus on polymorphism rate and sequence error detection. The different techniques have been experimented on an appropriately defined dataset, to investigate their relative computational costs and usability, as seen from an user perspective. Since NGS platforms interrogate the genome using either the conventional nucleotide space or the more recent color space, this review does consider techniques both in nucleotide and color space, emphasizing similarities and diversities.
Similar articles
-
Review of alignment and SNP calling algorithms for next-generation sequencing data.J Appl Genet. 2016 Feb;57(1):71-9. doi: 10.1007/s13353-015-0292-7. Epub 2015 Jun 9. J Appl Genet. 2016. PMID: 26055432 Review.
-
A comparative study of k-spectrum-based error correction methods for next-generation sequencing data analysis.Hum Genomics. 2016 Jul 25;10 Suppl 2(Suppl 2):20. doi: 10.1186/s40246-016-0068-0. Hum Genomics. 2016. PMID: 27461106 Free PMC article.
-
SeqAssist: a novel toolkit for preliminary analysis of next-generation sequencing data.BMC Bioinformatics. 2014;15 Suppl 11(Suppl 11):S10. doi: 10.1186/1471-2105-15-S11-S10. Epub 2014 Oct 21. BMC Bioinformatics. 2014. PMID: 25349885 Free PMC article.
-
Fast and memory efficient approach for mapping NGS reads to a reference genome.J Bioinform Comput Biol. 2019 Apr;17(2):1950008. doi: 10.1142/S0219720019500082. J Bioinform Comput Biol. 2019. PMID: 31057068
-
Next-generation sequencing technologies and fragment assembly algorithms.Methods Mol Biol. 2012;855:155-74. doi: 10.1007/978-1-61779-582-4_5. Methods Mol Biol. 2012. PMID: 22407708 Review.
Cited by
-
Navigating the rapids: the development of regulated next-generation sequencing-based clinical trial assays and companion diagnostics.Front Oncol. 2014 Apr 17;4:78. doi: 10.3389/fonc.2014.00078. eCollection 2014. Front Oncol. 2014. PMID: 24860780 Free PMC article. Review.
-
Next-generation sequencing: from understanding biology to personalized medicine.Biology (Basel). 2013 Mar 1;2(1):378-98. doi: 10.3390/biology2010378. Biology (Basel). 2013. PMID: 24832667 Free PMC article.
-
An integrative computational approach for prioritization of genomic variants.PLoS One. 2014 Dec 15;9(12):e114903. doi: 10.1371/journal.pone.0114903. eCollection 2014. PLoS One. 2014. PMID: 25506935 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources