Sequencing quality assessment tools to enable data-driven informatics for high throughput genomics
- PMID: 24381581
- PMCID: PMC3865868
- DOI: 10.3389/fgene.2013.00288
Abstract
The processes of quality assessment and control are an active area of research at The Genome Analysis Centre (TGAC). Unlike other sequencing centers that often concentrate on a certain species or technology, TGAC applies expertise in genomics and bioinformatics to a wide range of projects, often requiring bespoke wet lab and in silico workflows. TGAC is fortunate to have access to a diverse range of sequencing and analysis platforms, and we are at the forefront of investigations into library quality and sequence data assessment. We have developed and implemented a number of algorithms, tools, pipelines and packages to ascertain, store, and expose quality metrics across a number of next-generation sequencing platforms, allowing rapid and in-depth cross-platform Quality Control (QC) bioinformatics. In this review, we describe these tools as a vehicle for data-driven informatics, offering the potential to provide richer context for downstream analysis and to inform experimental design.
Keywords: NGS data analysis; QC; bioinformatics tools; contamination screening; quality assessment and improvement; quality control; run statistics; sequence analysis.
