An intermediate grade of finished genomic sequence suitable for comparative analyses
- PMID: 15479945
- PMCID: PMC525681
- DOI: 10.1101/gr.2648404
An intermediate grade of finished genomic sequence suitable for comparative analyses
Abstract
Although the cost of generating draft-quality genomic sequence continues to decline, refining that sequence by the process of "sequence finishing" remains expensive. Near-perfect finished sequence is an appropriate goal for the human genome and a small set of reference genomes; however, such a high-quality product cannot be cost-justified for large numbers of additional genomes, at least for the foreseeable future. Here we describe the generation and quality of an intermediate grade of finished genomic sequence (termed comparative-grade finished sequence), which is tailored for use in multispecies sequence comparisons. Our analyses indicate that this sequence is very high quality (with the residual gaps and errors mostly falling within repetitive elements) and reflects 99% of the total sequence. Importantly, comparative-grade sequence finishing requires approximately 40-fold less reagents and approximately 10-fold less personnel effort compared to the generation of near-perfect finished sequence, such as that produced for the human genome. Although applied here to finishing sequence derived from individual bacterial artificial chromosome (BAC) clones, one could envision establishing routines for refining sequences emanating from whole-genome shotgun sequencing projects to a similar quality level. Our experience to date demonstrates that comparative-grade sequence finishing represents a practical and affordable option for sequence refinement en route to comparative analyses.
Figures



Similar articles
-
Parallel construction of orthologous sequence-ready clone contig maps in multiple species.Genome Res. 2002 Aug;12(8):1277-85. doi: 10.1101/gr.283202. Genome Res. 2002. PMID: 12176935 Free PMC article.
-
Sequencing of 6.7 Mb of the melon genome using a BAC pooling strategy.BMC Plant Biol. 2010 Nov 12;10:246. doi: 10.1186/1471-2229-10-246. BMC Plant Biol. 2010. PMID: 21073723 Free PMC article.
-
Mouse BAC ends quality assessment and sequence analyses.Genome Res. 2001 Oct;11(10):1736-45. doi: 10.1101/gr.179201. Genome Res. 2001. PMID: 11591651 Free PMC article.
-
454 sequencing put to the test using the complex genome of barley.BMC Genomics. 2006 Oct 26;7:275. doi: 10.1186/1471-2164-7-275. BMC Genomics. 2006. PMID: 17067373 Free PMC article.
-
One chromosome, one contig: complete microbial genomes from long-read sequencing and assembly.Curr Opin Microbiol. 2015 Feb;23:110-20. doi: 10.1016/j.mib.2014.11.014. Epub 2014 Dec 1. Curr Opin Microbiol. 2015. PMID: 25461581 Review.
Cited by
-
Comparative genomic analysis of eutherian fibroblast growth factor genes.BMC Genomics. 2020 Aug 5;21(1):542. doi: 10.1186/s12864-020-06958-4. BMC Genomics. 2020. PMID: 32758140 Free PMC article.
-
Birth-and-death of KLK3 and KLK2 in primates: evolution driven by reproductive biology.Genome Biol Evol. 2012;4(12):1331-8. doi: 10.1093/gbe/evs111. Genome Biol Evol. 2012. PMID: 23204305 Free PMC article.
-
BAC-based sequencing of behaviorally-relevant genes in the prairie vole.PLoS One. 2012;7(1):e29345. doi: 10.1371/journal.pone.0029345. Epub 2012 Jan 6. PLoS One. 2012. PMID: 22238603 Free PMC article.
-
A new strategy for genome assembly using short sequence reads and reduced representation libraries.Genome Res. 2010 Feb;20(2):249-56. doi: 10.1101/gr.097956.109. Genome Res. 2010. PMID: 20123915 Free PMC article.
-
Patterns of tandem repetition in plant whole genome assemblies.Mol Genet Genomics. 2009 Jun;281(6):579-90. doi: 10.1007/s00438-009-0433-y. Epub 2009 Feb 26. Mol Genet Genomics. 2009. PMID: 19242726
References
-
- Adams, M.D., Celniker, S.E., Holt, R.A., Evans, C.A., Gocayne, J.D., Amanatides, P.G., Scherer, S.E., Li, P.W., Hoskins, R.A., Galle, R.F., et al. 2000. The genome sequence of Drosophila melanogaster. Science 287: 2185-2195. - PubMed
-
- Aparicio, S., Chapman, J., Stupka, E., Putnam, N., Chia, J., Dehal, P., Christoffels, A., Rash, S., Hoon, S., Smit, A., et al. 2002. Whole-genome shotgun assembly and analysis of the genome of Fugu rubripes. Science 297: 1301-1310. - PubMed
-
- Ashurst, J.L. and Collins, J.E. 2003. Gene annotation: Prediction and testing. Annu. Rev. Genomics Hum. Genet. 4: 69-88. - PubMed
-
- Bailey, J.A., Gu, Z., Clark, R.A., Reinert, K., Samonte, R.V., Schwartz, S., Adams, M.D., Myers, E.W., Li, P.W., and Eichler, E.E. 2002. Recent segmental duplications in the human genome. Science 297: 1003-1007. - PubMed
WEB SITE REFERENCES
-
- www.nisc.nih.gov/data; source of Supplemental data and information.
-
- www.nisc.nih.gov; NIH Intramural Sequencing Center (NISC) home page and source of information for the NISC Comparative Sequencing Program.
-
- www.genome.wustl.edu/Overview/finrulesname.php?G16=1; quality specifications for the finished human genome sequence.
-
- www.ncbi.nlm.nih.gov/HTGS; definitions of different phases of genomic sequence.
-
- www.phrap.org; source of Phred, Phrap, Consed, and Cross_Match software.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Research Materials