Short reads and nonmodel species: exploring the complexities of next-generation sequence assembly and SNP discovery in the absence of a reference genome
- PMID: 21429166
- DOI: 10.1111/j.1755-0998.2010.02969.x
Abstract
How practical is gene and SNP discovery in a nonmodel species using short read sequences? Next-generation sequencing technologies are being applied to an increasing number of species with no reference genome. For nonmodel species, the cost, availability of existing genetic resources, genome complexity and the planned method of assembly must all be considered when selecting a sequencing platform. Our goal was to examine the feasibility and optimal methodology for SNP and gene discovery in the sockeye salmon (Oncorhynchus nerka) using short read sequences. SOLiD short reads (up to 50 bp) were generated from single- and pooled-tissue transcriptome libraries from ten sockeye salmon. The individuals were from five distinct populations from the Wood River Lakes and Mendeltna Creek, Alaska. As no reference genome was available for sockeye salmon, the SOLiD sequence reads were assembled to publicly available EST reference sequences from sockeye salmon and two closely related species, rainbow trout (Oncorhynchus mykiss) and Atlantic salmon (Salmo salar). Additionally, de novo assembly of the SOLiD data was carried out, and the SOLiD reads were remapped to the de novo contigs. The results from each reference assembly were compared across all references. The number and size of contigs assembled varied with the size of the reference sequences. In silico SNP discovery was carried out on contigs from all four EST references; however, discovery of valid SNPs was most successful using one of the two conspecific references.
© 2011 Blackwell Publishing Ltd.