The genome of flax (Linum usitatissimum) assembled de novo from short shotgun sequence reads
- PMID: 22757964
- DOI: 10.1111/j.1365-313X.2012.05093.x
Abstract
Flax (Linum usitatissimum) is an ancient crop that is widely cultivated as a source of fiber, oil and medicinally relevant compounds. To accelerate crop improvement, we performed whole-genome shotgun sequencing of the nuclear genome of flax. Seven paired-end libraries ranging in size from 300 bp to 10 kb were sequenced using an Illumina Genome Analyzer. A de novo assembly, composed exclusively of deep-coverage (approximately 94× raw, approximately 69× filtered) short-sequence reads (44-100 bp), produced a set of scaffolds with N50 = 694 kb, including contigs with N50 = 20.1 kb. The contig assembly contained 302 Mb of non-redundant sequence, representing an estimated 81% genome coverage. Up to 96% of published flax ESTs aligned to the whole-genome shotgun scaffolds. However, comparisons with independently sequenced BACs and fosmids showed some mis-assembly of regions at the genome scale. A total of 43,384 protein-coding genes were predicted in the whole-genome shotgun assembly, and up to 93% of published flax ESTs and 86% of A. thaliana genes aligned to these predicted genes, indicating excellent coverage and accuracy at the gene level. Analysis of the synonymous substitution rates (Ks) observed within duplicate gene pairs was consistent with a recent (5-9 MYA) whole-genome duplication in flax. Within the predicted proteome, we observed enrichment of many conserved domains (Pfam-A) that may contribute to the unique properties of this crop, including agglutinin proteins. Together, these results show that de novo assembly, based solely on whole-genome shotgun short-sequence reads, is an efficient means of obtaining nearly complete genome sequence information for some plant species.
© 2012 The Authors. The Plant Journal © 2012 Blackwell Publishing Ltd.
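The abstract reports assembly contiguity as N50 values and gives the assembled contig length alongside an estimated genome coverage. For readers unfamiliar with these figures, the following is a minimal sketch of how an N50 is conventionally computed, plus the back-of-envelope genome size implied by 302 Mb at 81% coverage. It is not the authors' pipeline: the function name `n50` and the toy contig lengths are hypothetical, and the genome-size figure is simple arithmetic on the two numbers quoted in the abstract, not a value stated there.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    N50 is the length L such that sequences of length >= L together
    contain at least half of the total assembled bases.
    """
    if not lengths:
        raise ValueError("need at least one sequence length")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Hypothetical toy lengths (arbitrary units), not data from the paper.
toy_contigs = [80, 70, 50, 40, 30, 20, 10]
print(n50(toy_contigs))  # 70: 80 + 70 = 150, half of the 300 total

# Assumption: "81% genome coverage" means assembled bases / genome size.
assembled_mb = 302
coverage_fraction = 0.81
implied_genome_mb = assembled_mb / coverage_fraction
print(round(implied_genome_mb))  # 373
```

Under that reading of coverage, the two reported figures imply a nuclear genome of roughly 373 Mb.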