The Pinus taeda genome is characterized by diverse and highly diverged repetitive sequences
- PMID: 20609256
- PMCID: PMC2996948
- DOI: 10.1186/1471-2164-11-420
The Pinus taeda genome is characterized by diverse and highly diverged repetitive sequences
Abstract
Background: In today's age of genomic discovery, no attempt has been made to comprehensively sequence a gymnosperm genome. The largest genus in the coniferous family Pinaceae is Pinus, whose 110-120 species have extremely large genomes (c. 20-40 Gb, 2N = 24). The size and complexity of these genomes have prompted much speculation as to the feasibility of completing a conifer genome sequence. Conifer genomes are reputed to be highly repetitive, but there is little information available on the nature and identity of repetitive units in gymnosperms. The pines have extensive genetic resources, with approximately 329000 ESTs from eleven species and genetic maps in eight species, including a dense genetic map of the twelve linkage groups in Pinus taeda.
Results: We present here the Sanger sequence and annotation of ten P. taeda BAC clones and Genome Analyzer II whole genome shotgun (WGS) sequences representing 7.5% of the genome. Computational annotation of ten BACs predicts three putative protein-coding genes and at least fifteen likely pseudogenes in nearly one megabase of sequence. We found three conifer-specific LTR retroelements in the BACs, and tentatively identified at least 15 others based on evidence from the distantly related angiosperms. Alignment of WGS sequences to the BACs indicates that 80% of BAC sequences have similar copies (> or = 75% nucleotide identity) elsewhere in the genome, but only 23% have identical copies (99% identity). The three most common repetitive elements in the genome were identified and, when combined, represent less than 5% of the genome.
Conclusions: This study indicates that the majority of repeats in the P. taeda genome are 'novel' and will therefore require additional BAC or genomic sequencing for accurate characterization. The pine genome contains a very large number of diverged and probably defunct repetitive elements. This study also provides new evidence that sequencing a pine genome using a WGS approach is a feasible goal.
Figures


Similar articles
-
Exploring the loblolly pine (Pinus taeda L.) genome by BAC sequencing and Cot analysis.Gene. 2018 Jul 15;663:165-177. doi: 10.1016/j.gene.2018.04.024. Epub 2018 Apr 12. Gene. 2018. PMID: 29655895
-
Unique features of the loblolly pine (Pinus taeda L.) megagenome revealed through sequence annotation.Genetics. 2014 Mar;196(3):891-909. doi: 10.1534/genetics.113.159996. Genetics. 2014. PMID: 24653211 Free PMC article.
-
Insights into the loblolly pine genome: characterization of BAC and fosmid sequences.PLoS One. 2013 Sep 4;8(9):e72439. doi: 10.1371/journal.pone.0072439. eCollection 2013. PLoS One. 2013. PMID: 24023741 Free PMC article.
-
Insights into conifer giga-genomes.Plant Physiol. 2014 Dec;166(4):1724-32. doi: 10.1104/pp.114.248708. Epub 2014 Oct 27. Plant Physiol. 2014. PMID: 25349325 Free PMC article. Review.
-
The cellular and molecular biology of conifer embryogenesis.New Phytol. 2007;176(3):511-536. doi: 10.1111/j.1469-8137.2007.02239.x. New Phytol. 2007. PMID: 17953539 Review.
Cited by
-
Comparative Transcriptomics Among Four White Pine Species.G3 (Bethesda). 2018 May 4;8(5):1461-1474. doi: 10.1534/g3.118.200257. G3 (Bethesda). 2018. PMID: 29559535 Free PMC article.
-
Accommodating the load: The transposable element content of very large genomes.Mob Genet Elements. 2013 Mar 1;3(2):e24775. doi: 10.4161/mge.24775. Mob Genet Elements. 2013. PMID: 24616835 Free PMC article.
-
Glutamate synthases from conifers: gene structure and phylogenetic studies.BMC Genomics. 2018 Jan 19;19(1):65. doi: 10.1186/s12864-018-4454-y. BMC Genomics. 2018. PMID: 29351733 Free PMC article.
-
Comparative transcriptomics of a complex of four European pine species.BMC Genomics. 2015 Mar 25;16(1):234. doi: 10.1186/s12864-015-1401-z. BMC Genomics. 2015. PMID: 25887584 Free PMC article.
-
A whole-genome shotgun approach for assembling and anchoring the hexaploid bread wheat genome.Genome Biol. 2015 Jan 31;16(1):26. doi: 10.1186/s13059-015-0582-8. Genome Biol. 2015. PMID: 25637298 Free PMC article.
References
-
- Wakamiya I, Newton RJ, Johnston JS, Price HJ. Genome Size and Environmental Factors in the Genus Pinus. American Journal of Botany. 1993;80(11):1235–1241. doi: 10.2307/2445706. - DOI
-
- Rabinowicz PD. Constructing gene-enriched plant genomic libraries using methylation filtration technology. Methods Mol Biol. 2003;236:21–36. - PubMed