Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Sep 15;5(9):e12762.
doi: 10.1371/journal.pone.0012762.

The complete chloroplast genome sequence of date palm (Phoenix dactylifera L.)

Affiliations

The complete chloroplast genome sequence of date palm (Phoenix dactylifera L.)

Meng Yang et al. PLoS One. .

Abstract

Background: Date palm (Phoenix dactylifera L.), a member of Arecaceae family, is one of the three major economically important woody palms--the two other palms being oil palm and coconut tree--and its fruit is a staple food among Middle East and North African nations, as well as many other tropical and subtropical regions. Here we report a complete sequence of the data palm chloroplast (cp) genome based on pyrosequencing.

Methodology/principal findings: After extracting 369,022 cp sequencing reads from our whole-genome-shotgun data, we put together an assembly and validated it with intensive PCR-based verification, coupled with PCR product sequencing. The date palm cp genome is 158,462 bp in length and has a typical quadripartite structure of the large (LSC, 86,198 bp) and small single-copy (SSC, 17,712 bp) regions separated by a pair of inverted repeats (IRs, 27,276 bp). Similar to what has been found among most angiosperms, the date palm cp genome harbors 112 unique genes and 19 duplicated fragments in the IR regions. The junctions between LSC/IRs and SSC/IRs show different features of sequence expansion in evolution. We identified 78 SNPs as major intravarietal polymorphisms within the population of a specific cp genome, most of which were located in genes with vital functions. Based on RNA-sequencing data, we also found 18 polycistronic transcription units and three highly expression-biased genes--atpF, trnA-UGC, and rrn23.

Conclusions: Unlike most monocots, date palm has a typical cp genome similar to that of tobacco--with little rearrangement and gene loss or gain. High-throughput sequencing technology facilitates the identification of intravarietal variations in cp genomes among different cultivars. Moreover, transcriptomic analysis of cp genes provides clues for uncovering regulatory mechanisms of transcription and translation in chloroplasts.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. The map of date palm cp genome sequence.
Genes shown outside the outer circle are transcribed clockwise, whereas those shown inside are transcribed counterclockwise. The thick line indicates IRs. The genome coordinate is shown in the inner circle. Genes belonging to different groups are color-coded (Ψ, pseudogene).
Figure 2
Figure 2. A multiple alignment of cemA genes from representative monocot cp genomes.
The start codon is assigned as origin. The possible stop codons in the upstream regions are marked by ellipse. 663-bp less-relevant aligned sequences were omitted.
Figure 3
Figure 3. Expansion of IR into the LSC region.
The dashed lines depict the junction boundaries. The arrowed grey lines indicate the process of IR expansions. a, Amborella and Nicotiana; b, Lemna; c, intermediate; d, Acorus and Dioscorea; e, date palm, typha, and all Poaceaes; f, Phalaenopsis and Oncidium.
Figure 4
Figure 4. Expansion of IR into the SSC region.
The dashed lines depict the junction boundaries. The arrowed grey lines indicate the process of IR expansions. a, Amborella, Nicotiana, typha and two Acoraceae members; b, Phalaenopsis; c, Dioscorea; d, date palm and Arabidopsis; e, intermediate; f, Lemna; g, intermediate; h, most Poaceaes; i, Panicoideae subfamily including Coix, Maize, Saccharum and Sorghum.
Figure 5
Figure 5. Folded stem-loop structures of the new SI from four representative species.
Free energy values (dG) are shown for SI in each species.

Similar articles

Cited by

References

    1. Howe CJ, Barbrook AC, Koumandou VL, Nisbet RE, Symington HA, et al. Evolution of the chloroplast genome. Philos Trans R Soc Lond B Biol Sci. 2003;358:99–106; discussion 106-107. - PMC - PubMed
    1. Pyke KA. Plastid division and development. Plant Cell. 1999;11:549–556. - PMC - PubMed
    1. Bausher MG, Singh ND, Lee SB, Jansen RK, Daniell H. The complete chloroplast genome sequence of Citrus sinensis (L.) Osbeck var ‘Ridge Pineapple’: organization and phylogenetic relationships to other angiosperms. BMC Plant Biol. 2006;6:21. - PMC - PubMed
    1. Chumley TW, Palmer JD, Mower JP, Fourcade HM, Calie PJ, et al. The complete chloroplast genome sequence of Pelargonium x hortorum: organization and evolution of the largest and most highly rearranged chloroplast genome of land plants. Mol Biol Evol. 2006;23:2175–2190. - PubMed
    1. Shinozaki K, Ohme M, Tanaka M, Wakasugi T, Hayashida N, et al. The complete nucleotide sequence of the tobacco chloroplast genome: its gene organization and expression. EMBO J. 1986;5:2043–2049. - PMC - PubMed

Publication types

LinkOut - more resources