Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Nov 9:6:8824.
doi: 10.1038/ncomms9824.

Whole genome re-sequencing of date palms yields insights into diversification of a fruit tree crop

Affiliations

Whole genome re-sequencing of date palms yields insights into diversification of a fruit tree crop

Khaled M Hazzouri et al. Nat Commun. .

Abstract

Date palms (Phoenix dactylifera) are the most significant perennial crop in arid regions of the Middle East and North Africa. Here, we present a comprehensive catalogue of approximately seven million single nucleotide polymorphisms in date palms based on whole genome re-sequencing of a collection of 62 cultivars. Population structure analysis indicates a major genetic divide between North Africa and the Middle East/South Asian date palms, with evidence of admixture in cultivars from Egypt and Sudan. Genome-wide scans for selection suggest at least 56 genomic regions associated with selective sweeps that may underlie geographic adaptation. We report candidate mutations for trait variation, including nonsense polymorphisms and presence/absence variation in gene content in pathways for key agronomic traits. We also identify a copia-like retrotransposon insertion polymorphism in the R2R3 myb-like orthologue of the oil palm virescens gene associated with fruit colour variation. This analysis documents patterns of post-domestication diversification and provides a genomic resource for this economically important perennial tree crop.

PubMed Disclaimer

Figures

Figure 1
Figure 1. Summary of single nucleotide polymorphisms in 62 date palm cultivars.
(a) Circos plot of the 50 longest scaffolds (18.4% of the cv. Khalas assembly). Tracks from outer to inner are θW (grid lines are drawn at 0.01 intervals) and SNP density (grid lines are drawn at 250-SNP intervals) in non-overlapping 25-kb bins. Gene density and transposable element densities in sliding windows of 100 kb with step size 20 kb. A zoom in scaffold S000001 showing π and SNP density in non-overlapping 10-kb bins (b) Distribution of SNPs among functional effect classes compared with the proportion of sites in the reference (cv. Khalas) genome. (c) Minor allele site-frequency spectrum among selected coding region site classes. (d) Decay of linkage disequilibrium measured as the squared correlation coefficient (r2) by physical distance in 62 cultivars.
Figure 2
Figure 2. Population structure in cultivated date palm.
(a) Principal component (PC) analysis of SNP genotypes based on ∼25,000 SNPs for 62 cultivars. PC1, 2 and 3 axes account for 11.96, 4.03 and 3.69% of the variation, respectively. (b) Neighbour-joining tree based on the distance metric of Gronau et al. using 7,176,238 SNPs. (c) Population stratification based on STRUCTURE for K=2. Pie charts represent admixture proportions (that is, percentage of genome composition associated with each ancestral subpopulation cluster), and are placed on the map in the traditionally recognized country of origin of each cultivar.
Figure 3
Figure 3. Evidence of inbreeding in date palm genomes.
(a) Density distribution of the proportion of heterozygous genotypes in 50 kb windows for four date palm cultivars. Peaks in the distributions at low heterozygosity are apparent in the more inbred samples (b) An example run of homozygosity (ROH) in cv. Ajwa compared with cv. Mazafati on genome assembly scaffold S000023. Lines are loess fits to the proportion of heterozygous genotypes in non-overlapping 20 kb windows.
Figure 4
Figure 4. Identification of candidate selective sweep regions.
The highlighted region on scaffold S000007 shows a reduction in diversity in samples from the Middle East in a region containing a pectin lyase gene, which is a candidate fruit ripening locus. Shown are the Z-score transformation of log2(θMiddle East/θNorth Africa) in 5-kb windows across the longest 30 scaffolds in the genome assembly. Points from the negative half of the Z-score distribution are shown. θw and FST tracks show estimates from 5 kb and 500-bp windows. The θW track shows estimates in sliding windows of 5 kb with step size of 500 bp. The FST track shows estimates in 500-bp non-overlapping windows.
Figure 5
Figure 5. Variation in selected genes and pathways of interest in date palm.
(a) Major effect mutations in selected pathways and large gene families in the date palm genome. (b) Evidence of elevated neutral polymorphism in regions containing NBS–LRR, CC–NBS–LRR and RLP genes consistent with long-term balancing selection acting on these gene families. Bars represent mean θW in 10-kb windows that contain at least one putative disease resistance gene in the indicated class compared with windows without these genes. P values are based on a two-tailed Wilcoxon rank-sum test (***P<10−10; **P<10−5). (c) Date palm cultivars with predicted deletion alleles at resistance/immunity-related gene loci in African (dark orange) and Middle Eastern (dark blue) samples. Samples without deletions are colored light yellow and light blue.
Figure 6
Figure 6. Fruit colour polymorphism in date palm is likely controlled by a dominant negative mutation in an R2R3 myb-like transcription factor.
(a) Khalal stage fruits in cv. Barhee and cv. Khisab illustrating the red/yellow fruit colour polymorphism in date palm, respectively. (b) Phylogeny of selected members of the R2R3 myb-like transcription factor family. The analysis supports the orthology of date palm and oil palm VIR genes. (c) Alignment of the C-terminal end of the VIR alleles in date palm and the orthologous gene in oil palm. (d) Representation of VIR+ and VIRcopia alleles found at the date palm orthologue of the oil palm R2R3 myb-like VIR gene, defined based on a pre-mature stop codon in exon 3 that truncates the protein in the cv. Khalas reference genome relative to the wild-type (VIR+) allele. The stop codon is introduced by a copia-like retrotransposon element insertion. (e) Genotype counts based on the alleles in (d) showing perfect concordance between VIR genotype and the fruit colour phenotype. Yellow represents a range of intermediate colours between yellow and red, including golden yellow and orange yellow (Supplementary Table 10). Samples with missing genotypes or whose fruit colour cannot be verified were excluded. This suggests that yellow fruit colour in both oil palm and date palm is caused by truncated alleles that act as dominant negative mutations.

References

    1. Meyer R. S. & Purugganan M. D. Evolution of crop species: genetics of domestication and diversification. Nat. Rev. Genet. 14, 840–852 (2013). - PubMed
    1. Purugganan M. D. & Fuller D. The nature of selection during plant domestication. Nature 457, 843–848 (2009). - PubMed
    1. Larson G. et al.. Current perspectives and the future of domestication studies. Proc. Natl Acad. Sci. USA 111, 6139–6146 (2014). - PMC - PubMed
    1. Xu X. et al.. Resequencing 50 accessions of cultivated and wild rice yields markers for identifying agronomically important genes. Nat. Biotechnol. 30, 105–111 (2012). - PubMed
    1. Wang M. et al.. The genome sequence of African rice (Oryza glaberrima) and evidence for independent domestication. Nat. Genet. 46, 982–988 (2014). - PMC - PubMed

Publication types