Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Jul 1:6:82.
doi: 10.1038/s41438-019-0164-0. eCollection 2019.

Draft genome sequence of cauliflower (Brassica oleracea L. var. botrytis) provides new insights into the C genome in Brassica species

Affiliations

Draft genome sequence of cauliflower (Brassica oleracea L. var. botrytis) provides new insights into the C genome in Brassica species

Deling Sun et al. Hortic Res. .

Abstract

Cauliflower is an important variety of Brassica oleracea and is planted worldwide. Here, the high-quality genome sequence of cauliflower was reported. The assembled cauliflower genome was 584.60 Mb in size, with a contig N50 of 2.11 Mb, and contained 47,772 genes; 56.65% of the genome was composed of repetitive sequences. Among these sequences, long terminal repeats (LTRs) were the most abundant (32.71% of the genome), followed by transposable elements (TEs) (12.62%). Comparative genomic analysis confirmed that after an ancient paleohexaploidy (γ) event, cauliflower underwent two whole-genome duplication (WGD) events shared with Arabidopsis and an additional whole-genome triplication (WGT) event shared with other Brassica species. The present cultivated cauliflower diverged from the ancestral B. oleracea species ~3.0 million years ago (Mya). The speciation of cauliflower (~2.0 Mya) was later than that of B. oleracea L. var. capitata (approximately 2.6 Mya) and other Brassica species (over 2.0 Mya). Chromosome no. 03 of cauliflower shared the most syntenic blocks with the A, B, and C genomes of Brassica species and its eight other chromosomes, implying that chromosome no. 03 might be the most ancient one in the cauliflower genome, which was consistent with the chromosome being inherited from the common ancestor of Brassica species. In addition, 2,718 specific genes, 228 expanded genes, 2 contracted genes, and 1,065 positively selected genes in cauliflower were identified and functionally annotated. These findings provide new insights into the genomic diversity of Brassica species and serve as a valuable reference for molecular breeding of cauliflower.

Keywords: Genome assembly algorithms; Plant evolution.

PubMed Disclaimer

Conflict of interest statement

Conflict of interestThe authors declare that they have no conflict of interest.

Figures

Fig. 1
Fig. 1. Distribution of genes in cauliflower and other representative plant species.
a Distribution of single-copy orthologs (pink), multiple-copy orthologs (orange), unique genes (brown), and other unclassified genes (green) in cauliflower and other representative plant species. b Venn diagram of gene families in cauliflower and two other Brassica species with the C genome
Fig. 2
Fig. 2. Divergence time of cauliflower and other representative plant species.
The nodes represent the divergence time from present (million years ago, Mya). The blue number in the brackets indicates the confidence interval of the divergence time. a–c indicate the A, B and C genomes of Brassica species, respectively
Fig. 3
Fig. 3. Synteny analysis of genes in cauliflower, B. nigra, B. rapa and B. oleracea L. var. capitata.
a Syntenic blocks of cauliflower with B. nigra and B. rapa. b Syntenic blocks of cauliflower with B. oleracea L. var. capitata. The numbers indicate the corresponding chromosomes in each species. The detailed syntenic blocks and the associated genes are shown in Tables S6–S13
Fig. 4
Fig. 4. Circos diagram of different elements on the chromosomes of cauliflower.
Chrom. 01, 02, 03, 04, 05, 06, 07, 08, and 09 indicate the nine assembled chromosomes of cauliflower
Fig. 5
Fig. 5. Distribution of 4DTv distances.
The x axis indicates the 4Dtv distance. The y axis indicates the percentage of gene pairs. (C) shows the C subgenome of B. napus Darmor-bzh

References

    1. Nagaharu U. Genome analysis in Brassica with special reference to the experimental formation of B. napus and peculiar mode of fertilication. Jpn J. Bot. 1935;7:389–452.
    1. Li H, et al. Curd development associated gene (CDAG1) in cauliflower (Brassica oleracea L. var. botrytis) could result in enlarged organ size and increased biomass. Plant Sci. 2017;254:82–94. doi: 10.1016/j.plantsci.2016.10.009. - DOI - PubMed
    1. Anthony RG, James PE, Jordan BR. The cDNA sequence of a cauliflower apetala1/squamosa homolog. Plant Physiol. 1995;108:441–442. doi: 10.1104/pp.108.1.441. - DOI - PMC - PubMed
    1. Anthony RG, James PE, Jordan BR. Cauliflower (Brassica oleracea var. botrytis) curd development: the expression of meristem identity genes. J. Exp. Bot. 1996;47:181–188. doi: 10.1093/jxb/47.2.181. - DOI
    1. Cheung KL, Kong AN. Molecular targets of dietary phenethyl isothiocyanate and sulforaphane for cancer chemoprevention. AAPS J. 2010;12:87–97. doi: 10.1208/s12248-009-9162-8. - DOI - PMC - PubMed