Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2006 Jun;18(6):1348-59.
doi: 10.1105/tpc.106.041665. Epub 2006 Apr 21.

Comparative genomics of Brassica oleracea and Arabidopsis thaliana reveal gene loss, fragmentation, and dispersal after polyploidy

Affiliations
Comparative Study

Comparative genomics of Brassica oleracea and Arabidopsis thaliana reveal gene loss, fragmentation, and dispersal after polyploidy

Christopher D Town et al. Plant Cell. 2006 Jun.

Abstract

We sequenced 2.2 Mb representing triplicated genome segments of Brassica oleracea, which are each paralogous with one another and homologous with a segmentally duplicated region of the Arabidopsis thaliana genome. Sequence annotation identified 177 conserved collinear genes in the B. oleracea genome segments. Analysis of synonymous base substitution rates indicated that the triplicated Brassica genome segments diverged from a common ancestor soon after divergence of the Arabidopsis and Brassica lineages. This conclusion was corroborated by phylogenetic analysis of protein families. Using A. thaliana as an outgroup, 35% of the genes inferred to be present when genome triplication occurred in the Brassica lineage have been lost, most likely via a deletion mechanism, in an interspersed pattern. Genes encoding proteins involved in signal transduction or transcription were not found to be significantly more extensively retained than those encoding proteins classified with other functions, but putative proteins predicted in the A. thaliana genome were underrepresented in B. oleracea. We identified one example of gene loss from the Arabidopsis lineage. We found evidence for the frequent insertion of gene fragments of nuclear genomic origin and identified four apparently intact genes in noncollinear positions in the B. oleracea and A. thaliana genomes.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Phylogenetic Relationships of Conserved Genes. Left, overview of the positions of the members of the analyzed gene family across the sequence contigs. Right, phylogenies illustrated as maximum likelihood cladograms. Bootstrap support, calculated from 1000 replicates, is shown at each branch as a percentage.
Figure 2.
Figure 2.
Alignment of Conserved Genes. Vertical lines denote sequence contigs, and horizontal lines join members of homologous/paralogous gene families. The triangle by each gene model name indicates the coding stand of the gene and is color-coded to indicate the Munich Information Center for Protein Sequences FunCat functional classification of the predicted protein, as shown in the key at bottom.
Figure 3.
Figure 3.
Phylogenic Analysis of an Inverted Duplicate Array Present in Both A. thaliana Daughter Segments of the α Genome Duplication. Left, overview of the positions of the members of the analyzed gene family across the sequence contigs. Right, phylogeny illustrated as a maximum likelihood cladogram. Bootstrap support, calculated from 1000 replicates, is shown at each branch as a percentage.
Figure 4.
Figure 4.
Phylogenic Analysis of a Parallel Triplicate Array Present in Both A. thaliana Daughter Segments of the α Genome Duplication. Left, overview of the positions of the members of the analyzed gene family across the sequence contigs. Right, phylogeny illustrated as a maximum likelihood cladogram. Bootstrap support, calculated from 1000 replicates, is shown at each branch as a percentage. The subgroup represents a cluster of sequences (At5g47350, B.m00415, At5g47340, and At5g47330) for which a robust phylogeny could not be constructed.

References

    1. Arabidopsis Genome Initiative (2000). Analysis of the genome of the flowering plant Arabidopsis thaliana. Nature 408 796–815. - PubMed
    1. Arumuganthan, K., and Earle, E.D. (1991). Nuclear DNA content of some important plant species. Plant Mol. Biol. Rep. 9 208–218.
    1. Babula, D., Kaczmarek, M., Barakat, A., Delseney, M., Quiros, C.F., and Sadowski, J. (2003). Chromosomal mapping of Brassica oleracea based on ESTs from Arabidopsis thaliana: Complexity of the comparative map. Mol. Genet. Genomics 268 656–665. - PubMed
    1. Bairoch, A., and Apweiler, R. (2000). The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000. Nucleic Acids Res. 28 45–48. - PMC - PubMed
    1. Bancroft, I. (2000). Insights into the structural and functional evolution of plant genomes afforded by the nucleotide sequences of chromosomes 2 and 4 of Arabidopsis thaliana. Yeast 17 1–5. - PMC - PubMed

Publication types

Substances