Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2009 Mar 27:9:36.
doi: 10.1186/1471-2229-9-36.

A new genomic resource dedicated to wood formation in Eucalyptus

Affiliations

A new genomic resource dedicated to wood formation in Eucalyptus

David Rengel et al. BMC Plant Biol. .

Abstract

Background: Renowned for their fast growth, valuable wood properties and wide adaptability, Eucalyptus species are amongst the most planted hardwoods in the world, yet they are still at the early stages of domestication because conventional breeding is slow and costly. Thus, there is huge potential for marker-assisted breeding programs to improve traits such as wood properties. To this end, the sequencing, analysis and annotation of a large collection of expressed sequences tags (ESTs) from genes involved in wood formation in Eucalyptus would provide a valuable resource.

Results: We report here the normalization and sequencing of a cDNA library from developing Eucalyptus secondary xylem, as well as the construction and sequencing of two subtractive libraries (juvenile versus mature wood and vice versa). A total of 9,222 high quality sequences were collected from about 10,000 cDNA clones. The EST assembly generated a set of 3,857 wood-related unigenes including 2,461 contigs (Cg) and 1,396 singletons (Sg) that we named 'EUCAWOOD'. About 65% of the EUCAWOOD sequences produced matches with poplar, grapevine, Arabidopsis and rice protein sequence databases. BlastX searches of the Uniref100 protein database allowed us to allocate gene ontology (GO) and protein family terms to the EUCAWOOD unigenes. This annotation of the EUCAWOOD set revealed key functional categories involved in xylogenesis. For instance, 422 sequences matched various gene families involved in biosynthesis and assembly of primary and secondary cell walls. Interestingly, 141 sequences were annotated as transcription factors, some of them being orthologs of regulators known to be involved in xylogenesis. The EUCAWOOD dataset was also mined for genomic simple sequence repeat markers, yielding a total of 639 putative microsatellites. Finally, a publicly accessible database was created, supporting multiple queries on the EUCAWOOD dataset.

Conclusion: In this work, we have identified a large set of wood-related Eucalyptus unigenes called EUCAWOOD, thus creating a valuable resource for functional genomics studies of wood formation and molecular breeding in this economically important genus. This set of publicly available annotated sequences will be instrumental for candidate gene approaches, custom array development and marker-assisted selection programs aimed at improving and modulating wood properties.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Size distribution of the EUCAWOOD unigenes after assembly.
Figure 2
Figure 2
Overlap between the EUCAWOOD unigenes. (A) Venn diagram showing the overlap between unigenes originating from the cDNA library [XylcDNA] and each of the SSH libraries [Jm: juvenile vs mature secondary xylem; Mj: mature vs juvenile secondary xylem; Xl: secondary xylem vs leaves; Xp: secondary xylem vs secondary phloem]. (B) Venn diagram showing the overlap of unigenes derived from the four different SSH libraries.
Figure 3
Figure 3
Number of EUCAWOOD unigenes with similarities to predicted proteins from four plant species. BLASTX searches (E value ≤ e-10) were conducted to identify EUCAWOOD unigenes in the JGI Poplar Proteins v1.1, Arabidopsis TAIR7 Peptides, TIGR Rice Genome Annotation and NCBI (Vitis vinifera) databases.
Figure 4
Figure 4
Gene ontology assignments to EUCAWOOD unigenes. GO terms were allocated to EUCAWOOD unigenes according to their best hit in searches of the Uniref100 database (E value ≤ e-10). Terms and IDs belonging to the 'Biological Process' and 'Molecular Function' categories are shown. Black bars indicate the main subcategories whereas the grey bars immediately below them illustrate subcategories therein. (Terms and IDs belonging to 'Cellular Component' category can be found in Additional file 1.)
Figure 5
Figure 5
Protein families among EUCAWOOD unigenes. A total of 825 protein families from the PFAM protein family database were represented in the EUCAWOOD dataset. The black bars indicate the occurrence of the 20 most abundant protein families.

References

    1. Boudet AM, Kajita S, Grima-Pettenati J, Goffner D. Lignins and lignocellulosics: a better control of synthesis for new and improved uses. Trends Plant Sci. 2003;8:576–581. doi: 10.1016/j.tplants.2003.10.001. - DOI - PubMed
    1. Plomion C, Leprovost G, Stokes A. Wood formation in trees. Plant Physiol. 2001;127:1513–1523. doi: 10.1104/pp.010816. - DOI - PMC - PubMed
    1. Zobel B, Sprague J. Juvenile wood in forest trees. Berlin: Springer-Verlag; 1998.
    1. Myburg AA, Potts BM, Marques CMP, Kirst M, Gion J-M, Grattapaglia D, Grima-Pettenati J. Eucalyptus. In: Kole CR, editor. Genome Mapping & Molecular Breeding in Plants Forest Trees. Vol. 7. Heidelberg: Springer; 2007. ISBN: 978-3-540-34540-4.
    1. Food and Agriculture Organization of the United Nations (Global Forest Resources Assessment, 2005) http://www.fao.org/docrep/008/a0400e/a0400e0400.htm

Publication types

LinkOut - more resources