Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2007 Nov 19:8:425.
doi: 10.1186/1471-2164-8-425.

Experimental-confirmation and functional-annotation of predicted proteins in the chicken genome

Affiliations
Comparative Study

Experimental-confirmation and functional-annotation of predicted proteins in the chicken genome

Teresia J Buza et al. BMC Genomics. .

Abstract

Background: The chicken genome was sequenced because of its phylogenetic position as a non-mammalian vertebrate, its use as a biomedical model especially to study embryology and development, its role as a source of human disease organisms and its importance as the major source of animal derived food protein. However, genomic sequence data is, in itself, of limited value; generally it is not equivalent to understanding biological function. The benefit of having a genome sequence is that it provides a basis for functional genomics. However, the sequence data currently available is poorly structurally and functionally annotated and many genes do not have standard nomenclature assigned.

Results: We analysed eight chicken tissues and improved the chicken genome structural annotation by providing experimental support for the in vivo expression of 7,809 computationally predicted proteins, including 30 chicken proteins that were only electronically predicted or hypothetical translations in human. To improve functional annotation (based on Gene Ontology), we mapped these identified proteins to their human and mouse orthologs and used this orthology to transfer Gene Ontology (GO) functional annotations to the chicken proteins. The 8,213 orthology-based GO annotations that we produced represent an 8% increase in currently available chicken GO annotations. Orthologous chicken products were also assigned standardized nomenclature based on current chicken nomenclature guidelines.

Conclusion: We demonstrate the utility of high-throughput expression proteomics for rapid experimental structural annotation of a newly sequenced eukaryote genome. These experimentally-supported predicted proteins were further annotated by assigning the proteins with standardized nomenclature and functional annotation. This method is widely applicable to a diverse range of species. Moreover, information from one genome can be used to improve the annotation of other genomes and inform gene prediction algorithms.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Chicken predicted proteins identified from different tissues. Proteomic based analysis was used to demonstrate the in vivo expression of electronically predicted chicken proteins. (A) The number of predicted chicken proteins identified from each tissue, with the proportion of proteins that were identified in more than one tissue indicated. (B) The majority of proteins were identified in more than one tissue.
Figure 2
Figure 2
Chicken – human/mouse orthologs. (A) The number of identified predicted proteins that had either human or mouse 1:1 orthologs. (B) Distribution of orthologs identified by different orthology prediction methods. The 4 most commonly used ortholog prediction tools are Homologene, Ensembl, InParanoid and Treefam. Human/mouse orthologs were identified for 77% of the identified chicken proteins (see additional file 3).
Figure 3
Figure 3
Overview of cellular component transferred to orthologous chicken predicted proteins. The GO annotations are summarized to broad terms of cellular component. These GO annotations are publicly available via the AgBase database [4].
Figure 4
Figure 4
Overview of molecular function transferred to orthologous chicken predicted proteins. The GO annotations are summarized to broad terms of molecular function. These GO annotations are publicly available via the AgBase database [4].
Figure 5
Figure 5
Overview of biological processes transferred to orthologous chicken predicted proteins. The GO annotations are summarized to broad terms of biological processes. These GO annotations are publicly available via the AgBase database [4].

References

    1. Alexandersson M, Cawley S, Pachter L. SLAM: cross-species gene finding and alignment with a generalized pair hidden Markov model. Genome Res. 2003;13:496–502. doi: 10.1101/gr.424203. - DOI - PMC - PubMed
    1. Curwen V, Eyras E, Andrews TD, Clarke L, Mongin E, Searle SM, Clamp M. The Ensembl automatic gene annotation system. Genome Res. 2004;14:942–950. doi: 10.1101/gr.1858004. - DOI - PMC - PubMed
    1. Wu JQ, Shteynberg D, Arumugam M, Gibbs RA, Brent MR. Identification of rat genes by TWINSCAN gene prediction, RT-PCR, and direct sequencing. Genome Res. 2004;14:665–671. doi: 10.1101/gr.1959604. - DOI - PMC - PubMed
    1. McCarthy FM, Bridges SM, Wang N, Magee GB, Williams WP, Luthe DS, Burgess SC. AgBase: a unified resource for functional analysis in agriculture. Nucleic acids research. 2007;35:D599–603. doi: 10.1093/nar/gkl936. - DOI - PMC - PubMed
    1. McCarthy FM, Wang N, Magee GB, Nanduri B, Lawrence ML, Camon EB, Barrell DG, Hill DP, Dolan ME, Williams WP, Luthe DS, Bridges SM, Burgess SC. AgBase: a functional genomics resource for agriculture. BMC genomics. 2006;7:229. doi: 10.1186/1471-2164-7-229. - DOI - PMC - PubMed

Publication types

LinkOut - more resources