Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 1998 Apr 14;95(8):4437-40.
doi: 10.1073/pnas.95.8.4437.

Gene number in an invertebrate chordate, Ciona intestinalis

Affiliations
Comparative Study

Gene number in an invertebrate chordate, Ciona intestinalis

M W Simmen et al. Proc Natl Acad Sci U S A. .

Abstract

Gene number can be considered a pragmatic measure of biological complexity, but reliable data is scarce. Estimates for vertebrates are 50-100,000 genes per haploid genome, whereas invertebrate estimates fall below 25,000. We wished to test the hypothesis that the origin of vertebrates coincided with extensive gene creation. A prediction is that gene number will differ sharply between invertebrate and vertebrate members of the chordate phylum. A gene number estimation method requiring limited sequence sampling of genomic DNA was developed and validated by using data for Caenorhabditis elegans. Using the method, we estimated that the invertebrate chordate Ciona intestinalis has 15,500 protein-coding genes (+/-3,700). This number is significantly lower than gene numbers of vertebrate chordates, but similar to those of invertebrates in distantly related phyla. The data indicate that evolution of vertebrates was accompanied by a dramatic increase in protein-coding capacity of the genome.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Gene number estimates. Shaded bars denote results based on completed genome projects. The error bars should be regarded as approximate indicators of uncertainty only, as statistically comparable values are generally not available. This is not a comprehensive summary of data from all available species. Current estimates for mouse (3, 21) and pufferfish (22) are not shown but fall within the range shown for human. The C. elegans estimate, 16,527 ± 260, is the most up-to-date figure as calculated by the method used in ref. . Other data are from refs. , , , , , and and references therein.

References

    1. Ohno S. Evolution by Gene Duplication. Heidelberg: Springer; 1970.
    1. Holland, P. W. H., Garcia-Fernàndez, J., Williams, N. A. & Sidow, A. (1994) Development (Cambridge, U.K.), Suppl., 125–133. - PubMed
    1. Bird A P. Trends Genet. 1995;11:94–100. - PubMed
    1. Waterston R, Martin C, Craxton M, Huynh C, Coulson A, Hillier L, Durbin R, Green P, Shownkeen R, Halloran N, et al. Nat Genet. 1992;1:114–123. - PubMed
    1. Cavalier-Smith T. The Evolution of Genome Size. Chichester: Wiley; 1985.

Publication types

Associated data

LinkOut - more resources