Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2005 Jan 28:3:2.
doi: 10.1186/1741-7007-3-2.

Comparative analysis of protein coding sequences from human, mouse and the domesticated pig

Affiliations
Comparative Study

Comparative analysis of protein coding sequences from human, mouse and the domesticated pig

Frank Grønlund Jørgensen et al. BMC Biol. .

Abstract

Background: The availability of abundant sequence data from key model organisms has made large scale studies of molecular evolution an exciting possibility. Here we use full length cDNA alignments comprising more than 700,000 nucleotides from human, mouse, pig and the Japanese pufferfish Fugu rubrices in order to investigate 1) the relationships between three major lineages of mammals: rodents, artiodactyls and primates, and 2) the rate of evolution and the occurrence of positive Darwinian selection using codon based models of sequence evolution.

Results: We provide evidence that the evolutionary splits among primates, rodents and artiodactyls happened shortly after each other, with most gene trees favouring a topology with rodents as outgroup to primates and artiodactyls. Using an unrooted topology of the three mammalian species we show that since their diversification, the pig and mouse lineages have on average experienced 1.44 and 2.86 times as many synonymous substitutions as humans, respectively, whereas the rates of non-synonymous substitutions are more similar. The analysis shows the highest average dN/dS ratio in the human lineage, followed by the pig and then the mouse lineages. Using codon based models we detect signals of positive Darwinian selection in approximately 5.3%, 4.9% and 6.0% of the genes on the human, pig and mouse lineages respectively. Approximately 16.8% of all the genes studied here are not currently annotated as functional genes in humans. Our analyses indicate that a large fraction of these genes may have lost their function quite recently or may still be functional genes in some or all of the three mammalian species.

Conclusions: We present a comparative analysis of protein coding genes from three major mammalian lineages. Our study demonstrates the usefulness of codon-based likelihood models in detecting selection and it illustrates the value of sequencing organisms at different phylogenetic distances for comparative studies.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Phylogenetic tree of key mammalian species. A schematic drawing showing the topologies considered in our study compared to a recent study on human, chimpanzee and mouse trios [23]. Branch a shows the branch considered in the study by Clark et al (2003) while branch a+b represents the evolutionary time scale studied here.
Figure 2
Figure 2
Distribution of sequence alignment lengths. Histogram showing the distribution of sequence lengths in the three species alignments.
Figure 3
Figure 3
Conflicting mammalian phylogenies. A schematic drawing of the three conflicting bifurcating topologies (a-c) as well as a multifurcating alternative (d). The divergence times shown in (a) are million years from present [31].
Figure 4
Figure 4
Evolutionary rates. Histograms of key parameters in the codon models. (a-c) The rate of synonymous substitutions per synonymous site (dS) in the pig, human and mouse lineage respectively. (d-f) The rate of nonsynonymous substitutions per nonsynonymous site (dN) in the pig, human and mouse lineage respectively. (g-h) The ratio of nonsynonymous substitutions to synonymous substitutions (dN/dS ratio) in the pig, human and mouse lineage respectively. The horizontal line represents the mean of the distributions.

Similar articles

Cited by

References

    1. Li WH, Gouy M, Sharp PM, O'HUigin C, Yang YW. Molecular phylogeny of Rodentia, Lagomorpha, Primates, Artiodactyla, and Carnivora and molecular clocks. Proc Natl Acad Sci U S A. 1990;87:6703–6707. - PMC - PubMed
    1. Thomas JW, Touchman JW, Blakesley RW, Bouffard GG, Beckstrom-Sternberg SM, Margulies EH, Blanchette M, Siepel AC, Thomas PJ, McDowell JC, Maskeri B, Hansen NF, Schwartz MS, Weber RJ, Kent WJ, Karolchik D, Bruen TC, Bevan R, Cutler DJ, Schwartz S, Elnitski L, Idol JR, Prasad AB, Lee-Lin SQ, Maduro VV, Summers TJ, Portnoy ME, Dietrich NL, Akhter N, Ayele K, Benjamin B, Cariaga K, Brinkley CP, Brooks SY, Granite S, Guan X, Gupta J, Haghighi P, Ho SL, Huang MC, Karlins E, Laric PL, Legaspi R, Lim MJ, Maduro QL, Masiello CA, Mastrian SD, McCloskey JC, Pearson R, Stantripop S, Tiongson EE, Tran JT, Tsurgeon C, Vogt JL, Walker MA, Wetherby KD, Wiggins LS, Young AC, Zhang LH, Osoegawa K, Zhu B, Zhao B, Shu CL, De Jong PJ, Lawrence CE, Smit AF, Chakravarti A, Haussler D, Green P, Miller W, Green ED. Comparative analyses of multi-species sequences from targeted genomic regions. Nature. 2003;424:788–793. doi: 10.1038/nature01858. - DOI - PubMed
    1. Janke A, Feldmaier-Fuchs G, Thomas WK, von Haeseler A, Paabo S. The marsupial mitochondrial genome and the evolution of placental mammals. Genetics. 1994;137:243–256. - PMC - PubMed
    1. Misawa K, Janke A. Revisiting the Glires concept – phylogenetic analysis of nuclear sequences. Mol Phylogenet Evol. 2003;28:320–327. doi: 10.1016/S1055-7903(03)00079-4. - DOI - PubMed
    1. Easteal S. The pattern of mammalian evolution and the relative rate of molecular evolution. Genetics. 1990;124:165–173. - PMC - PubMed

Publication types

LinkOut - more resources