Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012;7(5):e36972.
doi: 10.1371/journal.pone.0036972. Epub 2012 May 16.

Phylogenomics of prokaryotic ribosomal proteins

Affiliations

Phylogenomics of prokaryotic ribosomal proteins

Natalya Yutin et al. PLoS One. 2012.

Abstract

Archaeal and bacterial ribosomes contain more than 50 proteins, including 34 that are universally conserved in the three domains of cellular life (bacteria, archaea, and eukaryotes). Despite the high sequence conservation, annotation of ribosomal (r-) protein genes is often difficult because of their short lengths and biased sequence composition. We developed an automated computational pipeline for identification of r-protein genes and applied it to 995 completely sequenced bacterial and 87 archaeal genomes available in the RefSeq database. The pipeline employs curated seed alignments of r-proteins to run position-specific scoring matrix (PSSM)-based BLAST searches against six-frame genome translations, mitigating possible gene annotation errors. As a result of this analysis, we performed a census of prokaryotic r-protein complements, enumerated missing and paralogous r-proteins, and analyzed the distributions of ribosomal protein genes among chromosomal partitions. Phyletic patterns of bacterial and archaeal r-protein genes were mapped to phylogenetic trees reconstructed from concatenated alignments of r-proteins to reveal the history of likely multiple independent gains and losses. These alignments, available for download, can be used as search profiles to improve genome annotation of r-proteins and for further comparative genomics studies.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. Overall scheme of the procedure.
Figure 2
Figure 2. Bacterial phylogenetic tree reconstructed from a concatenated alignment of 50 nearly ubiquitous r-proteins.
Green boxes denoted as I, II, and III mark three putative “megaphyla” discussed in the text. Branches having bootstrap support values less than 0.5 were collapsed.
Figure 3
Figure 3. Phyletic patterns of r-proteins that are placed in the last common ancestor of Bacteria by Dollo parsimony.
Proteins having full phyletic pattern are listed in black font. Losses are marked by a strikethrough font. Numbers in parentheses following taxonomic group names represent number of species in that group. Numbers in parentheses following r-protein names represent number of species on that branch that have lost this r-protein.
Figure 4
Figure 4. Phyletic distribution of non-ubiquitous bacterial r-proteins that according to the parsimony reconstruction do not appear to be ancestral.
Numbers in parentheses following taxonomic group names represent number of species in that group. Numbers in parentheses following r-protein names represent number of species on that branch that have this r-protein. L7ae on branch means all genomes of this branch included in the dataset have this r-protein. Asterisks point on two phyla where S31 protein has been found in genomes that were not included in the dataset.
Figure 5
Figure 5. Archaeal phylogenetic tree reconstructed from a concatenated alignment of 56 ubiquitous r-proteins.
Figure 6
Figure 6. Phyletic distribution of twelve non-ubiquitous archaeal r-proteins.
Numbers in parentheses and strikethrough font mean the same as on Figure 3.
Figure 7
Figure 7. Three-domain phylogenetic tree reconstructed from a concatenated alignment of 32 universal r-proteins.
Branches having bootstrap support values less than 0.5 were collapsed.

References

    1. Ramakrishnan V. Ribosome structure and the mechanism of translation. Cell. 2002;108:557–572. - PubMed
    1. Ban N, Nissen P, Hansen J, Moore PB, Steitz TA. The complete atomic structure of the large ribosomal subunit at 2.4 A resolution. Science. 2000;289:905–920. - PubMed
    1. Wimberly BT, Brodersen DE, Clemons WM, Morgan-Warren RJ, Carter AP, et al. Structure of the 30S ribosomal subunit. Nature. 2000;407:327–339. - PubMed
    1. Schluenzen F, Tocilj A, Zarivach R, Harms J, Gluehmann M, et al. Structure of functionally activated small ribosomal subunit at 3.3 angstroms resolution. Cell. 2000;102:615–623. - PubMed
    1. Williamson JR. The ribosome at atomic resolution. Cell. 2009;139:1041–1043. - PubMed

Publication types

LinkOut - more resources