Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012;8(11):e1002752.
doi: 10.1371/journal.pcbi.1002752. Epub 2012 Nov 1.

Comparative analysis of RNA families reveals distinct repertoires for each domain of life

Affiliations

Comparative analysis of RNA families reveals distinct repertoires for each domain of life

Marc P Hoeppner et al. PLoS Comput Biol. 2012.

Abstract

The RNA world hypothesis, that RNA genomes and catalysts preceded DNA genomes and genetically-encoded protein catalysts, has been central to models for the early evolution of life on Earth. A key part of such models is continuity between the earliest stages in the evolution of life and the RNA repertoires of extant lineages. Some assessments seem consistent with a diverse RNA world, yet direct continuity between modern RNAs and an RNA world has not been demonstrated for the majority of RNA families, and, anecdotally, many RNA functions appear restricted in their distribution. Despite much discussion of the possible antiquity of RNA families, no systematic analyses of RNA family distribution have been performed. To chart the broad evolutionary history of known RNA families, we performed comparative genomic analysis of over 3 million RNA annotations spanning 1446 families from the Rfam 10 database. We report that 99% of known RNA families are restricted to a single domain of life, revealing discrete repertoires for each domain. For the 1% of RNA families/clans present in more than one domain, over half show evidence of horizontal gene transfer (HGT), and the rest show a vertical trace, indicating the presence of a complex protein synthesis machinery in the Last Universal Common Ancestor (LUCA) and consistent with the evolutionary history of the most ancient protein-coding genes. However, with limited interdomain transfer and few RNA families exhibiting demonstrable antiquity as predicted under RNA world continuity, our results indicate that the majority of modern cellular RNA repertoires have primarily evolved in a domain-specific manner.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. Venn diagram of RNA family distribution.
Taxonomic information attached to EMBL-derived Rfam annotations reveals that the majority (99%) of RNA families are domain-specific, with only seven RNA families universally conserved (across the three domains of life plus viruses; Table S1 in Text S1). Numbers within dashed circles indicate viral RNA families.
Figure 2
Figure 2. RNA-based processes traceable to the Last Universal Common Ancestor.
Universal Rfam families that show evidence of vertical inheritance (Table S1 in Text S1) are all associated with the processes of translation (rRNAs, tRNAs, RNase P) and protein export (SRP RNA). A previous study examining the antiquity of protein coding genes identified only 37 universally distributed proteins which show evidence of vertical inheritance. The majority of these vertically inherited proteins are associated with translation and protein export; numbers of such proteins associated with each of the depicted processes is given in grey (original data are from Harris [21]). The proteins associated with RNase P are not universally conserved, with archaeal and eukaryotic RNase P proteins being unrelated to their bacterial counterparts . While tRNA synthetases are universal, they have undergone ancient horizontal gene transfer events , which complicates establishing the timing of their origin.
Figure 3
Figure 3. Reconstruction of broadly distributed RNA repertoires for each domain, plus interdomain RNA families.
Colored bars at far right indicate normalized taxonomic abundance of each Rfam for major taxonomic groupings within each domain. Horizontal traces (see text, Table S1 in Text S1) for interdomain families, are depicted as follows: general transfer patterns are given by dashed arrows; proposed HGT patterns for individual families are depicted by number (inset). For Rfam families present in more than one domain (far left and inset), bars indicate normalized taxonomic abundance by domain (color scheme at bottom left). Asterisks indicate additional broadly-distributed bacterial candidates identified using GEBA tree topology (see text). Note that the Rfam rRNA families in Rfam 10.0 are based on conserved subsequences, and are not as comprehensive as other resources (see main text) and are included here for consistency. The universally-distributed rRNAs are the small subunit (16/18S) rRNA, large subunit (23/28S) rRNA and 5S rRNA (see Table S1 in Text S1). The 5.8S rRNA of eukaryotes is known to be homologous to the 5′ end of bacterial and archaeal 23S rRNA , , so its inclusion as a eukaryote-specific family in Rfam is in this respect artefactual.
Figure 4
Figure 4. Rfam-based functional classification of RNA families.
The tree depicts classification of the higher level data structures within Rfam, and is not a phylogeny. Numbers of sequences and families in Rfam 10 that fall into each functional classification are shown as bar charts. Domain-level taxonomic distribution for each functional category is shown by black (present) and white (absent) boxes, right. The grey box indicates that H/ACA family RNAs are known from archaea , , but are not in Rfam 10.

Similar articles

Cited by

References

    1. Fraenkel-Conrat H (1956) The role of the nucleic acid in the reconstitution of active Tobacco Mosaic Virus. Journal of the American Chemical Society 78: 882–883.
    1. Gierer A, Schramm G (1956) Infectivity of ribonucleic acid from Tobacco Mosaic Virus. Nature 177: 702–703. - PubMed
    1. Diener TO (1971) Potato spindle tuber “virus”. IV. A replicating, low molecular weight RNA. Virology 45: 411–428. - PubMed
    1. Kruger K, Grabowski PJ, Zaug AJ, Sands J, Gottschling DE, et al. (1982) Self-splicing RNA: autoexcision and autocyclization of the ribosomal RNA intervening sequence of Tetrahymena. Cell 31: 147–157. - PubMed
    1. Guerrier-Takada C, Gardiner K, Marsh T, Pace N, Altman S (1983) The RNA moiety of ribonuclease P is the catalytic subunit of the enzyme. Cell 35: 849–857. - PubMed

Publication types