Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Feb 8;8(1):5.
doi: 10.3390/life8010005.

Intrinsic Properties of tRNA Molecules as Deciphered via Bayesian Network and Distribution Divergence Analysis

Affiliations

Intrinsic Properties of tRNA Molecules as Deciphered via Bayesian Network and Distribution Divergence Analysis

Sergio Branciamore et al. Life (Basel). .

Abstract

The identity/recognition of tRNAs, in the context of aminoacyl tRNA synthetases (and other molecules), is a complex phenomenon that has major implications ranging from the origins and evolution of translation machinery and genetic code to the evolution and speciation of tRNAs themselves to human mitochondrial diseases to artificial genetic code engineering. Deciphering it via laboratory experiments, however, is difficult and necessarily time- and resource-consuming. In this study, we propose a mathematically rigorous two-pronged in silico approach to identifying and classifying tRNA positions important for tRNA identity/recognition, rooted in machine learning and information-theoretic methodology. We apply Bayesian Network modeling to elucidate the structure of intra-tRNA-molecule relationships, and distribution divergence analysis to identify meaningful inter-molecule differences between various tRNA subclasses. We illustrate the complementary application of these two approaches using tRNA examples across the three domains of life, and identify and discuss important (informative) positions therein. In summary, we deliver to the tRNA research community a novel, comprehensive methodology for identifying the specific elements of interest in various tRNA molecules, which can be followed up by the corresponding experimental work and/or high-resolution position-specific statistical analyses.

Keywords: bayesian networks; distribution divergence; information theory; operational code; tRNA identity; tRNA recognition.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.The founding sponsors had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, and in the decision to publish the results.

Figures

Figure 1
Figure 1
tRNA sequence alignment. The first row corresponds to the “standard” numbering scheme. The second row is the consensus sequence. Structural parts of tRNA molecules are highlighted in color (see text for details).
Figure 2
Figure 2
Bayesian network built from the full set of tRNA sequences. (a) direct visualization of the PDAG (Probabilistic Directed Acyclic Graph); (b) same, superimposed on the secondary tRNA structure. Nodes in the network correspond to the variables (specifically, tRNA positions, as enumerated in Figure 1, first row), and edges to the dependencies between the variables. “Boldness” of the edge is proportional to the dependency strength, also indicated by the number shown next to the edge. See text for BN construction details. (Label “100” does not refer to a tRNA position but rather is a placeholder for the cognate aa variable, appearing here for the technical convenience reasons only; directionality of the edge (arrow) is for mathematical convenience reasons only as well, and does not imply causation.)
Figure 3
Figure 3
Position “importance” profile for Gly tRNAs, shown for three life domains: Archaea (a,d), Bacteria (b,e) and Eukarya (c,f). Relative Entropy is shown as function of tRNA position (ac) (enumerated as in Figure 1, first row), or visualized as color intensity superimposed over the secondary tRNA structure (df). Significance cutoff limit is shown as a red line in (ac)—see text for discussion.
Figure 4
Figure 4
Summary visualization of the tRNA position “importance”, for all aa tRNA subclasses, shown for three life domains: Archaea (a); Bacteria (b) and Eukarya (c). Higher values correspond to “hotter” colors. tRNA position numbering is as in Figure 3. This is a summary visualization of the detailed plots presented in Supplemental Figures S1–S22.

References

    1. Schimmel P. Development of tRNA synthetases and connection to genetic code and disease. Protein Sci. 2008;17:1643–1652. doi: 10.1110/ps.037242.108. - DOI - PMC - PubMed
    1. Giegé R., Eriani G. eLS. John Wiley & Sons, Ltd.; Hoboken, NJ, USA: 2014. Transfer RNA Recognition and Aminoacylation by Synthetases.
    1. Eriani G., Karam J., Jacinto J., Richard E.M., Geslain R. MIST, a Novel Approach to Reveal Hidden Substrate Specificity in Aminoacyl-tRNA Synthetases. PLoS ONE. 2015;10:e0130042. doi: 10.1371/journal.pone.0130042. - DOI - PMC - PubMed
    1. Cvetesic N., Gruic-Sovulj I. Synthetic and editing reactions of aminoacyl-tRNA synthetases using cognate and non-cognate amino acid substrates. Methods. 2017;113:13–26. doi: 10.1016/j.ymeth.2016.09.015. - DOI - PubMed
    1. Sapienza P., Li L., Williams T., Lee A., Carter C.W., Jr. An Ancestral Tryptophanyl-tRNA Synthetase Precursor Achieves High Catalytic Rate Enhancement without Ordered Ground-State Tertiary Structures. ACS Chem. Biol. 2016;11:1661–1668. doi: 10.1021/acschembio.5b01011. - DOI - PMC - PubMed

LinkOut - more resources