Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Mar;16(3):e9275.
doi: 10.15252/msb.20199275.

Translational efficiency across healthy and tumor tissues is proliferation-related

Affiliations

Translational efficiency across healthy and tumor tissues is proliferation-related

Xavier Hernandez-Alias et al. Mol Syst Biol. 2020 Mar.

Erratum in

Abstract

Different tissues express genes with particular codon usage and anticodon tRNA repertoires. However, the codon-anticodon co-adaptation in humans is not completely understood, nor is its effect on tissue-specific protein levels. Here, we first validated the accuracy of small RNA-seq for tRNA quantification across five human cell lines. We then analyzed the tRNA abundance of more than 8,000 tumor samples from TCGA, together with their paired mRNA-seq and proteomics data, to determine the Supply-to-Demand Adaptation. We thereby elucidate that the dynamic adaptation of the tRNA pool is largely related to the proliferative state across tissues. The distribution of such tRNA pools over the whole cellular translatome affects the subsequent translational efficiency, which functionally determines a condition-specific expression program both in healthy and tumor states. Furthermore, the aberrant translational efficiency of some codons in cancer, exemplified by ProCCA and GlyGGT, is associated with poor patient survival. The regulation of these tRNA profiles is partly explained by the tRNA gene copy numbers and their promoter DNA methylation.

Keywords: tRNA; The Cancer Genome Atlas; codon usage; tissue; translation.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no conflict of interest.

Figures

Figure 1
Figure 1. tRNA quantification and modifications from small RNA‐seq data
  1. Schematic pipeline for accurate mapping of tRNA reads.

  2. Correlations between tRNA quantifications by small RNA‐seq and Hydro‐tRNAseq of matching (correlations within the same cell line) versus non‐matching (different cell lines) samples. Center values represent the median. The P‐value corresponds to a one‐tailed Wilcoxon rank‐sum test, with n matching = 9 and n non‐matching = 63.

  3. Overlap of the detected tRNA modifications upon variant calling by both methods.

  4. The TCGA network contains small RNA‐seq data alongside mRNA‐seq, DNA methylation arrays, non‐targeted proteomics, and copy number alteration quantification comprising 17 tissues.

Source data are available online for this figure.
Figure EV1
Figure EV1. Correlations between small RNA‐seq and hydro‐tRNAseq protocols, related to Fig 1
  1. The Spearman correlations between all HEK293 tRNA quantifications, including four small RNA‐seq datasets (Data ref: Flores et al, 2014b; Data ref: Mefferd et al, 2015b; Data ref: Torres et al, 2015b,c), four well‐established tRNAseq datasets using Hydro‐tRNAseq and DM‐tRNAseq (Data ref: Zheng et al, 2015b; Data ref: Gogakos et al, 2017b; Data ref: Mattijssen et al, 2017b; Data ref: Benisty et al, 2019b), and one previously published quantification from small RNA‐seq data by Zhang et al (2018). The scatter plots are square‐root‐normalized for better visualization.

  2. Principal component analysis (PCA) and linear discriminant analysis (LDA) of all absolute tRNA quantifications of HCT116, HELA, HEK293, and BJ fibroblasts, both from small RNA‐seq and from Hydro‐tRNAseq data. On the left, the first component of the PCA (method‐related) explains 31.59% variance and the second component (cell‐line‐related) explains 30.83% variance. On the right, the first component of the LDA separates completely all four cell lines, regardless of the method.

Figure EV2
Figure EV2. Comparison between raw sequencing data from small RNA‐seq (left) and Hydro‐tRNAseq (right), related to Fig 1
  1. Length distribution of all reads mapping to tRNA genes.

  2. Fraction of the tRNA sequence that is covered by 1 or more reads, for each of the tRNA isoacceptors.

  3. Absolute number of reads mapping along the sequence of tRNAProAGG. The relative abundance between cell lines is not comparable given their different sequencing depths.

Figure EV3
Figure EV3. Nucleotide modifications of HEK293 on the consensus tRNA structure, related to Fig 1
  1. A, B

    Over the consensus mature tRNA structure of tRNAdb (Jühling et al, 2009), the most frequent detected variants are depicted for both Hydro‐tRNAseq (A) and small RNA‐seq (B). The color code represents the number of tRNAs showing the modification at each specific position of the tRNA model.

  2. C

    Consensus modifications detected by both methods. The color intensity refers to the sum of mismatch calls of the two methods altogether. Refer to Table EV3 for a detailed list of modifications for all cell lines.

Figure 2
Figure 2. Proliferation is the major driver of tissue specificity in tRNAs
  1. Medians of square‐root‐normalized tRNA abundances across all TCGA tissues. The color of the tissue labels corresponds to the average Ki67 expression. Refer to Table EV4 for full cancer type names and number of samples.

  2. Principal component analysis (PCA) of the relative anticodon abundances (RAA, see Materials and Methods) of all healthy samples of TCGA, where the color scale corresponds to the mean tissue expression of Ki67. The Spearman correlations of Ki67 with the components are shown, as well as the samples of most extreme tissues.

  3. Top positive and negative GO terms upon gene set enrichment analysis (GSEA) of the correlations of the first PCA component against all genes.

Data information: Abbreviations stand for BLCA (bladder urothelial carcinoma), BRCA (breast invasive carcinoma), CESC (cervical squamous cell carcinoma and endocervical adenocarcinoma), CHOL (cholangiocarcinoma), COAD (colon adenocarcinoma), ESCA (esophageal carcinoma), GBM (glioblastoma multiforme), HNSC (head and neck squamous cell carcinoma), KICH (kidney chromophobe), KIRC (kidney renal clear cell carcinoma), KIRP (kidney renal papillary cell carcinoma), LIHC (liver hepatocellular carcinoma), LUAD (lung adenocarcinoma), LUSC (lung squamous cell carcinoma), PAAD (pancreatic adenocarcinoma), PCPG (pheochromocytoma and paraganglioma), PRAD (prostate adenocarcinoma), READ (rectum adenocarcinoma), SKCM (skin cutaneous melanoma), STAD (stomach adenocarcinoma), THCA (thyroid carcinoma), THYM (thymoma), UCEC (uterine corpus endometrial carcinoma).
Figure 3
Figure 3. tRNA repertoires determine tissue‐specific translational efficiency
  1. Three metrics of translational efficiency (the classical tAI, a relative tAI with normalized weights within each amino acid family, and the Supply‐to‐Demand Adaptation described in this article) are Spearman correlated against two proxies of translation (protein abundance and protein‐to‐mRNA ratio) for all samples for which proteomics data are available (BRCA, COAD and READ). Center values represent the median. Statistical differences are determined by sample‐paired two‐tailed Wilcoxon rank‐sum test (n = 219).

  2. Principal component analysis (PCA) of the SDAw of TCGA, where the color scale corresponds to the mean tissue expression of Ki67. The Spearman correlations of Ki67 with the components are shown, as well as the samples of most extreme tissues. On the right, the top and bottom proliferation‐ and differentiation‐related codons, as defined by Gingold et al (2014), ordered by their contribution to the first PCA component. Refer to Table EV4 for full cancer type names and number of samples.

  3. GSEA of the differential SDA between extreme tissues (ΔSDA = SDAColorectal ‐ SDABrain), showing the top five GO terms with high (left) and low (right) SDA in colorectal versus glial tissues.

Figure EV4
Figure EV4. Supply‐to‐Demand Adaptation across TCGA samples, related to Fig 3
  1. Scatter plots against the proteome of the breast cancer sample TCGA‐A2‐A0T2‐01A. Three metrics of translational efficiency (the classical tAI, a relative tAI with normalized weights within each amino acid family, and the Supply‐to‐Demand Adaptation described in this article) are plotted against two proxies of translation (protein abundance and protein‐to‐mRNA ratio). Pearson's correlation is only determined for protein abundance, since the assumption of normality is not fulfilled by the ratios.

  2. Pearson's correlations of all samples for which proteomics data are available (BRCA, COAD and READ). Center values represent the median. Statistical differences are determined by sample‐paired two‐tailed Wilcoxon rank‐sum test (n = 219).

  3. Medians of Supply‐to‐Demand Adaptation weights (SDAw) across all TCGA tissues. The dendrograms show a hierarchical clustering among tissues (right) and among SDAw (top). The tissue labels correspond to the average Ki67 expression. Refer to Table EV4 for full cancer type names and number of samples.

Figure EV5
Figure EV5. Differential expression of tRNAs between healthy and tumor samples across 22 cancer types, related to Fig 4
  1. tRNA abundance is measured as relative anticodon abundance, and the fold change is log2(RAATumor/RAAHealthy). Refer to Table EV4 for full cancer type names and number of samples.

  2. Absolute tRNA differential abundance by log2(tRNATumor/tRNAHealthy). Only significant differences are colored, which are determined using a two‐tailed Wilcoxon rank‐sum test and corrected for multiple testing by FDR.

Figure 4
Figure 4. Aberrant translational efficiencies drive tumor progression
  1. Differential SDAw between healthy and tumor samples across 22 cancer types, as measured by log2(SDAwTumor/SDAwHealthy). Only significant differences are colored, which are determined using a two‐tailed Wilcoxon rank‐sum test and corrected for multiple testing by FDR. Refer to Table EV4 for full cancer type names and number of samples.

  2. Boxplot of the SDAw of ProCAA and AlaGCG codons across TCGA cancer types. Boxes expand from the first to the third quartile, with the center values indicating the median. The whiskers define a confidence interval of median ± 1.58*IQR/sqrt(n). Statistical differences are determined using a FDR‐corrected two‐tailed Wilcoxon rank‐sum test ns (P > 0.05), * (P ≤ 0.05), ** (P ≤ 0.01), *** (P ≤ 0.001), **** (P ≤ 0.0001).

  3. Survival curves for the previous codons in KIRC, KIRP, and BLCA patients. The survival analysis was performed for all codons whose SDAw were significantly different in more than 5 cancer types in the one direction with respect to the other [Abs(UP‐DOWN) > 5] and correspondingly corrected for multiple comparisons using FDR.

Figure 5
Figure 5. Promoter methylation and gene copy number regulate tRNA abundance
  1. A multiple linear regression (MLR) between square‐root‐normalized tRNA abundance and the average promoter methylation (450K BeadChip array) and gene copy number at the isoacceptor level. Among all MLRs for each isoacceptor and each cancer type separately, the dots show the FDR‐normalized significant coefficients based on their corresponding t‐statistic P‐value, and red/blue shows whether they are negative/positive, respectively. The P‐value corresponds to a two‐tailed binomial test between npos and nneg.

  2. Differential promoter methylation (bisulfite sequencing) between healthy and tumor samples of genes expressing proline tRNAs, as measured by Δ%Me = (%MeTumor‐%MeHealthy). Refer to Table EV4 for full cancer type names and number of samples.

Figure EV6
Figure EV6. Differential methylation and copy number between healthy and tumor samples of tRNA genes, related to Fig 5
  1. Differential average methylation (bisulfite sequencing) for the isoacceptors, averaged over all genes that have a certain anticodon. The difference is measured by Δ%Me = (%MeTumor‐%MeHealthy). Refer to Table EV4 for full cancer type names and number of samples.

  2. Differential average gene copy number for the isoacceptors, averaged over all genes that have a certain anticodon. The difference is measured by ΔCNA = (CNATumor‐CNAHealthy). Only significant differences are colored, which are determined using a two‐tailed Wilcoxon rank‐sum test and corrected for multiple testing by FDR.

  3. Differential gene copy number between healthy and tumor samples of genes expressing proline tRNAs, as measured by ΔCNA. Only significant differences are colored, which are determined using a two‐tailed Wilcoxon rank‐sum test and corrected for multiple testing by FDR.

References

    1. Alexaki A, Kames J, Holcomb DD, Athey J, Santana‐Quintero LV, Lam PVN, Hamasaki‐Katagiri N, Osipova E, Simonyan V, Bar H et al (2019) Codon and codon‐pair usage tables (CoCoPUTs): facilitating genetic variation analyses and recombinant gene design. J Mol Biol 431: 2434–2441 - PubMed
    1. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ (1990) Basic local alignment search tool. J Mol Biol 215: 403–410 - PubMed
    1. Athey J, Alexaki A, Osipova E, Rostovtsev A, Santana‐Quintero LV, Katneni U, Simonyan V, Kimchi‐Sarfaty C (2017) A new and updated resource for codon usage tables. BMC Bioinformatics 18: 391 - PMC - PubMed
    1. Benisty H, Weber M, Hernandez‐Alias X, Schaefer MH, Serrano L (2019a) Proliferation specific codon usage facilitates oncogene translation. bioRxiv 10.1101/695957 [PREPRINT] - DOI - PMC - PubMed
    1. Benisty H, Weber M, Hernandez‐Alias X, Schaefer MH, Serrano L (2019b) ArrayExpress E‐MTAB‐8144 https://www.ebi.ac.uk/arrayexpress/experiments/E-MTAB-8144/ [DATASET]

Publication types