Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Apr;35(4):350-353.
doi: 10.1038/nbt.3807. Epub 2017 Mar 6.

Prediction of potent shRNAs with a sequential classification algorithm

Affiliations

Prediction of potent shRNAs with a sequential classification algorithm

Raphael Pelossof et al. Nat Biotechnol. 2017 Apr.

Abstract

We present SplashRNA, a sequential classifier to predict potent microRNA-based short hairpin RNAs (shRNAs). Trained on published and novel data sets, SplashRNA outperforms previous algorithms and reliably predicts the most efficient shRNAs for a given gene. Combined with an optimized miR-E backbone, >90% of high-scoring SplashRNA predictions trigger >85% protein knockdown when expressed from a single genomic integration. SplashRNA can significantly improve the accuracy of loss-of-function genetics studies and facilitates the generation of compact shRNA libraries.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Computational modeling of advancements in shRNA technology. (a) Sequential advances in shRNA dataset development. The schematic shows diverse biological shRNA potency datasets and their feature and class label distribution biases. Unbiased large-scale sets include a comprehensive representation of negatives but contain few positives (left panel). Sets selected using prediction tools show higher rates of positives, leading to a more complete representation of this class, at the cost of changing the feature distribution of the negatives (middle panel). Use of the optimized miR-E backbone that boosts primary microRNA processing changes the requirements for potent RNAi, altering the target prediction rule (right panel). (b) Concept and equation of SplashRNA. We model the advancement in shRNA technology as a sequential support vector machine (SVM) classifier. The first classifier is trained on miR-30 data to remove non-functional sequences and the second classifier is trained on miR-E data to increase prediction performance of the remaining shRNAs. The final output is a weighted combination of the scores from both classifiers.
Figure 2
Figure 2
Benchmarking SplashRNA prediction performance. (a) Precision-recall curves of SplashRNA performance on the external shERWOOD miR-30 dataset. The first classifier alone, SplashmiR-30 (area under the precision-recall curve, auPR: 0.615), shows the best performance. SplashRNA (area under the precision-recall curve, auPR: 0.506) compromises slightly on miR-30 data to increase prediction accuracy on miR-E shRNAs (b), while still outperforming three other si/shRNA prediction tools (DSIR, seqScore, miR_Scan). (b) SplashRNA performance on miR-E data. SplashRNA (auPR: 0.611) clearly outperforms the miR-30 classifier alone (auPR: 0.572) as well as three other prediction tools. (c) Identification of “gold-standard” essential genes. The hit detection accuracy of top SplashRNA predictions was compared to larger sets of shRNAs and to CRISPR tools. Prediction performance as measured by the area under the receiver operating characteristic (auROC) curve indicates that the accuracy of the top 10 SplashRNA predictions is no different than the performance obtained by testing 25 shRNAs (the entire library). Additionally, the 10 top scoring shRNAs are significantly better predictors of the gold-standard genes set than the 10 bottom scoring shRNAs by SplashRNA (p < 0.001, empirical permutation test) and the bottom 5 SplashRNA predictions have lesser predictive value than the bottom 10 (auROC: 0.747 vs. 0.819, respectively). The dashed line represents the 10% false positive rate (FPR) threshold. (d-e) Western blot validation of de novo SplashRNA predictions. All shRNAs were expressed using LEPG at single-copy conditions. β-Actin (Actb) was used for normalization. Short (top) and long (bottom) exposures are shown. Immunoblotting of (d) Pten (median knockdown 96%, median score 1.60) and (e) Bap1 (median knockdown 93%, median score 1.05) in NIH/3T3s (Sup Figure 6i). C, miR-30 and miR-E control shRNAs. (f) Score distribution of fifth highest SplashRNA predictions for all human and mouse genes, indicating the proportion of genes with 5 predictions above a given score. Predictions were run only on the intersection of all transcript variants per gene and after shortening of transcripts due to ApA. The inset shows the score distribution of all human and mouse SplashRNA predictions. The kink in the curves represents the transition from SplashmiR-30 to combined SplashRNA scores. At least 80% of genes have five shRNAs with prediction scores above 1.

References

    1. Fellmann C, Lowe SW. Stable RNA interference rules for silencing. Nat Cell Biol. 2014;16:10–8. - PMC - PubMed
    1. Guda S, et al. miRNA-embedded shRNAs for Lineage-specific BCL11A Knockdown and Hemoglobin F Induction. Mol Ther. 2015;23:1465–74. - PMC - PubMed
    1. Grimm D, et al. Fatality in mice due to oversaturation of cellular microRNA/short hairpin RNA pathways. Nature. 2006;441:537–541. - PubMed
    1. McBride JL, et al. Artificial miRNAs mitigate shRNA-mediated toxicity in the brain: implications for the therapeutic development of RNAi. Proc Natl Acad Sci U S A. 2008;105:5868–5873. - PMC - PubMed
    1. Baek ST, et al. Off-target effect of doublecortin family shRNA on neuronal migration associated with endogenous microRNA dysregulation. Neuron. 2014;82:1255–1262. - PMC - PubMed

Supplementary references

    1. Yi R, Doehle BP, Qin Y, Macara IG, Cullen BR. Overexpression of exportin 5 enhances RNA interference mediated by short hairpin RNAs and microRNAs. RNA. 2005;11:220–226. - PMC - PubMed
    1. Boudreau RL, Martins II, Davidson BL. Artificial MicroRNAs as siRNA Shuttles: Improved Safety as Compared to shRNAs In vitro and In vivo. Mol Ther. 2008;17:169–175. - PMC - PubMed
    1. Sigoillot FD, et al. A bioinformatics method identifies prominent off-targeted transcripts in RNAi screens. Nat Methods. 2012;9:363–366. - PMC - PubMed
    1. Khvorova A, Reynolds A, Jayasena SD. Functional siRNAs and miRNAs exhibit strand bias. Cell. 2003;115:209–216. - PubMed
    1. Reynolds A, et al. Rational siRNA design for RNA interference. Nat Biotechnol. 2004;22:326–30. - PubMed

Publication types

MeSH terms

Substances