Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010;11(6):R61.
doi: 10.1186/gb-2010-11-6-r61. Epub 2010 Jun 15.

Design and evaluation of genome-wide libraries for RNA interference screens

Affiliations

Design and evaluation of genome-wide libraries for RNA interference screens

Thomas Horn et al. Genome Biol. 2010.

Abstract

RNA interference (RNAi) screens have enabled the systematic analysis of many biological processes in cultured cells and whole organisms. The success of such screens and the interpretation of the data depend on the stringent design of RNAi libraries. We describe and validate NEXT-RNAi, a software for the automated design and evaluation of RNAi sequences on a genome-wide scale. NEXT-RNAi is implemented as open-source software and is accessible at http://www.nextrnai.org/.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Quality control parameters for RNAi reagents at different stages of the design pipeline. (a) Long dsRNAs that have regions of low complexity, for example, CA[ATCG] repeats or simple nucleotide repeats, can exert unspecific and cytotoxic effects. The quality of the primer designs used to synthesize amplicons from DNA sources is crucial, in particular when the synthesis is performed in 96- or 384-well formats where primers should have similar melting temperatures. (b) Dicer-mediated cleavage of long dsRNAs leads to the generation of siRNAs of lengths between 19 and 23 nucleotides [29]. The quality of siRNAs depends on their ability to efficiently enter the RNA-induced silencing complex (RISC) and to access the target mRNA. This is influenced by thermodynamic properties, base preferences and chemical modifications. The specificity of siRNAs is influenced by sequence-independent and sequence-dependent features. siRNAs can trigger interferon responses or show concentration-dependent cytotoxic effects, independent of their sequence. Silencing of unintended target transcripts can occur through perfect and imperfect sequence homologies to the siRNA and through 'seed matches' to the transcript 3' UTRs. See text for details.
Figure 2
Figure 2
Overview of the NEXT-RNAi workflow. NEXT-RNAi requires a defined set of input files in FASTA or tab-delimited formats. First, the program filters the input target sequences for six (default) or more contiguous CAN repeats and for other regions of low complexity (for example, simple nucleotide repeats) using mdust. Sequences are then 'diced' to generate all possible siRNA sequences with a default length of 19 nucleotides (nt) and an offset of 1 nucleotide. Subsequently, each siRNA is mapped to a user-defined off-target database (for example, the whole transcriptome) with Bowtie [37] to determine its specificity. The specificity is set to one if the siRNA targets a single gene or to zero otherwise. In the next step, the predicted efficiency of each 19-nucleotide siRNA is computed. Two methods can be selected, the 'rational' method according to Reynolds et al. [9] and the 'weighted' method according to Shah et al. [12], assigning each siRNA an efficiency score between 0 and 100. Optionally, the seed complement frequency for each siRNA can be computed for any FASTA file provided (for example, a file containing 3' UTR sequences). siRNAs that did not pass the low-complexity filters, show perfect homology to multiple target genes or do not meet the user-defined cutoffs for efficiency or seed complement frequency are excluded from the queried target sequences. Remaining sequences are used as templates for primer design (with Primer3 [36]) for long dsRNAs or are directly subjected to the final ranking for the design of siRNAs. Designs are ranked by (i) their predicted specificity and (ii) their predicted efficiency and, in the case of siRNA designs, (iii) their calculated seed complement frequency. Sequences can also be evaluated for additional features, such as homology to unintended transcripts, or SNP and UTR contents. Final designs can be visualized using GBrowse [40]. All results are presented in a comprehensive HTML report and are also exported to text files.
Figure 3
Figure 3
Example of design and filter methods applied by NEXT-RNAi. (a) Visualization of the Paf-AHalpha gene model and transcripts. Regions labeled as 'common region' serve as input for NEXT-RNAi (ORF = open reading frame, UTR = untranslated region). (b) Quality measures computed by NEXT-RNAi for the common regions. Blue and red regions label predicted low-complexity regions (including CAN repeats) and 19-nucleotide off-target regions, respectively. The lower panel shows the predicted siRNA efficiency according to Shah et al. [12] (averaged for ten siRNAs). (c) NEXT-RNAi predictions of optimal target sites (green) after discarding 19-nucleotide off-target and low complexity regions and regions <150 nucleotides or >250 nucleotides. If available, these regions are directly used as templates for primer designs (left and middle panels). Otherwise, a redesign method is used that connects closest 'optimal' neighbors until a region suitable for primer designs is identified (right panel). Potential dsRNAs are finally ranked by their specificity and efficiency.
Figure 4
Figure 4
Pairwise comparison of Drosophila RNAi libraries. Eight Drosophila RNAi libraries are compared to identify the number of genes that are targeted by multiple libraries (outer ring, light grey) and the number of genes that are targeted by independent designs (inner ring, dark grey). The reference for the ring sizes is the combination of libraries commonly targeting the most genes (HD2/DRSCv2: 13,197 genes).
Figure 5
Figure 5
Knock-down validation of Drosophila phosphatases. (a) Experimental workflow for knock-down validation. (b) Frequencies of observed knock-down efficiencies for independent designs. (c) The number of genes efficiently silenced (knock-down >66%) by both independent designs, only one or neither design. qPCR, quantitative RT-PCR.

Similar articles

Cited by

References

    1. Fire A, Xu S, Montgomery MK, Kostas SA, Driver SE, Mello CC. Potent and specific genetic interference by double-stranded RNA in Caenorhabditis elegans. Nature. 1998;391:806–811. doi: 10.1038/35888. - DOI - PubMed
    1. Chapman EJ, Carrington JC. Specialization and evolution of endogenous small RNA pathways. Nat Rev Genet. 2007;8:884–896. doi: 10.1038/nrg2179. - DOI - PubMed
    1. Boutros M, Ahringer J. The art and design of genetic screens: RNA interference. Nat Rev Genet. 2008;9:554–566. doi: 10.1038/nrg2364. - DOI - PubMed
    1. Dietzl G, Chen D, Schnorrer F, Su KC, Barinova Y, Fellner M, Gasser B, Kinsey K, Oppel S, Scheiblauer S, Couto A, Marra V, Keleman K, Dickson BJ. A genome-wide transgenic RNAi library for conditional gene inactivation in Drosophila. Nature. 2007;448:151–156. doi: 10.1038/nature05954. - DOI - PubMed
    1. Fuchs F, Boutros M. Cellular phenotyping by RNAi. Brief Funct Genomic Proteomic. 2006;5:52–56. doi: 10.1093/bfgp/ell007. - DOI - PubMed

Publication types

Substances