Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Feb 10;3(1):8.
doi: 10.3390/ncrna3010008.

CirComPara: A Multi-Method Comparative Bioinformatics Pipeline to Detect and Study circRNAs from RNA-seq Data

Affiliations

CirComPara: A Multi-Method Comparative Bioinformatics Pipeline to Detect and Study circRNAs from RNA-seq Data

Enrico Gaffo et al. Noncoding RNA. .

Abstract

Circular RNAs (circRNAs) are generated by backsplicing of immature RNA forming covalently closed loops of intron/exon RNA molecules. Pervasiveness, evolutionary conservation, massive and regulated expression, and posttranscriptional regulatory roles of circRNAs in eukaryotes have been appreciated and described only recently. Moreover, being easily detectable disease markers, circRNAs undoubtedly represent a molecular class with high bearing on molecular pathobiology. CircRNAs can be detected from RNAseq data using appropriate computational methods to identify the sequence reads spanning backsplice junctions that do not colinearly map to the reference genome. To this end, several programs were developed and critical assessment of various strategies and tools suggested the combination of at least two methods as good practice to guarantee robust circRNA detection. Here,we present CirComPara (http://github.com/egaffo/CirComPara), an automated bioinformatics pipeline, to detect, quantify and annotate circRNAs from RNAseq data using in parallel four different methods for backsplice identification. CirComPara also provides quantification of linear RNAs and gene expression, ultimately comparing and correlating circRNA and gene/transcript expression level. We applied our method to RNAseqdata of monocyte and macrophage samples in relation to haploinsufficiency of the RNAbinding splicing factor Quaking (QKI). The biological relevance of the results, in terms of number, types and variations of circRNAs expressed, illustrates CirComPara potential to enlarge the knowledge of the transcriptome, adding details on the circRNAome, and facilitating further computational and experimental studies.

Keywords: circular RNA; CirComPara; Quaking; RNA‐seq; bioinformatics pipeline; monocytes.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
(A) CirComPara workflow. Round corner boxes represent inputs; currently used tools are represented by gray labels next to the relative pipeline level; dotted lines represent optional functions; (BD) CirComPara summary plots of circular RNAs (circRNAs) expressed; (B) absolute number of circRNAs detected by each method and (C) commonly detected by two or more methods; (D) number of circRNAs expressed per sample, considering the whole set of detected back-splices and the selected subset of circRNAs detected by at least two methods.
Figure 2
Figure 2
CirComPara summary plots of circRNA and gene expression and integration thereof. (A,B) twin boxplots of circRNA and gene expression levels per sample; (C,D) cumulative expression plots of circRNA and gene per sample; (E) frequency distribution of number of circRNAs per gene; (F) density distribution of pairwise circRNA/gene Spearman correlation values.
Figure 3
Figure 3
circRNAome expression varies in relation to Quaking (QKI) haploinsufficiency during monocyte to macrophage induced differentiation. (A) number of circRNAs expressed per sample; (B) waterfall plot of log2FC in the QKI +/− vs. QKI +/+ in the two cell types, for all the expressed circRNAs; (C) number of circRNAs with absolute log2FC > 1.5 or < −1.5 when comparing QKI haploinsufficient with control cells, separately considering monocytes and macrophages; (D) QKI circular isoforms detected from RNA-seq (the table indicates for each circRNA the genomic coordinates of the back-splice ends, the expression level per sample and the intensity of observed log2FC).

References

    1. Lasda E., Parker R. Circular RNAs: Diversity of form and function. RNA. 2014;20:1829–1842. doi: 10.1261/rna.047126.114. - DOI - PMC - PubMed
    1. Dean M., Fojo T., Bates S. Tumour stem cells and drug resistance. Nat. Rev. Cancer. 2005;5:275–284. doi: 10.1038/nrc1590. - DOI - PubMed
    1. Bachmayr-Heyda A., Reiner A.T., Auer K., Sukhbaatar N., Aust S., Bachleitner-Hofmann T., Mesteri I., Grunt T.W., Zeillinger R., Pils D. Correlation of circular RNA abundance with proliferation—Exemplified with colorectal and ovarian cancer, idiopathic lung fibrosis, and normal human tissues. Sci. Rep. 2015;5:8057. doi: 10.1038/srep08057. - DOI - PMC - PubMed
    1. Salzman J., Chen R.E., Olsen M.N., Wang P.L., Brown P.O. Cell-Type Specific Features of Circular RNA Expression. PLoS Genet. 2013;9:e1003777. doi: 10.1371/annotation/f782282b-eefa-4c8d-985c-b1484e845855. - DOI - PMC - PubMed
    1. Bonizzato A., Gaffo E., te Kronnie G., Bortoluzzi S. CircRNAs in hematopoiesis and hematological malignancies. Blood Cancer J. 2016;6:e483. doi: 10.1038/bcj.2016.81. - DOI - PMC - PubMed

LinkOut - more resources