Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jul 26;23(15):8221.
doi: 10.3390/ijms23158221.

Circ-LocNet: A Computational Framework for Circular RNA Sub-Cellular Localization Prediction

Affiliations

Circ-LocNet: A Computational Framework for Circular RNA Sub-Cellular Localization Prediction

Muhammad Nabeel Asim et al. Int J Mol Sci. .

Abstract

Circular ribonucleic acids (circRNAs) are novel non-coding RNAs that emanate from alternative splicing of precursor mRNA in reversed order across exons. Despite the abundant presence of circRNAs in human genes and their involvement in diverse physiological processes, the functionality of most circRNAs remains a mystery. Like other non-coding RNAs, sub-cellular localization knowledge of circRNAs has the aptitude to demystify the influence of circRNAs on protein synthesis, degradation, destination, their association with different diseases, and potential for drug development. To date, wet experimental approaches are being used to detect sub-cellular locations of circular RNAs. These approaches help to elucidate the role of circRNAs as protein scaffolds, RNA-binding protein (RBP) sponges, micro-RNA (miRNA) sponges, parental gene expression modifiers, alternative splicing regulators, and transcription regulators. To complement wet-lab experiments, considering the progress made by machine learning approaches for the determination of sub-cellular localization of other non-coding RNAs, the paper in hand develops a computational framework, Circ-LocNet, to precisely detect circRNA sub-cellular localization. Circ-LocNet performs comprehensive extrinsic evaluation of 7 residue frequency-based, residue order and frequency-based, and physio-chemical property-based sequence descriptors using the five most widely used machine learning classifiers. Further, it explores the performance impact of K-order sequence descriptor fusion where it ensembles similar as well dissimilar genres of statistical representation learning approaches to reap the combined benefits. Considering the diversity of statistical representation learning schemes, it assesses the performance of second-order, third-order, and going all the way up to seventh-order sequence descriptor fusion. A comprehensive empirical evaluation of Circ-LocNet over a newly developed benchmark dataset using different settings reveals that standalone residue frequency-based sequence descriptors and tree-based classifiers are more suitable to predict sub-cellular localization of circular RNAs. Further, K-order heterogeneous sequence descriptors fusion in combination with tree-based classifiers most accurately predict sub-cellular localization of circular RNAs. We anticipate this study will act as a rich baseline and push the development of robust computational methodologies for the accurate sub-cellular localization determination of novel circRNAs.

Keywords: circular RNA; classification; machine learning; non-coding RNA; nucleotide frequency; nucleotide physico-chemical properties; sub-cellular localization dataset; subcellular localization; web server.

PubMed Disclaimer

Conflict of interest statement

On behalf of all authors, the corresponding author declares that there are no competing personal or financial interest.

Figures

Figure 1
Figure 1
A Hierarchical Classification of Non-Coding RNAs.
Figure 2
Figure 2
Standard Specificity Figures of 7 Different Sequence Descriptors Against 5 Different Classifiers.
Figure 3
Figure 3
Standard MCC Figures of 7 Different Sequence Descriptors Against 5 Different Classifiers.
Figure 4
Figure 4
Average Specificity and MCC Figures of 7 Different Sequence Sequence Descriptors Against 5 Different Classifiers.
Figure 5
Figure 5
AU-ROC Performance Figures Produced by 5 Different Classifiers Using 3 K-gap, 3 K-mer and 2 simple Sequence Sequence Descriptors on a Benchmark Circular RNA Sub-Cellular Localization Dataset, (a) TriMonoKgap Peak Performance using 5-mers, (b) DiMonoKgap Peak Performance using 2-mers (c) RCKmer Peak Performance using 5-mers, (d) Kmer Peak Performance using 5-mers, (e) PseudoKNC Peak Performance using 5-mers, (f) EIIP Peak Performance, (g) Z-Curve Peak Performance.
Figure 6
Figure 6
CircLoc-Net: A Computational Framework for Sub-cellular Localization Prediction of circRNAs.
Figure 7
Figure 7
Process of Generating Sequence K-mers (e.g., 3-mers), Where each Particular Color Frame Denotes a Unique 3-mer.
Figure 8
Figure 8
Workflow of Generating circular RNA Sub-cellular Localization dataset comprised of following steps: Collecting raw sequences and associated sub-cellular localization’s, Eliminating Redundancy, and Transforming the dataset into Standard format. Bar Chart and Pie Graph illustrates Statistics of Dataset.

References

    1. Iyer M.K., Niknafs Y.S., Malik R., Singhal U., Sahu A., Hosono Y., Barrette T.R., Prensner J.R., Evans J.R., Zhao S., et al. The landscape of long noncoding RNAs in the human transcriptome. Nat. Genet. 2015;47:199–208. doi: 10.1038/ng.3192. - DOI - PMC - PubMed
    1. Cheng L., Leung K.S. Quantification of non-coding RNA target localization diversity and its application in cancers. J. Mol. Cell Biol. 2018;10:130–138. doi: 10.1093/jmcb/mjy006. - DOI - PubMed
    1. Frías-Lasserre D., Villagra C.A. The importance of ncRNAs as epigenetic mechanisms in phenotypic variation and organic evolution. Front. Microbiol. 2017;8:2483. doi: 10.3389/fmicb.2017.02483. - DOI - PMC - PubMed
    1. Batista P.J., Chang H.Y. Long noncoding RNAs: Cellular address codes in development and disease. Cell. 2013;152:1298–1307. doi: 10.1016/j.cell.2013.02.012. - DOI - PMC - PubMed
    1. Meng X., Li X., Zhang P., Wang J., Zhou Y., Chen M. Circular RNA: An emerging key player in RNA world. Briefings Bioinform. 2017;18:547–557. doi: 10.1093/bib/bbw045. - DOI - PubMed

Grants and funding

LinkOut - more resources