Studying RNA Homology and Conservation with Infernal: From Single Sequences to RNA Families
- PMID: 27322404
- PMCID: PMC5010141
- DOI: 10.1002/cpbi.4
Studying RNA Homology and Conservation with Infernal: From Single Sequences to RNA Families
Abstract
Emerging high-throughput technologies have led to a deluge of putative non-coding RNA (ncRNA) sequences identified in a wide variety of organisms. Systematic characterization of these transcripts will be a tremendous challenge. Homology detection is critical to making maximal use of functional information gathered about ncRNAs: identifying homologous sequence allows us to transfer information gathered in one organism to another quickly and with a high degree of confidence. ncRNA presents a challenge for homology detection, as the primary sequence is often poorly conserved and de novo secondary structure prediction and search remain difficult. This unit introduces methods developed by the Rfam database for identifying "families" of homologous ncRNAs starting from single "seed" sequences, using manually curated sequence alignments to build powerful statistical models of sequence and structure conservation known as covariance models (CMs), implemented in the Infernal software package. We provide a step-by-step iterative protocol for identifying ncRNA homologs and then constructing an alignment and corresponding CM. We also work through an example for the bacterial small RNA MicA, discovering a previously unreported family of divergent MicA homologs in genus Xenorhabdus in the process. © 2016 by John Wiley & Sons, Inc.
Keywords: RNA; Rfam; alignment; conservation; covariance model; homology; ncRNA.
Copyright © 2016 John Wiley & Sons, Inc.
Figures







Similar articles
-
Computational identification of functional RNA homologs in metagenomic data.RNA Biol. 2013 Jul;10(7):1170-9. doi: 10.4161/rna.25038. Epub 2013 May 20. RNA Biol. 2013. PMID: 23722291 Free PMC article. Review.
-
Non-Coding RNA Analysis Using the Rfam Database.Curr Protoc Bioinformatics. 2018 Jun;62(1):e51. doi: 10.1002/cpbi.51. Epub 2018 Jun 5. Curr Protoc Bioinformatics. 2018. PMID: 29927072 Free PMC article.
-
Rfam: annotating families of non-coding RNA sequences.Methods Mol Biol. 2015;1269:349-63. doi: 10.1007/978-1-4939-2291-8_22. Methods Mol Biol. 2015. PMID: 25577390
-
Search for 5'-leader regulatory RNA structures based on gene annotation aided by the RiboGap database.Methods. 2017 Mar 15;117:3-13. doi: 10.1016/j.ymeth.2017.02.009. Epub 2017 Mar 6. Methods. 2017. PMID: 28279853 Free PMC article.
-
Customized strategies for discovering distant ncRNA homologs.Brief Funct Genomic Proteomic. 2009 Nov;8(6):451-60. doi: 10.1093/bfgp/elp035. Epub 2009 Sep 24. Brief Funct Genomic Proteomic. 2009. PMID: 19779009 Review.
Cited by
-
Regulatory context drives conservation of glycine riboswitch aptamers.PLoS Comput Biol. 2019 Dec 20;15(12):e1007564. doi: 10.1371/journal.pcbi.1007564. eCollection 2019 Dec. PLoS Comput Biol. 2019. PMID: 31860665 Free PMC article.
-
Identification and Characterization of Non-protein Coding RNA Homologs in Serratia Marcescens by Comparative Transcriptomics.Indian J Microbiol. 2024 Mar;64(1):198-204. doi: 10.1007/s12088-023-01160-y. Epub 2023 Dec 14. Indian J Microbiol. 2024. PMID: 38468749 Free PMC article.
-
Ms1 RNA Interacts With the RNA Polymerase Core in Streptomyces coelicolor and Was Identified in Majority of Actinobacteria Using a Linguistic Gene Synteny Search.Front Microbiol. 2022 May 11;13:848536. doi: 10.3389/fmicb.2022.848536. eCollection 2022. Front Microbiol. 2022. PMID: 35633709 Free PMC article.
-
GERONIMO: A tool for systematic retrieval of structural RNAs in a broad evolutionary context.Gigascience. 2022 Dec 28;12:giad080. doi: 10.1093/gigascience/giad080. Epub 2023 Oct 17. Gigascience. 2022. PMID: 37848616 Free PMC article.
-
Small regulatory RNAs are mediators of the Streptococcus mutans SloR regulon.J Bacteriol. 2023 Sep 26;205(9):e0017223. doi: 10.1128/jb.00172-23. Epub 2023 Sep 11. J Bacteriol. 2023. PMID: 37695854 Free PMC article.
References
-
- Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. Journal of molecular biology. 1990;215:403–410. - PubMed
-
- Argaman L, Hershberg R, Vogel J, Bejerano G, Wagner EGH, Margalit H, Altuvia S. Novel small RNA-encoding genes in the intergenic regions of Escherichia coli. Current biology: CB. 2001;11:941–950. - PubMed
-
- Asai K, Kiryu H, Hamada M, Tabei Y, Sato K, Matsui H, Sakakibara Y, Terai G, Mituyama T. Software.ncrna.org: web servers for analyses of RNA sequences. Nucleic acids research. 2008;36:W75–W78. - PMC - PubMed
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources