Customized strategies for discovering distant ncRNA homologs
- PMID: 19779009
- DOI: 10.1093/bfgp/elp035
Abstract
A large fraction of non-coding RNAs is short and/or poorly conserved in sequence. Most of the longer examples, furthermore, consist of a collection of conserved structural motifs rather than a coherent globally conserved secondary structure. As a consequence, the conceptually simple problem of homology search becomes a complex and technically demanding task. Despite the best efforts of databases such as Rfam, the situation is complicated further by the sparsity of information in many RNA families, in particular prokaryotic ones. In this contribution, we review recent efforts to customize sequence-based search tools for ncRNA applications. In particular, semi-global alignments and the development of methods for fragmented pattern search have brought significant practical advances. Current developments in this area focus on the integration of fragmented sequence pattern search with search algorithms for secondary structure patterns. We focus here, in particular, on strategies that can be successful in the 'twilight zone' where generic approaches, from BLAST to Infernal, start to fail.
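To make the notion of a semi-global alignment concrete, here is a minimal sketch in Python. It is not taken from the review: the function name `semiglobal_score` and the scoring values (match +1, mismatch -1, linear gap -2) are illustrative assumptions. The idea it shows is that the whole query must be aligned, while unaligned target sequence before and after the hit costs nothing.

```python
def semiglobal_score(query, target, match=1, mismatch=-1, gap=-2):
    """Best score for aligning all of `query` to some substring of `target`.

    Hypothetical helper for illustration; scoring values are assumptions.
    Returns (score, end), where `end` is the target position (exclusive)
    at which the best alignment stops.
    """
    # Row 0 is all zeros: skipping a prefix of the target is free.
    prev = [0] * (len(target) + 1)
    for i, q in enumerate(query, start=1):
        # Column 0: query characters with no target partner each cost a gap.
        curr = [i * gap]
        for j, t in enumerate(target, start=1):
            diag = prev[j - 1] + (match if q == t else mismatch)
            up = prev[j] + gap        # query character aligned to a gap
            left = curr[j - 1] + gap  # target character aligned to a gap
            curr.append(max(diag, up, left))
        prev = curr
    # Taking the maximum over the last row makes the target suffix free too.
    best_end = max(range(len(prev)), key=prev.__getitem__)
    return prev[best_end], best_end


if __name__ == "__main__":
    print(semiglobal_score("GAUC", "AAAGAUCAAA"))
```

A short query is thus scored over its full length even when it sits inside a much longer genomic region. A purely local alignment could instead report only a well-matching fragment of the query.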