Statistical inference for well-ordered structures in nucleotide sequences
- PMID: 16452793
Abstract
Distinct, local structures are frequently correlated with functional RNA elements involved in post-transcriptional regulation of gene expression. The discovery of microRNAs (miRNAs) suggests that there is a large class of small non-coding RNAs in eukaryotic genomes. These miRNAs have the potential to form distinct fold-back stem-loop structures. Predicting such well-ordered folding sequences (WFS) in genomic sequences helps in understanding RNA-based gene regulation and in identifying local RNA elements with structure-dependent functions. In this study, we describe a novel method for discovering local WFS in a nucleotide sequence by Monte Carlo simulation and RNA folding. In this approach, the quality of a local WFS is assessed by the energy difference (E(diff)) between the optimal structure folded in the local segment and its corresponding optimal restrained structure, in which all base pairings formed in the optimal structure are prohibited. Distinct WFS can be discovered by scanning successive segments along a sequence and comparing E(diff) of the natural sequence with the values computed from randomly shuffled sequences. Our results indicate that the statistically significant WFS detected in the Caenorhabditis elegans (C. elegans) genomic sequences F49E12, T07C5, T07D1, T10H9, Y56A3A and Y71G12B coincide with known fold-back stem-loops found in miRNA precursors. The potential and implications of our method for searching for miRNAs in genomes are discussed.
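The scanning procedure described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. As a loudly stated assumption, thermodynamic folding is replaced by a toy Nussinov-style model that scores -1 per base pair, so E(diff) reduces to the number of pairs lost once the optimal pairings are prohibited. The function names (`fold`, `e_diff`, `z_score`, `scan`), the window and step sizes, the simple mononucleotide shuffle, and the use of a standardized score to compare the natural segment with its shuffled versions are all hypothetical choices made for illustration.

```python
import random
from statistics import mean, stdev

# Toy pairing rules (Watson-Crick plus G-U wobble); an assumption of this sketch.
PAIRS = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G"), ("G", "U"), ("U", "G")}
MIN_LOOP = 3  # minimum number of unpaired bases enclosed by a pair


def _can_pair(seq, k, j, forbidden):
    return (seq[k], seq[j]) in PAIRS and (k, j) not in forbidden


def fold(seq, forbidden=frozenset()):
    """Nussinov-style fold: return (max pair count, set of (i, j) pairs).

    Stand-in for a real energy-based folding program; toy energy = -pairs.
    """
    n = len(seq)
    if n == 0:
        return 0, set()
    dp = [[0] * n for _ in range(n)]
    for span in range(MIN_LOOP + 1, n):
        for i in range(n - span):
            j = i + span
            best = dp[i][j - 1]  # case: position j is unpaired
            for k in range(i, j - MIN_LOOP):  # case: j pairs with k
                if _can_pair(seq, k, j, forbidden):
                    left = dp[i][k - 1] if k > i else 0
                    best = max(best, left + 1 + dp[k + 1][j - 1])
            dp[i][j] = best
    # Trace back one optimal structure.
    pairs, stack = set(), [(0, n - 1)]
    while stack:
        i, j = stack.pop()
        if j - i <= MIN_LOOP:
            continue
        if dp[i][j] == dp[i][j - 1]:
            stack.append((i, j - 1))
            continue
        for k in range(i, j - MIN_LOOP):
            if _can_pair(seq, k, j, forbidden):
                left = dp[i][k - 1] if k > i else 0
                if dp[i][j] == left + 1 + dp[k + 1][j - 1]:
                    pairs.add((k, j))
                    if k > i:
                        stack.append((i, k - 1))
                    stack.append((k + 1, j - 1))
                    break
    return dp[0][n - 1], pairs


def e_diff(seq):
    """Toy E(diff): pairs lost when every optimal pairing is prohibited."""
    n_opt, pairs = fold(seq)
    n_restrained, _ = fold(seq, frozenset(pairs))
    return n_opt - n_restrained


def z_score(seq, n_shuffles=50, rng=None):
    """Compare E(diff) of seq with shuffled versions (needs n_shuffles >= 2)."""
    rng = rng or random.Random(0)
    observed = e_diff(seq)
    null = []
    for _ in range(n_shuffles):
        letters = list(seq)
        rng.shuffle(letters)  # preserves base composition only
        null.append(e_diff("".join(letters)))
    sd = stdev(null)
    return 0.0 if sd == 0 else (observed - mean(null)) / sd


def scan(seq, window=30, step=10, n_shuffles=50, seed=0):
    """Slide a window along seq; return a list of (start, z-score) tuples."""
    rng = random.Random(seed)
    return [
        (start, z_score(seq[start:start + window], n_shuffles, rng))
        for start in range(0, len(seq) - window + 1, step)
    ]
```

In this sketch a call such as `scan(sequence, window=30, step=10)` returns one score per window, and windows with unusually high scores would be the candidate WFS. A realistic analysis would substitute a thermodynamic folding program for `fold` and would likely need far more shuffles per window than the small default used here.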