Efficient pairwise RNA structure prediction using probabilistic alignment constraints in Dynalign
- PMID: 17445273
- PMCID: PMC1868766
- DOI: 10.1186/1471-2105-8-130
Efficient pairwise RNA structure prediction using probabilistic alignment constraints in Dynalign
Abstract
Background: Joint alignment and secondary structure prediction of two RNA sequences can significantly improve the accuracy of the structural predictions. Methods addressing this problem, however, are forced to employ constraints that reduce computation by restricting the alignments and/or structures (i.e. folds) that are permissible. In this paper, a new methodology is presented for the purpose of establishing alignment constraints based on nucleotide alignment and insertion posterior probabilities. Using a hidden Markov model, posterior probabilities of alignment and insertion are computed for all possible pairings of nucleotide positions from the two sequences. These alignment and insertion posterior probabilities are additively combined to obtain probabilities of co-incidence for nucleotide position pairs. A suitable alignment constraint is obtained by thresholding the co-incidence probabilities. The constraint is integrated with Dynalign, a free energy minimization algorithm for joint alignment and secondary structure prediction. The resulting method is benchmarked against the previous version of Dynalign and against other programs for pairwise RNA structure prediction.
Results: The proposed technique eliminates manual parameter selection in Dynalign and provides significant computational time savings in comparison to prior constraints in Dynalign while simultaneously providing a small improvement in the structural prediction accuracy. Savings are also realized in memory. In experiments over a 5S RNA dataset with average sequence length of approximately 120 nucleotides, the method reduces computation by a factor of 2. The method performs favorably in comparison to other programs for pairwise RNA structure prediction: yielding better accuracy, on average, and requiring significantly lesser computational resources.
Conclusion: Probabilistic analysis can be utilized in order to automate the determination of alignment constraints for pairwise RNA structure prediction methods in a principled fashion. These constraints can reduce the computational and memory requirements of these methods while maintaining or improving their accuracy of structural prediction. This extends the practical reach of these methods to longer length sequences. The revised Dynalign code is freely available for download.
Figures










Similar articles
-
Efficient pairwise RNA structure prediction and alignment using sequence alignment constraints.BMC Bioinformatics. 2006 Sep 4;7:400. doi: 10.1186/1471-2105-7-400. BMC Bioinformatics. 2006. PMID: 16952317 Free PMC article.
-
PARTS: probabilistic alignment for RNA joinT secondary structure prediction.Nucleic Acids Res. 2008 Apr;36(7):2406-17. doi: 10.1093/nar/gkn043. Epub 2008 Feb 26. Nucleic Acids Res. 2008. PMID: 18304945 Free PMC article.
-
Predicting a set of minimal free energy RNA secondary structures common to two sequences.Bioinformatics. 2005 May 15;21(10):2246-53. doi: 10.1093/bioinformatics/bti349. Epub 2005 Feb 24. Bioinformatics. 2005. PMID: 15731207
-
Detection of non-coding RNAs on the basis of predicted secondary structure formation free energy change.BMC Bioinformatics. 2006 Mar 27;7:173. doi: 10.1186/1471-2105-7-173. BMC Bioinformatics. 2006. PMID: 16566836 Free PMC article.
-
Energy-based RNA consensus secondary structure prediction in multiple sequence alignments.Methods Mol Biol. 2014;1097:125-41. doi: 10.1007/978-1-62703-709-9_7. Methods Mol Biol. 2014. PMID: 24639158 Review.
Cited by
-
Alignments of biomolecular contact maps.Interface Focus. 2021 Jun 11;11(4):20200066. doi: 10.1098/rsfs.2020.0066. eCollection 2021 Jun. Interface Focus. 2021. PMID: 34123355 Free PMC article.
-
A review of three different studies on hidden markov models for epigenetic problems: a computational perspective.Genomics Inform. 2014 Dec;12(4):145-50. doi: 10.5808/GI.2014.12.4.145. Epub 2014 Dec 31. Genomics Inform. 2014. PMID: 25705151 Free PMC article. Review.
-
RNAstructure: software for RNA secondary structure prediction and analysis.BMC Bioinformatics. 2010 Mar 15;11:129. doi: 10.1186/1471-2105-11-129. BMC Bioinformatics. 2010. PMID: 20230624 Free PMC article.
-
Identification of new high affinity targets for Roquin based on structural conservation.Nucleic Acids Res. 2018 Dec 14;46(22):12109-12125. doi: 10.1093/nar/gky908. Nucleic Acids Res. 2018. PMID: 30295819 Free PMC article.
-
Stochastic sampling of the RNA structural alignment space.Nucleic Acids Res. 2009 Jul;37(12):4063-75. doi: 10.1093/nar/gkp276. Epub 2009 May 8. Nucleic Acids Res. 2009. PMID: 19429694 Free PMC article.
References
-
- Durbin R, Eddy SR, Krogh A, Mitchison G. Biological Sequence Analysis : Probabilistic Models of Proteins and Nucleic Acids. Cambridge, UK: Cambridge University Press; 1999.
-
- Pace NR, Thomas BC, Woese CR. The RNA World. second. Cold Spring Harbor Laboratory Press; 1999. Probing RNA structure, function and history by comparative analysis; pp. 113–141.
-
- Sankoff D. Simultaneous Solution of RNA Folding, Alignment and Protosequence Problems. SIAM J App Math. 1985;8(5):810–825. doi: 10.1137/0145048. - DOI
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources