Multiperm: shuffling multiple sequence alignments while approximately preserving dinucleotide frequencies
- PMID: 19136551
- DOI: 10.1093/bioinformatics/btp006
Multiperm: shuffling multiple sequence alignments while approximately preserving dinucleotide frequencies
Abstract
Summary: Assessing the statistical significance of structured RNA predicted from multiple sequence alignments relies on the existence of a good null model. We present here a random shuffling algorithm, Multiperm, that preserves not only the gap and local conservation structure in alignments of arbitrarily many sequences, but also the approximate dinucleotide frequencies. No shuffling algorithm that simultaneously preserves these three characteristics of a multiple (beyond pairwise) alignment has been available to date. As one benchmark, we show that it produces shuffled exonic sequences having folding free energy closer to native sequences than shuffled alignments that do not preserve dinucleotide frequencies.
Availability: The Multiperm GNU Cb++ source code is available at http://www.anandam.name/multiperm
Similar articles
-
Considerations in the identification of functional RNA structural elements in genomic alignments.BMC Bioinformatics. 2007 Jan 30;8:33. doi: 10.1186/1471-2105-8-33. BMC Bioinformatics. 2007. PMID: 17263882 Free PMC article.
-
A local multiple alignment method for detection of non-coding RNA sequences.Bioinformatics. 2009 Jun 15;25(12):1498-505. doi: 10.1093/bioinformatics/btp261. Epub 2009 Apr 17. Bioinformatics. 2009. PMID: 19376823
-
CentroidAlign: fast and accurate aligner for structured RNAs by maximizing expected sum-of-pairs score.Bioinformatics. 2009 Dec 15;25(24):3236-43. doi: 10.1093/bioinformatics/btp580. Epub 2009 Oct 6. Bioinformatics. 2009. PMID: 19808876
-
Energy-based RNA consensus secondary structure prediction in multiple sequence alignments.Methods Mol Biol. 2014;1097:125-41. doi: 10.1007/978-1-62703-709-9_7. Methods Mol Biol. 2014. PMID: 24639158 Review.
-
A memory efficient method for structure-based RNA multiple alignment.IEEE/ACM Trans Comput Biol Bioinform. 2012 Jan-Feb;9(1):1-11. doi: 10.1109/TCBB.2011.86. Epub 2011 Apr 29. IEEE/ACM Trans Comput Biol Bioinform. 2012. PMID: 21576754
Cited by
-
Prediction of conserved long-range RNA-RNA interactions in full viral genomes.Bioinformatics. 2016 Oct 1;32(19):2928-35. doi: 10.1093/bioinformatics/btw323. Epub 2016 Jun 10. Bioinformatics. 2016. PMID: 27288498 Free PMC article.
-
GraphClust2: Annotation and discovery of structured RNAs with scalable and accessible integrative clustering.Gigascience. 2019 Dec 1;8(12):giz150. doi: 10.1093/gigascience/giz150. Gigascience. 2019. PMID: 31808801 Free PMC article.
-
Alignment-free comparative genomic screen for structured RNAs using coarse-grained secondary structure dot plots.BMC Genomics. 2017 Dec 2;18(1):935. doi: 10.1186/s12864-017-4309-y. BMC Genomics. 2017. PMID: 29197323 Free PMC article.
-
Automated identification of RNA 3D modules with discriminative power in RNA structural alignments.Nucleic Acids Res. 2013 Dec;41(22):9999-10009. doi: 10.1093/nar/gkt795. Epub 2013 Sep 4. Nucleic Acids Res. 2013. PMID: 24005040 Free PMC article.
-
De novo prediction of structured RNAs from genomic sequences.Trends Biotechnol. 2010 Jan;28(1):9-19. doi: 10.1016/j.tibtech.2009.09.006. Epub 2009 Nov 26. Trends Biotechnol. 2010. PMID: 19942311 Free PMC article. Review.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Research Materials
Miscellaneous