From GC skews to wavelets: a gentle guide to the analysis of compositional asymmetries in genomic data
- PMID: 17988781
- DOI: 10.1016/j.biochi.2007.09.015
Abstract
Compositional asymmetries are pervasive in DNA sequences. They result from asymmetric interactions between DNA and cellular mechanisms such as replication and transcription. Here, we review many of the methods that have been proposed over the years to analyse compositional asymmetries in DNA sequences. These include GC skews, oligonucleotide skews and wavelets, which, among other uses, have been extensively employed to delimit origins and termini of replication in genomes. We also review the use of multivariate methods, such as factorial correspondence analysis, discriminant analysis and analysis of variance, which allow compositional strand asymmetries to be assigned to the different biological processes shaping sequence composition. Finally, we review methods that have been used to infer substitution matrices and that help to understand the mutational processes underlying strand asymmetry. We focus on replication asymmetries because they have been more thoroughly studied, but the methods can be, and often are, adapted to other problems. Although strand asymmetry has most often been studied through compositional skews of nucleotides or oligonucleotides, we recall that, depending on the goal of the analysis, other methods may be more appropriate for answering certain biological questions. We also refer to freely available programs for analysing strand asymmetry.
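As an illustration of the simplest method named in the abstract, the following is a minimal Python sketch of a windowed GC skew and its cumulative sum. The abstract does not spell out a formula, so the sketch assumes the commonly used definition (G - C)/(G + C) per window. The function names `gc_skew`, `cumulative_skew` and `skew_extrema`, and the default window size, are hypothetical choices made for this example and are not taken from the article. Reading the minimum and maximum of the cumulative skew as candidate origin and terminus positions is a heuristic that holds in many bacterial genomes, not a guarantee.

```python
def gc_skew(seq, window=1000):
    """Return (G - C) / (G + C) for consecutive non-overlapping windows.

    Assumed definition of GC skew; windows without G or C get a skew of 0.0.
    A trailing partial window is ignored.
    """
    seq = seq.upper()
    skews = []
    for start in range(0, len(seq) - window + 1, window):
        chunk = seq[start:start + window]
        g = chunk.count("G")
        c = chunk.count("C")
        skews.append((g - c) / (g + c) if g + c else 0.0)
    return skews


def cumulative_skew(skews):
    """Return the running sum of a list of per-window skews."""
    total = 0.0
    running = []
    for value in skews:
        total += value
        running.append(total)
    return running


def skew_extrema(seq, window=1000):
    """Return (position of cumulative minimum, position of cumulative maximum).

    Positions are the end coordinates of the windows where the extrema occur.
    """
    cum = cumulative_skew(gc_skew(seq, window))
    if not cum:
        raise ValueError("sequence is shorter than one window")
    i_min = min(range(len(cum)), key=cum.__getitem__)
    i_max = max(range(len(cum)), key=cum.__getitem__)
    return (i_min + 1) * window, (i_max + 1) * window


# Toy sequence whose skew changes sign twice (illustrative data only).
toy = "G" * 30 + "C" * 40 + "G" * 10
print(skew_extrema(toy, window=10))
```

For a real chromosome the window size would need tuning to the genome length, and a circular genome may need to be rotated before the extrema are interpreted.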
Similar articles
- Replication-associated mutational asymmetry in the human genome. Mol Biol Evol. 2011 Aug;28(8):2327-37. doi: 10.1093/molbev/msr056. Epub 2011 Mar 2. PMID: 21368316
- A new method for assessing the effect of replication on DNA base composition asymmetry. Mol Biol Evol. 2007 Oct;24(10):2169-79. doi: 10.1093/molbev/msm148. Epub 2007 Jul 23. PMID: 17646257
- Replication-associated strand asymmetries in vertebrate genomes and implications for replicon size, DNA replication origin, and termination. Biochem Biophys Res Commun. 2006 Jun 16;344(4):1258-62. doi: 10.1016/j.bbrc.2006.04.039. Epub 2006 Apr 24. PMID: 16650814
- Strand asymmetries across genomic processes. Comput Struct Biotechnol J. 2023 Mar 11;21:2036-2047. doi: 10.1016/j.csbj.2023.03.007. eCollection 2023. PMID: 36968020 Free PMC article. Review.
- Strand asymmetries in DNA evolution. Trends Genet. 1997 Jun;13(6):240-5. doi: 10.1016/S0168-9525(97)01118-9. PMID: 9196330 Review.
Cited by
- A rolling circle replication mechanism produces multimeric lariats of mitochondrial DNA in Caenorhabditis elegans. PLoS Genet. 2015 Feb 18;11(2):e1004985. doi: 10.1371/journal.pgen.1004985. eCollection 2015 Feb. PMID: 25693201 Free PMC article.
- Long-range bidirectional strand asymmetries originate at CpG islands in the human genome. Genome Biol Evol. 2009 Aug 3;1:189-97. doi: 10.1093/gbe/evp024. PMID: 20333189 Free PMC article.
- Measures of compositional strand bias related to replication machinery and its applications. Curr Genomics. 2012 Mar;13(1):4-15. doi: 10.2174/138920212799034749. PMID: 22942671 Free PMC article.
- Genome Projector: zoomable genome map with multiple views. BMC Bioinformatics. 2009 Jan 23;10:31. doi: 10.1186/1471-2105-10-31. PMID: 19166610 Free PMC article.
- Asymmetry indices for analysis and prediction of replication origins in eukaryotic genomes. PLoS One. 2012;7(9):e45050. doi: 10.1371/journal.pone.0045050. Epub 2012 Sep 27. PMID: 23028755 Free PMC article.