CorGen--measuring and generating long-range correlations for DNA sequence analysis
- PMID: 16845099
- PMCID: PMC1538783
- DOI: 10.1093/nar/gkl234
Abstract
CorGen is a web server that measures long-range correlations in the base composition of DNA and generates random sequences with the same correlation parameters. Long-range correlations are characterized by a power-law decay of the autocorrelation function of the GC content. Because such correlations are widespread in eukaryotic genomes, accurate null models of eukaryotic DNA in computational biology need to incorporate them. For example, the score statistics of sequence alignment and the performance of motif-finding algorithms are significantly affected by the presence of genomic long-range correlations. We use an expansion-randomization dynamics to efficiently generate the correlated random sequences. The server is available at http://corgen.molgen.mpg.de.
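As an illustration of the quantity the abstract refers to, the following minimal sketch estimates the autocorrelation function of the GC content, C(r) = <x_i x_{i+r}> - <x>^2, where x_i = +1 for G or C and x_i = -1 for A or T. The +1/-1 encoding, the function names `gc_indicator` and `gc_autocorrelation`, and the handling of ambiguous symbols are assumptions made for this example; it is not the CorGen server's implementation. A long-range correlated sequence would show C(r) decaying roughly as a power law in r, which appears as a straight line on a log-log plot.

```python
import random


def gc_indicator(seq):
    """Map a DNA string to +1 (G or C) and -1 (A or T).

    Assumption for this sketch: any other symbol (e.g. N) is skipped,
    which slightly distorts distances around the skipped positions.
    """
    values = []
    for base in seq.upper():
        if base in "GC":
            values.append(1)
        elif base in "AT":
            values.append(-1)
    return values


def gc_autocorrelation(seq, max_lag):
    """Return [C(1), ..., C(max_lag)] with C(r) = <x_i x_{i+r}> - <x>^2."""
    x = gc_indicator(seq)
    n = len(x)
    if max_lag < 1 or max_lag >= n:
        raise ValueError("max_lag must satisfy 1 <= max_lag < sequence length")
    mean = sum(x) / n
    corr = []
    for r in range(1, max_lag + 1):
        pairs = n - r  # number of site pairs separated by distance r
        mean_product = sum(x[i] * x[i + r] for i in range(pairs)) / pairs
        corr.append(mean_product - mean * mean)
    return corr


if __name__ == "__main__":
    # An uncorrelated random sequence: C(r) should fluctuate around zero.
    random.seed(0)
    sequence = "".join(random.choice("ACGT") for _ in range(5000))
    for lag, value in enumerate(gc_autocorrelation(sequence, 5), start=1):
        print(lag, round(value, 4))
```

For real genomic data, the direct double loop becomes slow at large lags; an FFT-based estimate would be the usual replacement, at the cost of an extra dependency.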