Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012 Mar;13(1):4-15.
doi: 10.2174/138920212799034749.

Measures of compositional strand bias related to replication machinery and its applications

Affiliations

Measures of compositional strand bias related to replication machinery and its applications

Kazuharu Arakawa et al. Curr Genomics. 2012 Mar.

Abstract

The compositional asymmetry of complementary bases in nucleotide sequences implies the existence of a mutational or selectional bias in the two strands of the DNA duplex, which is commonly shaped by strand-specific mechanisms in transcription or replication. Such strand bias in genomes, frequently visualized by GC skew graphs, is used for the computational prediction of transcription start sites and replication origins, as well as for comparative evolutionary genomics studies. The use of measures of compositional strand bias in order to quantify the degree of strand asymmetry is crucial, as it is the basis for determining the applicability of compositional analysis and comparing the strength of the mutational bias in different biological machineries in various species. Here, we review the measures of strand bias that have been proposed to date, including the ∆GC skew, the B(1) index, the predictability score of linear discriminant analysis for gene orientation, the signal-to-noise ratio of the oligonucleotide bias, and the GC skew index. These measures have been predominantly designed for and applied to the analysis of replication-related mutational processes in prokaryotes, but we also give research examples in eukaryotes.

Keywords: GC skew; Nucleotide composition bias; bacterial replication; replication-related mutations..

PubMed Disclaimer

Figures

Fig. (1)
Fig. (1)
Distribution of the S/N of oligomer skew in different phyla of eubacteria. The degree of replication-associated strand bias shows a characteristic distribution among the bacterial phyla, and Firmicutes is one of the most highly biased groups. The majority of the Firmicutes contain the polC subunit of the DNA polymerase, which results in a high gene strand bias. In comparison, proteobacteria belong to a group with a moderate degree of strand bias.
Fig. (2)
Fig. (2)
Clustering of the measures of strand bias and growth rate by Pearson correlations in 1083 bacterial genomes (only 254 genomes were used for 1/generation time). All strand bias measures are highly correlated (most above r > 0.70) and are also weakly correlated with properties related to the growth rate (r > 0.25).
Fig. (3)
Fig. (3)
Examples of GC skew graphs using the G-language GAE web service. G-language REST web service can be easily used by typing a URL in a browser starting with http://rest.g-language.org/ followed by a set of commands. Here, the GC skew graphs of Escherichia coli (NC_000913) are visualized using different sets of options, such as normal or wider windows (window=100000), a cumulative graph (cumulative=1), and the AT skew at a window size of 10000 bp (at=1, window=10000). Neither registration nor setup is required to use these services, allowing the user to readily make use of these tools for their research.

Similar articles

Cited by

References

    1. Nakabachi A, Yamashita A, Toh H, Ishikawa H, Dunbar H E, Moran N A, Hattori M. The 160-kilobase genome of the bacterial endosymbiont Carsonella. Science. 2006;314(5797 ):267. - PubMed
    1. Hildebrand F, Meyer A, Eyre-Walker A. Evidence of selection upon genomic GC-content in bacteria. PLoS Genet. 2010;6(9 ):e1001107. - PMC - PubMed
    1. Rocha E P, Danchin A. Base composition bias might result from competition for metabolic resources. Trends Genet. 2002;18(6 ):291–294. - PubMed
    1. Akashi H, Gojobori T. Metabolic efficiency and amino acid composition in the proteomes of Escherichia coli and Bacillus subtilis. Proc. Natl. Acad. Sci. U S A. 2002;99(6 ):3695–3700. - PMC - PubMed
    1. Mitchell D. GC content and genome length in Chargaff compliant genomes. Biochem. Biophys. Res. Commun. 2007;353(1 ):207–210. - PubMed

LinkOut - more resources