Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jul 29:9:101803.
doi: 10.1016/j.mex.2022.101803. eCollection 2022.

CoCoView - A codon conservation viewer via sequence logos

Affiliations

CoCoView - A codon conservation viewer via sequence logos

Beatriz Rodrigues Estevam et al. MethodsX. .

Abstract

Sequence logos are a simple way to display a set of aligned sequences, and they are useful to identify conserved patterns. Since their introduction, several tools have been developed for generating these representations at the single residue level (amino acids or nucleotides). We have developed a tool to build sequence logos of protein-coding sequences at the codon level, allowing more accurate analysis of coding-sequences as they represent synonymous and non-synonymous changes instead of showing only changes that imply on amino acid substitutions. We built CoCoView on top of the Logomaker Python API. It creates codon sequence logos from a multiple sequence alignment of protein-coding sequences. Some properties of the data and the generated logos can be controlled by the end-users, such as data redundancy, plot type and alphabet color. • Split aligned sequences into codon positions; • For each position compute codon frequency and information content; • Use the computed information to plot the graphic.

Keywords: Codons representation; Consensus sequence; Conserved patterns; Information theory.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

Image, graphical abstract
Graphical abstract
Fig 1
Fig. 1
CoCoView logo based on a multiple sequence alignment of a region of AP2 transcription factor coding sequences from Nicotiana tabacum. (A) Sequence logo generated using WebLogo , representing a per-nucleotide analysis. (B) Sequence logo generated using CoCoView (per-codon analysis). A per-nucleotide analysis could erroneously suggest that some codons are common, which can be ruled out on a per-codon visualization. Exemplified by the codon “GAT”, at the position highlighted in gray on both sequence logos, which can be interpreted as a common codon in the per-nucleotide analysis. However, in the per-codon analysis, this codon does not occur at this position.

References

    1. Schneider T.D., Stephens R.M. Sequence logos: a new way to display consensus sequences. Nucleic Acids Res. 1990;18:6097–6100. 10.1093%2Fnar%2F18.20.6097. - PMC - PubMed
    1. Schneider T.D. In: J. Chela-Flores. Raulin F., editor. Springer; Netherlands, Dordrecht: 1996. New approaches in mathematical biology: information theory and molecular machines; pp. 313–321. Chem. Evol. Phys. Orig. Evol. Life Proc. Fourth Trieste Conf. Chem. Evol. Trieste Italy 4–8 Sept. 1995. - DOI
    1. Schneider T.D. Sequence logos, machine/channel capacity, Maxwell's demon, and molecular computers: a review of the theory of molecular machines. Nanotechnology. 1994;5:1–18. doi: 10.1088/0957-4484/5/1/001. - DOI
    1. Crooks G.E., Hon G., Chandonia J.-M., Brenner S.E. WebLogo: a sequence logo generator. Genome Res. 2004;14:1188–1190. doi: 10.1101/gr.849004. - DOI - PMC - PubMed
    1. Walsh I.M., Bowman M.A., Soto Santarriaga I.F., Rodriguez A., Clark P.L. Synonymous codon substitutions perturb cotranslational protein folding in vivo and impair cell fitness. Proc. Natl. Acad. Sci. 2020;117:3528–3534. doi: 10.1073/pnas.1907126117. - DOI - PMC - PubMed

LinkOut - more resources