Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2003 Nov 11;100(23):13585-90.
doi: 10.1073/pnas.1735466100. Epub 2003 Oct 30.

Reevaluation of human cytomegalovirus coding potential

Affiliations

Reevaluation of human cytomegalovirus coding potential

Eain Murphy et al. Proc Natl Acad Sci U S A. .

Abstract

The Bio-Dictionary-based Gene Finder was used to reassess the coding potential of the AD169 laboratory strain of human cytomegalovirus and sequences in the Toledo strain that are missing in the laboratory strain of the virus. The gene-finder algorithm assesses the potential of an ORF to encode a protein based on matches to a database of amino acid patterns derived from a large collection of proteins. The algorithm was used to score all human cytomegalovirus ORFs with the potential to encode polypeptides >/=50 aa in length. As a further test for functionality, the genomes of the chimpanzee, rhesus, and murine cytomegaloviruses were searched for orthologues of the predicted human cytomegalovirus ORFs. The analysis indicates that 37 previously annotated ORFs ought to be discarded, and at least nine previously unrecognized ORFs with relatively strong coding potential should be added. Thus, the human cytomegalovirus genome appears to contain approximately 192 unique ORFs with the potential to encode a protein. Support for several of the predictions of our in silico analysis was obtained by sequencing several domains within a clinical isolate of human cytomegalovirus.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
BDGF analysis of HCMV ORFs. All previously annotated ORFs of HCMV are listed from high (upper left) to low (lower right) normalized BDGF (N-BDGF) scores. Expression of the ORF was scored as positive if a report exists in the published literature directly demonstrating the expression of a protein or showing that a mutation within the ORF generates a viral growth phenotype. NR indicates that no report was found. The presence of orthologues corresponding to HCMV ORFs within the genomes of chimpanzee cytomegalovirus (CCMV), rhesus cytomegalovirus (RhCMV), and murine cytomegalovirus (MCMV) are indicated. Green boxes designate characteristics favoring the conclusion that an ORF encodes a protein, and red boxes mark characteristics arguing that an ORF does not encode a protein. The length of the polypeptide potentially encoded by each ORF and the presence of a Kozak translational initiation motif (Kozak AUG) are provided for informational purposes. The UL65 ORF was reported to be expressed (26), but the reported polypeptide does not match the UL65 sequence (3).
Fig. 2.
Fig. 2.
Map of the HCMV genome. The green arrows represent previously annotated ORFs of HCMV likely to encode a bona fide polypeptide, red arrows designate ORFs unlikely to encode a polypeptide, and blue arrows indicate previously unrecognized ORFs that the present analysis predicts have high potential to encode proteins. The gray box marks the additional sequence found in the HCMV Toledo strain, locating it with respect to the AD169 genome. Rectangles superimposed on the line represent the sequence-identify terminal repeats. Each mark on the sequence line represents 1,000 bp.

References

    1. Britt, W. J. & Alford, C. A. (1996) in Fields Virology, eds. Fields, B. N., Knipe, D. M. & Howley, P. M. (Lippincott-Raven, Philadelphia), Vol. 2, pp. 2493-2524.
    1. Bankier, A. T., Beck, S., Bohni, R., Brown, C. M., Cerny, R., Chee, M. S., Hutchison, C. A., III, Kouzarides, T., Martignetti, J. A., Preddie, E., et al. (1991) DNA Sequences 2, 1-12. - PubMed
    1. Chee, M. S., Bankier, S., Beck, S., Bohni, R., Brown, C. R., Horsnell, T., Hutchisno, C. A., III, Kouzarides, T., Martignetti, J. A., Preddie, E., et al. (1990) Curr. Top. Microbiol. Immunol. 154, 125-169. - PubMed
    1. Baer, R., Bankier, A. T., Biggin, M. D., Deininger, P. L., Farrell, P. J., Gibson, T. J., Hatfull, G., Hudson, G. S., Satchwell, S. C., Seguin, C., et al. (1984) Nature 310, 207-211. - PubMed
    1. McGeoch, D. J., Dalrymple, M. A., Davison, A. J., Dolan, A., Frame, M. C., McNab, D., Perry, L. J., Scott, J. E. & Taylor, P. (1988) J. Gen. Virol. 69, 1531-1574. - PubMed

Publication types