Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2022 Apr 29;23(9):4943.
doi: 10.3390/ijms23094943.

HPV16 and HPV18 Genome Structure, Expression, and Post-Transcriptional Regulation

Affiliations
Review

HPV16 and HPV18 Genome Structure, Expression, and Post-Transcriptional Regulation

Lulu Yu et al. Int J Mol Sci. .

Erratum in

Abstract

Human papillomaviruses (HPV) are a group of small non-enveloped DNA viruses whose infection causes benign tumors or cancers. HPV16 and HPV18, the two most common high-risk HPVs, are responsible for ~70% of all HPV-related cervical cancers and head and neck cancers. The expression of the HPV genome is highly dependent on cell differentiation and is strictly regulated at the transcriptional and post-transcriptional levels. Both HPV early and late transcripts differentially expressed in the infected cells are intron-containing bicistronic or polycistronic RNAs bearing more than one open reading frame (ORF), because of usage of alternative viral promoters and two alternative viral RNA polyadenylation signals. Papillomaviruses proficiently engage alternative RNA splicing to express individual ORFs from the bicistronic or polycistronic RNA transcripts. In this review, we discuss the genome structures and the updated transcription maps of HPV16 and HPV18, and the latest research advances in understanding RNA cis-elements, intron branch point sequences, and RNA-binding proteins in the regulation of viral RNA processing. Moreover, we briefly discuss the epigenetic modifications, including DNA methylation and possible APOBEC-mediated genome editing in HPV infections and carcinogenesis.

Keywords: RNA polyadenylation; RNA splicing; epigenetic modification; genome structure; papillomaviruses.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
Papillomavirus genome structure, annotated ORFs, and conserved ACS in the origin of replication (Ori). (a) HPV16 genome and annotated ORFs. The non-structural proteins from the early region of the genome, including E1, E2, E1^E4, E5, E6, E7, and E8^E2, are shown in purple. The viral capsid proteins, L1 and L2, from the late region of the genome are shown in blue. URR (yellow), upstream regulatory region. PE and PL mark the early and late promoters, and pAE and pAL stand for the early and late polyadenylation sites. (b) Conservation of the mammalian ARS consensus sequence (ACS) [16] in the Ori of selected papillomavirus genomes. Shown in this panel are conserved bases with position weight matrix (sequence logo bits) of the 167 predicted ACS elements [16] in comparison with the viral Ori sequences from the selected viral URR tail (red)-head (black) regions. Adapted with permission from Ref. [16]. Copyright 2018 Springer Nature. ARS—autonomously replicating sequence.
Figure 2
Figure 2
Translation modes of HPV proteins from polycistronic viral transcripts. (a) Diagram of a polycistronic mRNA harboring multiple ORFs (ORF A–D), of which only the first ORF A is translated by eukaryotic ribosomes (in grey) into a functional protein. (bd) Strategies utilized by papillomaviruses to express viral proteins from polycistronic transcripts by possible ribosome leaking scanning (b), alternative splicing (c), or usage of alternative frames created by RNA splicing (d).
Figure 3
Figure 3
The transcription map of HPV16 updated from Ref. [44]. The HPV16 genome in a linear form diagramed as a bracket line consists of the upstream regulatory region (URR, dark grey box), early region (white box) and late region (light grey box) with nucleotide positions of viral promoters (P), early (pAE) and late (pAL) polyadenylation signals, and polyadenylation cleavage sites (Cs). The diagram is not to scale. The predicted viral ORFs with nucleotide positions in the viral genome and the corresponding frame used for their translation are shown above. Below the linearized genome are the mapped viral transcripts with identified exons (thick lines) and introns (thin lines). The numbers represent the nucleotide positions of splicing donor sites (5′ splice site) and splicing acceptor sites (3′ splice site). On the right are coding potentials for each transcript, with the most abundant (>60%) transcripts labeled in red color, and less abundant, but detectable ones (<5%), being underlined.
Figure 4
Figure 4
The transcription map of HPV18 updated from Ref. [56]. See the HPV16 transcription map for other description details.
Figure 5
Figure 5
Regulation of E6 and E7 translation from HPV16 and HPV18 polycistronic E6E7 transcripts by alternative RNA splicing. (a) Diagrams showing E6 translation from the unspliced E6E7 RNA and (b) E7 translation from the spliced E6*I RNA by translation re-initiation.
Figure 6
Figure 6
Regulation of HPV18 and HPV16 RNA splicing and polyadenylation by RNA cis-elements and host RBPs. (a) Diagrams (not to scale) showing the exonic (colored boxes) and intronic (thick lines) regions of HPV18 and HPV16 transcripts. The major alternative splicing events are depicted by dashed lines with the nucleotide positions of splicing sites shown below. Exonic splicing enhancers (ESE), exonic splicing silencers (ESS), intronic splicing enhancers (ISE), and negative regulatory elements (NRE) are shown in green for positive (+) and in red (−) for negative regulation. The black arrows show the host RBPs binding to individual cis-elements. The mapped branch site A in the intron branch point sequence is indicated by its genome position in the viral early transcript. (b) Depiction of HPV-infected epithelium with productive HPV infection (on the left) or with HPV-transformed cells invasive of the basement membrane (on the right). HPV-infected cells expressing viral early proteins are shown in red and the cells expressing HPV late proteins in light green. The changes in the expression of viral proteins and host splicing factors (SR proteins) are on the right of each diagram.

Similar articles

Cited by

References

    1. McBride A.A. Human papillomaviruses: Diversity, infection and host interactions. Nat. Rev. Microbiol. 2022;20:95–108. doi: 10.1038/s41579-021-00617-5. - DOI - PubMed
    1. Tirosh O., Conlan S., Deming C., Lee-Lin S.Q., Huang X., Su H.C., Freeman A.F., Segre J.A., Kong H.H. Expanded skin virome in DOCK8-deficient patients. Nat. Med. 2018;24:1815–1821. doi: 10.1038/s41591-018-0211-7. - DOI - PMC - PubMed
    1. Pastrana D.V., Peretti A., Welch N.L., Borgogna C., Olivero C., Badolato R., Notarangelo L.D., Gariglio M., FitzGerald P.C., McIntosh C.E., et al. Metagenomic Discovery of 83 New Human Papillomavirus Types in Patients with Immunodeficiency. mSphere. 2018;3:e00645-18. doi: 10.1128/mSphereDirect.00645-18. - DOI - PMC - PubMed
    1. De Martel C., Georges D., Bray F., Ferlay J., Clifford G.M. Global burden of cancer attributable to infections in 2018: A worldwide incidence analysis. Lancet Glob. Health. 2020;8:e180–e190. doi: 10.1016/S2214-109X(19)30488-7. - DOI - PubMed
    1. Sabatini M.E., Chiocca S. Human papillomavirus as a driver of head and neck cancers. Br. J. Cancer. 2020;122:306–314. doi: 10.1038/s41416-019-0602-7. - DOI - PMC - PubMed

MeSH terms

LinkOut - more resources