Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2013 Jan;41(Database issue):D571-8.
doi: 10.1093/nar/gks984. Epub 2012 Oct 23.

The Papillomavirus Episteme: a central resource for papillomavirus sequence data and analysis

Affiliations

The Papillomavirus Episteme: a central resource for papillomavirus sequence data and analysis

Koenraad Van Doorslaer et al. Nucleic Acids Res. 2013 Jan.

Abstract

The goal of the Papillomavirus Episteme (PaVE) is to provide an integrated resource for the analysis of papillomavirus (PV) genome sequences and related information. The PaVE is a freely accessible, web-based tool (http://pave.niaid.nih.gov) created around a relational database, which enables storage, analysis and exchange of sequence information. From a design perspective, the PaVE adopts an Open Source software approach and stresses the integration and reuse of existing tools. Reference PV genome sequences have been extracted from publicly available databases and reannotated using a custom-created tool. To date, the PaVE contains 241 annotated PV genomes, 2245 genes and regions, 2004 protein sequences and 47 protein structures, which users can explore, analyze or download. The PaVE provides scientists with the data and tools needed to accelerate scientific progress for the study and treatment of diseases caused by PVs.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Papillomavirus Genomic Organization. The double-stranded, circular genomes of PVs are ∼7–8 kb in length. All PVs encode the E1 and E2 replication proteins and the L1 and L2 capsid proteins. In addition to these core proteins, most viruses encode auxiliary proteins E5, E6 and E7, which manipulate host cell proliferation and cell cycle checkpoints. The URR is located between the L1 and E6 ORFs. This region contains viral promoters, enhancers and the replication origin.
Figure 2.
Figure 2.
Genome annotation pipeline employed in the PaVE: the flowchart outlines the different steps used to curate the annotation of viral ORFs. See the main text for a detailed description.
Figure 3.
Figure 3.
The PaVE locus viewer. The PaVE locus view shows a linear representation of the HPV16REF genome. Different annotations are displayed and can be selected. The annotated features include ORFs, spliced proteins, protein-binding site motifs and the viral URR. Upon selection, additional information is displayed in the ‘selected feature details’ window. The ‘coding region sequences’ window shows the nucleotide sequence for the selected ORF. Protein sequences can be directly compared with others in the PaVE database using the provided link to BLAST.
Figure 4.
Figure 4.
The PaVE structure viewer. The HPV16 L1 structure (PDB No. 1DZL) is shown within the PaVE structure viewer. The PaVE structure viewer consists of five different modules. Module (A) shows basic information about the structure under investigation and includes links to the locus viewer and the original PDB file. Module (B) is constructed around a fully functional Jmol Console (www.Jmol.org), allowing manipulation of the viral structure. For some viral proteins, the structures have been solved for several homologous proteins isolated from different viral types. These alternative structures can be selected using a pull down menu in (C). A unique feature of the PaVE structure viewer is the pairwise alignment shown in (D). Under default conditions, this alignment shows the sequence of the PaVE RefClone protein to the sequence present in the PDB file. However, users can use module (E) to compare homologous sequences with the displayed structure. Statistics derived from the BLASTp alignment are provided and differences or identities can be highlighted on the alignment. Specific residues or groups of residues can be selected and highlighted on the structure.

References

    1. de Villiers EM, Fauquet C, Broker TR, Bernard HU, zur Hausen H. Classification of papillomaviruses. Virology. 2004;324:17–27. - PubMed
    1. Bernard HU, Burk RD, Chen Z, Van Doorslaer K, zur Hausen H, de Villiers EM. Classification of papillomaviruses (PVs) based on 189 PV types and proposal of taxonomic amendments. Virology. 2010;401:70–79. - PMC - PubMed
    1. Trottier H, Franco EL. The epidemiology of genital human papillomavirus infection. vaccine. 2006;24(Suppl. 1):S1–15. - PubMed
    1. Bouvard V, Baan R, Straif K, Grosse Y, Secretan B, El GF, Benbrahim-Tallaa L, Guha N, Freeman C, Galichet L, et al. A review of human carcinogens—Part B: biological agents. Lancet Oncol. 2009;10:321–322. - PubMed
    1. Garcia-Vallve S, Alonso A, Bravo IG. Papillomaviruses: different genes have different histories. Trends Microbiol. 2005;13:514–521. - PubMed

Publication types