Nucleic Acids Res. 2011 Jan;39(Database issue):D1073-8.
doi: 10.1093/nar/gkq944. Epub 2010 Oct 19.

Protegen: a web-based protective antigen database and analysis system

Brian Yang et al. Nucleic Acids Res. 2011 Jan.

Abstract

Protective antigens are specifically targeted by the acquired immune response of the host and are able to induce protection in the host against infectious and non-infectious diseases. Protective antigens play important roles in vaccine development, as biological markers for disease diagnosis, and in the analysis of fundamental host immunity against diseases. Protegen is a web-based central database and analysis system that curates, stores and analyzes protective antigens. Basic antigen information and experimental evidence are curated from peer-reviewed articles. More detailed gene/protein information (e.g. DNA and protein sequences, and COG classification) is automatically extracted from existing databases using internally developed scripts. Bioinformatics programs are also applied to compute different antigen features, such as protein weight and pI, and the subcellular localization of bacterial proteins. Presently, 590 protective antigens have been curated against over 100 infectious diseases caused by pathogens and non-infectious diseases (including cancers and allergies). A user-friendly web query and visualization interface has been developed for interactive protective antigen search. A customized BLAST sequence similarity search has also been developed for analysis of new sequences provided by users. To support data exchange, protective antigen information is stored in the Vaccine Ontology (VO) in OWL format and can also be exported to FASTA and Excel files. Protegen is publicly available at http://www.violinet.org/protegen.
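As a rough illustration of two operations the abstract mentions, computing protein weight and exporting sequences to FASTA, the following is a minimal Python sketch. It is not the Protegen implementation: the function names `protein_weight` and `to_fasta` are hypothetical, the weight is the usual sum of average amino acid residue masses plus one water molecule, and only the 20 standard residues are supported. pI estimation is omitted because it depends on a choice of pKa table that the abstract does not specify.

```python
# Average residue masses in daltons for the 20 standard amino acids
# (mass of each amino acid minus one water, as found in a peptide chain).
AVERAGE_RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER_MASS = 18.01528  # one water is added back for the free N- and C-termini


def protein_weight(sequence: str) -> float:
    """Return the average molecular weight (Da) of an unmodified protein."""
    seq = sequence.strip().upper()
    if not seq:
        raise ValueError("empty sequence")
    try:
        return sum(AVERAGE_RESIDUE_MASS[aa] for aa in seq) + WATER_MASS
    except KeyError as exc:
        raise ValueError(f"unsupported residue: {exc.args[0]!r}") from None


def to_fasta(identifier: str, sequence: str, width: int = 60) -> str:
    """Format one sequence as a FASTA record with fixed-width sequence lines."""
    seq = sequence.strip().upper()
    lines = [seq[i:i + width] for i in range(0, len(seq), width)]
    return ">" + identifier + "\n" + "\n".join(lines) + "\n"
```

For example, `protein_weight("G")` gives roughly 75.07 Da, the molecular weight of free glycine, which is a quick sanity check on the residue table.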

Figures

Figure 1.
Overall design and architecture of the semi-automatic annotation of protective antigens in Protegen. Manual curation draws on peer-reviewed publications from PubMed. A PubMed ID (PMID) is extracted and used to retrieve detailed citation information (e.g. authors, journal, and date). The evidence establishing each protein's status as a protective antigen is curated from published experimental studies. Vaccines associated with the protective antigens are also curated. PDB IDs are manually retrieved when available to provide 3D structure information for individual protective antigens. An internally developed script uses an input sequence ID from an NCBI database (e.g. the NCBI Entrez Gene database) to automatically retrieve different types of information. The extracted DNA and protein sequences are then used for bioinformatics analyses with different methods.
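The PMID-to-citation step in this caption can be sketched with the public NCBI E-utilities ESummary endpoint. This is an assumption about one way to do it, not the internally developed Protegen script. The helper names (`esummary_url`, `parse_citation`, `fetch_citation`) are hypothetical, and the JSON field names used here (`result`, `authors`, `source`, `pubdate`) reflect the ESummary response layout as understood at the time of writing and should be checked against a live response.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

ESUMMARY_URL = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esummary.fcgi"


def esummary_url(pmid: str) -> str:
    """Build an ESummary request URL for one PubMed ID."""
    query = urlencode({"db": "pubmed", "id": pmid, "retmode": "json"})
    return ESUMMARY_URL + "?" + query


def parse_citation(payload: dict, pmid: str) -> dict:
    """Pull authors, journal and date out of a decoded ESummary payload.

    Assumes the record for each PMID sits under payload["result"][pmid].
    """
    record = payload["result"][pmid]
    return {
        "pmid": pmid,
        "authors": [author["name"] for author in record.get("authors", [])],
        "journal": record.get("source", ""),
        "date": record.get("pubdate", ""),
    }


def fetch_citation(pmid: str, timeout: float = 10.0) -> dict:
    """Fetch and parse citation details for one PMID (needs network access)."""
    with urlopen(esummary_url(pmid), timeout=timeout) as response:
        payload = json.load(response)
    return parse_citation(payload, pmid)
```

Keeping the parsing separate from the network call means the extraction logic can be checked against a saved response without contacting NCBI.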
Figure 2.
Example of protective antigen query and BLAST sequence similarity analysis. A COG category search of ‘Cell wall/membrane/envelope biogenesis’ in conjunction with a subcellular localization search of ‘Outer Membrane’ (A) identified 11 genes from the Protegen database, including Pla from Yersinia pestis strain CO92, and Pal from Haemophilus influenzae strain 86-028NP (B). Clicking the Protegen antigen ID associated with Pla provided curated data including the sequence strain, NCBI Gene GI, NCBI Protein GI, protein name, NCBI taxonomy ID, DNA and protein sequences as well as other information (C). A BLAST sequence similarity analysis of the DNA sequence produced multiple hits with significant alignments (D).
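The combined query in panel A amounts to a conjunction of two field filters. The sketch below shows that logic over an in-memory list. The `Antigen` record layout and the `search` function are hypothetical, not the Protegen schema or query interface; the Pla and Pal entries follow the caption, and the third entry is a made-up placeholder included only to show a non-match.

```python
from dataclasses import dataclass
from typing import Iterable, List, Optional


@dataclass(frozen=True)
class Antigen:
    name: str
    organism: str
    cog_category: str
    localization: str


def search(antigens: Iterable[Antigen],
           cog_category: Optional[str] = None,
           localization: Optional[str] = None) -> List[Antigen]:
    """Return antigens matching every filter that was supplied (logical AND)."""
    def matches(antigen: Antigen) -> bool:
        cog_ok = (cog_category is None
                  or antigen.cog_category.casefold() == cog_category.casefold())
        loc_ok = (localization is None
                  or antigen.localization.casefold() == localization.casefold())
        return cog_ok and loc_ok

    return [antigen for antigen in antigens if matches(antigen)]


ENVELOPE = "Cell wall/membrane/envelope biogenesis"
RECORDS = [
    Antigen("Pla", "Yersinia pestis CO92", ENVELOPE, "Outer Membrane"),
    Antigen("Pal", "Haemophilus influenzae 86-028NP", ENVELOPE, "Outer Membrane"),
    # Placeholder record, not a real Protegen entry.
    Antigen("ExampleAntigen", "Example organism", "Example category", "Cytoplasmic"),
]

hits = search(RECORDS, cog_category=ENVELOPE, localization="outer membrane")
print([antigen.name for antigen in hits])
```

Matching is case-insensitive here so that 'Outer Membrane' and 'outer membrane' are treated alike; leaving a filter as `None` skips it.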
