Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Jul 6:2010:baq004.
doi: 10.1093/database/baq004.

OpenFluDB, a database for human and animal influenza virus

Affiliations

OpenFluDB, a database for human and animal influenza virus

Robin Liechti et al. Database (Oxford). .

Abstract

Although research on influenza lasted for more than 100 years, it is still one of the most prominent diseases causing half a million human deaths every year. With the recent observation of new highly pathogenic H5N1 and H7N7 strains, and the appearance of the influenza pandemic caused by the H1N1 swine-like lineage, a collaborative effort to share observations on the evolution of this virus in both animals and humans has been established. The OpenFlu database (OpenFluDB) is a part of this collaborative effort. It contains genomic and protein sequences, as well as epidemiological data from more than 27,000 isolates. The isolate annotations include virus type, host, geographical location and experimentally tested antiviral resistance. Putative enhanced pathogenicity as well as human adaptation propensity are computed from protein sequences. Each virus isolate can be associated with the laboratories that collected, sequenced and submitted it. Several analysis tools including multiple sequence alignment, phylogenetic analysis and sequence similarity maps enable rapid and efficient mining. The contents of OpenFluDB are supplied by direct user submission, as well as by a daily automatic procedure importing data from public repositories. Additionally, a simple mechanism facilitates the export of OpenFluDB records to GenBank. This resource has been successfully used to rapidly and widely distribute the sequences collected during the recent human swine flu outbreak and also as an exchange platform during the vaccine selection procedure. Database URL: http://openflu.vital-it.ch.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
‘Single isolate upload’ interface. The web form is split into three parts. The top one contains fields to specify isolate name, isolate type and passage history. The syntax of the isolate name is verified against sample collection year and standard influenza nomenclature (except for laboratory-derived strains). The middle part contains sample data including host and sample collection date and geographical location. The bottom part contains additional data including sample provider laboratory, sequencer laboratory, in vivo tested antiviral resistance and user note.
Figure 2.
Figure 2.
The ‘browse’ form is the major entry point to query the database. (A) ‘What’s new’ section contains links to newly deposited sequences and announcements about OpenFluDB feature development. (B) The ‘Quick search’ field is used to query on EPI_ISL_ID, EPIID, isolate name or EMBL/DDBJ/GenBank accession numbers. (C) Basic ‘browse’ fields are composed of viral type and subtype, viral host and sample collection geographical location. Multiple selections are possible. (D) Several additional filters can be appended to the query; sample collection date, submission date, minimal sequence length, isolate name, EPI_ISL_ID, passage history, lineage, EPIID, DDBJ/EMBL/GenBank accession number, sequence submitter laboratory, whether the isolate has been primarily deposited in OpenFluDB, whether the isolate is publicly accessible in DDBJ/EMBL/GenBank and whether the sequence has a complete CDS. Finally, required segments or complete genome can be specified.
Figure 3.
Figure 3.
‘Browse’ results are presented in an isolate-centric view. Columns contain either annotations or sequence length. Switching between the nucleotide and protein views is done with the top right switch. Complete CDS or proteins can be highlighted in bold. Checkboxes are used to select isolates for further analysis, export or assignment to a reference set. Analysis buttons are from left to right: MUSCLE MSA, BLAST, SSMs and isolate geolocation. Isolate annotations are exported in a Microsoft Excel file and viral nucleotide or protein sequences are exported in a FASTA file.
Figure 4.
Figure 4.
OpenFluDB analysis tools. (A) The Google Maps API is used to position sample collection on a world map. MSA are computed with MUSCLE and displayed with Jalview (B) and as a tree with PhyloWidget (C). SSMs of hemaglutinin nucleotide sequences are computed by multidimensional scaling, and projections on the first two principal dimensions are displayed. (D) Human H3N2 sequences are colored based on year of sample collection revealing a genetic drift of the sequences. (E) H1 sequences are colored by host species. Three main clusters are revealed: human, swine and avian. Red highlighted sequences from the recent human swine H1N1 lineage are located near the swine cluster.

References

    1. WHO. Geneva, Switzerland: World Health Organization; 2009. Fact Sheet 211. http://www.who.int/mediacentre/factsheets/fs211/en/
    1. Parvin JD, Moscona A, Pan WT, et al. Measurement of the mutation rates of animal viruses: influenza A virus and poliovirus type 1. J. Virol. 1986;59:377–383. - PMC - PubMed
    1. Boni MF. Vaccination and antigenic drift in influenza. Vaccine. 2008;26(Suppl. 3):C8–C14. - PMC - PubMed
    1. Cox NJ, Subbarao K. Global epidemiology of influenza: past and present. Annu. Rev. Med. 2000;51:407–421. - PubMed
    1. de Jong JC, Claas EC, Osterhaus AD, et al. A pandemic warning? Nature. 1997:389–554. - PMC - PubMed

Publication types