Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2007 Jul;35(Web Server issue):W280-4.
doi: 10.1093/nar/gkm354. Epub 2007 Jun 1.

FLAN: a web server for influenza virus genome annotation

Affiliations

FLAN: a web server for influenza virus genome annotation

Yiming Bao et al. Nucleic Acids Res. 2007 Jul.

Abstract

FLAN (short for FLu ANnotation), the NCBI web server for genome annotation of influenza virus (http://www.ncbi.nlm.nih.gov/genomes/FLU/Database/annotation.cgi) is a tool for user-provided influenza A virus or influenza B virus sequences. It can validate and predict protein sequences encoded by an input flu sequence. The input sequence is BLASTed against a database containing influenza sequences to determine the virus type (A or B), segment (1 through 8) and subtype for the hemagglutinin and neuraminidase segments of influenza A virus. For each segment/subtype of the viruses, a set of sample protein sequences is maintained. The input sequence is then aligned against the corresponding protein set with a 'Protein to nucleotide alignment tool' (ProSplign). The translated product from the best alignment to the sample protein sequence is used as the predicted protein encoded by the input sequence. The output can be a feature table that can be used for sequence submission to GenBank (by Sequin or tbl2asn), a GenBank flat file, or the predicted protein sequences in FASTA format. A message showing the length of the input sequence, the predicted virus type, segment and subtype for the hemagglutinin and neuraminidase segments of Influenza A virus will also be displayed.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
(A) A fragment of ProSplign alignment of query influenza A virus segment 4 (at the top) against a signal peptide (first 16 amino acids of BAA21644, at the bottom). Similarity is too low for BLAST to find a significant hit. Translation in the middle becomes the annotation (see signal peptide on ABM22048). (B) The sample protein AAF99671 is aligned against the query sequence CY019262. ProSplign identified the GT/AG splicing junction. Amino acid threonine spans the splicing site. FLAN passes the coordinates of exons from the alignment to the final annotation.
Figure 2.
Figure 2.
A sample output of the FLAN tool. The top part is a feature table showing feature locations (for gene and CDS) and qualifiers (gene and product). The lower part shows the diagnostic information about the sequence annotation.
Figure 3.
Figure 3.
A NetSchedule (NS) schema. Client (end user) submits data to CGI at NCBI web server. CGI connects and sends data to the NetCache (NC) server. NC keeps data into blob and returns blob_id back to CGI. CGI connects to the NS server, submits request to execute the job with data from blob_id. NS puts this request in a queue and reports assigned job_id back to CGI. The job is waiting to be executed. WorkerNodes (WN) contacts NS constantly to check jobs in a queue. NS gives WN a job with blob_id of input data to execute. WN takes this blob_id, retrieves input data from NC, and executes the job. When execution is done, WN puts result in new blob2 in NC and gets blob2_id back from NC. WN connects to NS and reports job's execution status and blob2_id of result. NS answers to the status request from CGI with ready status and blob2_id. CGI gets blob2_id, connects to NC and retrieves blob2 with resulting data. Results data is presented to client.

References

    1. Ghedin E, Sengamalay NA, Shumway M, Zaborsky J, Feldblyum T, Subbu V, Spiro DJ, Sitz J, Koo H, et al. Large-scale sequencing of human influenza reveals the dynamic nature of viral genome evolution. Nature. 2005;437:1162–1166. - PubMed
    1. Besemer J, Borodovsky M. Heuristic approach to deriving models for gene finding. Nucleic Acids Res. 1999;27:3911–3920. - PMC - PubMed
    1. Tcherepanov V, Ehlers A, Upton C. Genome Annotation Transfer Utility (GATU): rapid annotation of viral genomes using a closely related reference genome. BMC Genomics. 2006;7:150. - PMC - PubMed
    1. Obenauer JC, Denson J, Mehta PK, Su X, Mukatira S, Finkelstein DB, Xu X, Wang J, Ma J, et al. Large-scale sequence analysis of avian influenza isolates. Science. 2006;311:1576–1580. - PubMed
    1. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J. Mol. Biol. 1990;215:403–410. - PubMed

Publication types