Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Nov 12;21(1):502.
doi: 10.1186/s12859-020-03764-3.

Avian Immunome DB: an example of a user-friendly interface for extracting genetic information

Affiliations

Avian Immunome DB: an example of a user-friendly interface for extracting genetic information

Ralf C Mueller et al. BMC Bioinformatics. .

Erratum in

Abstract

Background: Genomic and genetic studies often require a target list of genes before conducting any hypothesis testing or experimental verification. With the ever-growing number of sequenced genomes and a variety of different annotation strategies, comes the potential for ambiguous gene symbols, making it cumbersome to capture the "correct" set of genes. In this article, we present and describe the Avian Immunome DB (AVIMM) for easy gene property extraction as exemplified by avian immune genes. The avian immune system is characterised by a cascade of complex biological processes underlaid by more than 1000 different genes. It is a vital trait to study particularly in birds considering that they are a significant driver in spreading zoonotic diseases. With the completion of phase II of the B10K ("Bird 10,000 Genomes") consortium's whole-genome sequencing effort, we have included 363 annotated bird genomes in addition to other publicly available bird genome data which serve as a valuable foundation for AVIMM.

Construction and content: A relational database with avian immune gene evidence from Gene Ontology, Ensembl, UniProt and the B10K consortium has been designed and set up. The foundation stone or the "seed" for the initial set of avian immune genes is based on the well-studied model organism chicken (Gallus gallus). Gene annotations, different transcript isoforms, nucleotide sequences and protein information, including amino acid sequences, are included. Ambiguous gene names (symbols) are resolved within the database and linked to their canonical gene symbol. AVIMM is supplemented by a command-line interface and a web front-end to query the database.

Utility and discussion: The internal mapping of unique gene symbol identifiers to canonical gene symbols allows for an ambiguous gene property search. The database is organised within core and feature tables, which makes it straightforward to extend for future purposes. The database design is ready to be applied to other taxa or biological processes. Currently, the database contains 1170 distinct avian immune genes with canonical gene symbols and 612 synonyms across 363 bird species. While the command-line interface readily integrates into bioinformatics pipelines, the intuitive web front-end with download functionality offers sophisticated search functionalities and tracks the origin for each record. AVIMM is publicly accessible at https://avimm.ab.mpg.de .

Keywords: Avian; B10K; Genomics; Immunogenomics; Immunology; Immunome; Trait database.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

Fig. 1
Fig. 1
Concept of Avimm, the Avian Immunome DB
Fig. 2
Fig. 2
Logical data model (LDM) in Crow’s Foot Notation [42] of Avimm. Core tables represent the basic structure of the database and link the evidence-specific identifiers to Avimm’s unique identifiers (uid). Feature tables represent evidence-specific features like isoforms, nucleotide sequences, or amino acid sequences of the immune genes
Fig. 3
Fig. 3
Flow chart, loading of Avimm. Uncoloured boxes show preparatory steps. Yellow boxes describe imports into core tables and green boxes imports into feature tables. The initial set of immune genes in chicken is derived from “biological process/immune system process” in AmiGo 2 filtered for chicken. Gene symbols were extracted from Ensembl’s website since Ensembl has the most current/complete chicken annotation. All steps are explained in detail on the project’s wiki page [41]
Fig. 4
Fig. 4
Avimm landing page with links to the four function pages (upper box) and description for each function (lower box)
Fig. 5
Fig. 5
Excerpt of evidence function page. Gene selection either by “mark-and-arrow” or free text (top left box). Species selection based on taxonomic ranks (centre box). Checkboxes to filter columns (right box) and download data option (bottom left box). Each section has an information button (circled “i”) to provide further details
Fig. 6
Fig. 6
The gene evidence result list shows one record for each evidence of the gene IFNL3A in the selected species. The columns can be sorted (ascending or descending) by clicking on the corresponding column header. On top of the list are links to B10K, Ensembl, and UniProt result pages based on the same search criteria (upper box). If there are alternative gene symbols for the query gene in Avimm, then these are listed on the bottom of the page (lower box)
Fig. 7
Fig. 7
Excerpts of the result pages for IFNL3A in B10K (a), Ensembl (b), and UniProt (c). Each result page offers additional filters and download options. The original identifiers of the source of evidence are retained and linked to the original entries on Ensembl (b, left box) or UniProt (c, left box). Additionally, links to NCBI nucleotide sequence BLAST (a, box), NCBI gene information (b, right box), and NCBI amino acid sequence BLAST (c, right box) are provided for each record
Fig. 8
Fig. 8
Excerpt of the result page for MASP2 evidence across all 363 species in Avimm. There is evidence in all Galliformes (11/11) and almost all Anseriformes (5/7) but only in two Passeriformes (out of 173 in Avimm)

References

    1. Shendure J, Balasubramanian S, Church GM, Gilbert W, Rogers J, Schloss JA, Waterston RH. DNA sequencing at 40: past, present and future. Nature. 2017;550(7676):345–353. doi: 10.1038/nature24286. - DOI - PubMed
    1. GenBank and WGS Statistics. https://www.ncbi.nlm.nih.gov/genbank/statistics/. Accessed 13 May 2020
    1. Brusic V, Zeleznikow J, Petrovsky N. Molecular immunology databases and data repositories. J Immunol Methods. 2000;238(1):17–28. doi: 10.1016/S0022-1759(00)00159-9. - DOI - PubMed
    1. Selzer PM, Marhöfer RJ, Koch O. Biological databases. In: Selzer PM, Marhöfer RJ, Koch O, editors. Applied bioinformatics: an Introduction. Cham: Springer; 2018. pp. 13–34.
    1. Cunningham F, Achuthan P, Akanni W, Allen J, Amode MR, Armean IM, Bennett R, Bhai J, Billis K, Boddu S, Cummins C, Davidson C, Dodiya KJ, Gall A, Girón CG, Gil L, Grego T, Haggerty L, Haskell E, Hourlier T, Izuogu OG, Janacek SH, Juettemann T, Kay M, Laird MR, Lavidas I, Liu Z, Loveland JE, Marugán JC, Maurel T, McMahon AC, Moore B, Morales J, Mudge JM, Nuhn M, Ogeh D, Parker A, Parton A, Patricio M, Abdul Salam AI, Schmitt BM, Schuilenburg H, Sheppard D, Sparrow H, Stapleton E, Szuba M, Taylor K, Threadgold G, Thormann A, Vullo A, Walts B, Winterbottom A, Zadissa A, Chakiachvili M, Frankish A, Hunt SE, Kostadima M, Langridge N, Martin FJ, Muffato M, Perry E, Ruffier M, Staines DM, Trevanion SJ, Aken BL, Yates AD, Zerbino DR, Flicek P. Ensembl. Nucleic Acids Res. 2019;47(D1):745–51. doi: 10.1093/nar/gky1113. - DOI - PMC - PubMed

LinkOut - more resources