Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jan 7;50(D1):D898-D911.
doi: 10.1093/nar/gkab929.

VEuPathDB: the eukaryotic pathogen, vector and host bioinformatics resource center

Beatrice Amos  1 Cristina Aurrecoechea  2 Matthieu Barba  3 Ana Barreto  4   5 Evelina Y Basenko  1 Wojciech Bażant  6 Robert Belnap  2 Ann S Blevins  7 Ulrike Böhme  1 John Brestelli  4   5 Brian P Brunk  8 Mark Caddick  1 Danielle Callan  4   5 Lahcen Campbell  3 Mikkel B Christensen  3 George K Christophides  9 Kathryn Crouch  6 Kristina Davis  10 Jeremy DeBarry  2 Ryan Doherty  4   8 Yikun Duan  4   8 Michael Dunn  10 Dave Falke  2 Steve Fisher  4   5 Paul Flicek  3 Brett Fox  10 Bindu Gajria  8 Gloria I Giraldo-Calderón  11   12 Omar S Harb  8 Elizabeth Harper  8 Christiane Hertz-Fowler  1 Mark J Hickman  8 Connor Howington  10 Sufen Hu  4   8 Jay Humphrey  2 John Iodice  4   5 Andrew Jones  1 John Judkins  4   8 Sarah A Kelly  9 Jessica C Kissinger  2   13   14 Dae Kun Kwon  15 Kristopher Lamoureux  2 Daniel Lawson  9 Wei Li  4   8 Kallie Lies  10 Disha Lodha  3 Jamie Long  8 Robert M MacCallum  9 Gareth Maslen  3 Mary Ann McDowell  11 Jaroslaw Nabrzyski  10 David S Roos  8 Samuel S C Rund  11 Stephanie Wever Schulman  8 Achchuthan Shanmugasundram  1 Vasily Sitnik  3 Drew Spruill  2 David Starns  1 Christian J Stoeckert  4   5 Sheena Shah Tomko  8 Haiming Wang  2 Susanne Warrenfeltz  2 Robert Wieck  10 Paul A Wilkinson  1 Lin Xu  4   8 Jie Zheng  4   5
Affiliations

VEuPathDB: the eukaryotic pathogen, vector and host bioinformatics resource center

Beatrice Amos et al. Nucleic Acids Res. .

Abstract

The Eukaryotic Pathogen, Vector and Host Informatics Resource (VEuPathDB, https://veupathdb.org) represents the 2019 merger of VectorBase with the EuPathDB projects. As a Bioinformatics Resource Center funded by the National Institutes of Health, with additional support from the Welllcome Trust, VEuPathDB supports >500 organisms comprising invertebrate vectors, eukaryotic pathogens (protists and fungi) and relevant free-living or non-pathogenic species or hosts. Designed to empower researchers with access to Omics data and bioinformatic analyses, VEuPathDB projects integrate >1700 pre-analysed datasets (and associated metadata) with advanced search capabilities, visualizations, and analysis tools in a graphic interface. Diverse data types are analysed with standardized workflows including an in-house OrthoMCL algorithm for predicting orthology. Comparisons are easily made across datasets, data types and organisms in this unique data mining platform. A new site-wide search facilitates access for both experienced and novice users. Upgraded infrastructure and workflows support numerous updates to the web interface, tools, searches and strategies, and Galaxy workspace where users can privately analyse their own data. Forthcoming upgrades include cloud-ready application architecture, expanded support for the Galaxy workspace, tools for interrogating host-pathogen interactions, and improved interactions with affiliated databases (ClinEpiDB, MicrobiomeDB) and other scientific resources, and increased interoperability with the Bacterial & Viral BRC.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
VEuPathDB redesigned homepage. (A) The banner section is present on all webpages and contains the site search box and access to all searches, data, and tools. (B) Just below the banner is a section reserved for general community announcements (C) The left-hand search panel contains categorized searches of data in the resource. (D) The Overview of resources and tools section is in the centre of the page, providing quick vignettes to help our users get started with a specific topic. (E) Links to more detailed step-by-step instructional exercises are in the centre of the page just above the footer. (F) The news and tweets section is an expandable tab (collapsed by default) offering users access to recent news releases and tweets. (G) The footer section includes hyperlinked logos to all VEuPathDB component and affiliated sites in addition to the Gitter chat button.
Figure 2.
Figure 2.
Updated strategy system interface. (A) example strategy showing graphic interface for exploring relationships across data sets and organisms. Final strategy can be found here https://toxodb.org/toxo/app/workspace/strategies/import/037c32a7060e4c90. Upper right corner shows actions that can be performed on the strategy: copy, add/edit description, save, share, delete and close. (B) Home page Search for menu often used to choose the first search in a strategy. (C) Result of first step with the graphic of the growing strategy in the top panel, redesigned vertical organism filter on the left and the gene result table on the right. (D) Redesigned Add Step popup. The three choices when adding a step are aligned on the right and details appear in the right panel once a left panel option is chosen.
Figure 3.
Figure 3.
MapVEu tool for visualizing and downloading spatially resolved data. (A) The MapVEu interface includes a left-side menu for accessing tools (blue arrow), a search and filtering bar at the top of the screen (green arrow), and a lower right-side legend (green box). Circular markers indicate the location of data collections. Clicking on these markers will zoom and disaggregate them into more spatially resolved points. (B) Tool for choosing data type of interest (enlarged search and filtering bar seen in A, green arrow). The map can display several data types including Samples, Insecticide Resistance, Genotypes, Abundance, Pathogens and Blood Meals. Users can select one of several views to see a specific data type on the map. (C) For many data types, specialized representations are available from the graph tab of the tool menu. Shown here are species abundance counts and insecticide resistance assays. (D) The tool menus’ data table tab highlights the depth of metadata recorded from each sample. (E) All data can be downloaded using the export data tab. F. Visible data can be filtered by adding search terms to the filter bar, which has auto-complete functionality.
Figure 4.
Figure 4.
VEuPathDB data production workflow and architecture. The complete pathway from data acquisition to web presentation and utilization by users is detailed. Production activities and systems are represented in the bottom purple box and the services and presentation layers are represented in the pink and grey boxes. Data enter the system in the data staging box where they are identified and prioritized by Outreach and entered into the Redmine issue tracking system. Once data are cleaned and structured, datasets are available to the processing and integration workflows. Genome sequence, annotation, RNA Seq and DNA sequencing reads are processed at the EBI (bottom box) and passed back to Penn (top box) for data integration and subsequent processing including integration of functional data and ortholog assignment. Data are prepared by these workflows for presentation in the form of relational databases and indexed flat files. The web clients provide access to users via a set of services that communicate with the back-end data stores. The system also includes a user data analysis system (right side) enabling users to analyse their own data and, for some datatypes, import their results into VEuPathDB for analysis and integration with publicly available data.

References

    1. Giraldo-Calderon G.I., Emrich S.J., MacCallum R.M., Maslen G., Dialynas E., Topalis P., Ho N., Gesing S., VectorBase C., Madey G. et al. . VectorBase: an updated bioinformatics resource for invertebrate vectors and other organisms related with human diseases. Nucleic Acids Res. 2015; 43:D707–D713. - PMC - PubMed
    1. Aurrecoechea C., Barreto A., Basenko E.Y., Brestelli J., Brunk B.P., Cade S., Crouch K., Doherty R., Falke D., Fischer S. et al. . EuPathDB: the eukaryotic pathogen genomics database resource. Nucleic Acids Res. 2017; 45:D581–D591. - PMC - PubMed
    1. Davidson S.B., Crabtree J., Brunk B., Schug J., Tannen V., Overton G.C., Stoeckert J., C. J. K2/Kleisli and GUS: experiments in integrated access to genomic data sources. IBM Syst. J. 2001; 40:512–531.
    1. Fischer S., Aurrecoechea C., Brunk B.P., Gao X., Harb O.S., Kraemer E.T., Pennington C., Treatman C., Kissinger J.C., Roos D.S. et al. . The Strategies WDK: a graphical search interface and web development kit for functional genomics databases. Database (Oxford). 2011; 2011:bar027. - PMC - PubMed
    1. Ashburner M., Ball C.A., Blake J.A., Botstein D., Butler H., Cherry J.M., Davis A.P., Dolinski K., Dwight S.S., Eppig J.T. et al. . Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat. Genet. 2000; 25:25–29. - PMC - PubMed

Publication types

MeSH terms