Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Mar;20(1):e3.
doi: 10.5808/gi.20038. Epub 2022 Mar 31.

Whole-genome sequence analysis through online web interfaces: a review

Affiliations

Whole-genome sequence analysis through online web interfaces: a review

A W A C W R Gunasekara et al. Genomics Inform. 2022 Mar.

Abstract

The recent development of whole-genome sequencing technologies paved the way for understanding the genomes of microorganisms. Every whole-genome sequencing (WGS) project requires a considerable cost and a massive effort to address the questions at hand. The final step of WGS is data analysis. The analysis of whole-genome sequence is dependent on highly sophisticated bioinformatics tools that the research personal have to buy. However, many laboratories and research institutions do not have the bioinformatics capabilities to analyze the genomic data and therefore, are unable to take maximum advantage of whole-genome sequencing. In this aspect, this study provides a guide for research personals on a set of bioinformatics tools available online that can be used to analyze whole-genome sequence data of bacterial genomes. The web interfaces described here have many advantages and, in most cases exempting the need for costly analysis tools and intensive computing resources.

Keywords: average nucleotide identitys; online web servers; single nucleotide polymorphisms; virulence factors; whole-genome sequencing.

PubMed Disclaimer

Conflict of interest statement

Conflicts of Interest

No potential conflict of interest relevant to this article was reported.

Figures

Fig. 1.
Fig. 1.
General overview of high throughput sequencing workflow of a bacterial genome. Following genome assembly, online web interfaces can be utilized for the purpose of analyzing WGS. MLST, multilocus sequence typing; WGS, whole-genome sequencing.
Fig. 2.
Fig. 2.
General subsystem features and KEGG pathway of drug metabolism of Vibrio parahaemolyticus 3HPAHPND genome through RAST server (Different colors in the subsystem category distribution indicates different subsystem features whereas KEGG pathway indicates the functions for V. parahaemolyticus 3HPAHPND genome). KEGG, Kyoto Encyclopedia of Genes and Genomes.
Fig. 3.
Fig. 3.
CGE server workflow of classical MLST typing and SNP calling on selected Vibrio parahaemolyticus genomes. (A) Clinical V. parahaemolyticus genome RIMD 221063 was used as a reference for SNP calling. (B) The V. parahaemolyticus 3HPAHPND genome was used for the in silco MLST analysis. MLST, multilocus sequence typing; SNP, single nucleotide polymorphism.
Fig. 4.
Fig. 4.
(A) The cano-wgMLST server workflow of wgMLST phylogeny and identification of highly discriminatory genes on 5 Vibrio parahaemolyticus genomes. (B) JSpeciesWS server workflow of ANI among 5 Vibrio parahaemolyticus genomes. ANI, average nucleotide identity; wgMLST, whole-genome multilocus sequence typing.
Fig. 5.
Fig. 5.
(A) Virulence factor analysis workflow of VFDB server. (B) Prophage analysis workflow of PHASTER server. (C) Cluster of orthologous group (COG) analysis workflow of WebMGA server. Vibrio parahaemolyticus genome 3HPAHPND was used as a reference for all the applications.
Fig. 6.
Fig. 6.
Graphical illustration of chromosome I for 5 Vibrio parahaemolyticus genomes by GView server. Different colors in the genome map indicate different genomes. Clinical Vibrio parahaemolyticus genome RIMD 221063 was used as the reference for the genomic mapping.

References

    1. Sanger F, Nicklen S, Coulson AR. DNA sequencing with chain-terminating inhibitors. Proc Natl Acad Sci U S A. 1977;74:5463–5467. - PMC - PubMed
    1. Staden R. A strategy of DNA sequencing employing computer programs. Nucleic Acids Res. 1979;6:2601–2610. - PMC - PubMed
    1. Fleischmann RD, Adams MD, White O, Clayton RA, Kirkness EF, Kerlavage AR, et al. Whole-genome random sequencing and assembly of Haemophilus influenzae Rd. Science. 1995;269:496–512. - PubMed
    1. Austin CP. The impact of the completed human genome sequence on the development of novel therapeutics for human disease. Annu Rev Med. 2004;55:1–13. - PubMed
    1. Hood L, Rowen L. The Human Genome Project: big science transforms biology and medicine. Genome Med. 2013;5:79. - PMC - PubMed

LinkOut - more resources