Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2002 Jul 30:3:19.
doi: 10.1186/1471-2105-3-19.

Visualizing the genome: techniques for presenting human genome data and annotations

Affiliations

Visualizing the genome: techniques for presenting human genome data and annotations

Ann E Loraine et al. BMC Bioinformatics. .

Abstract

Background: In order to take full advantage of the newly available public human genome sequence data and associated annotations, biologists require visualization tools ("genome browsers") that can accommodate the high frequency of alternative splicing in human genes and other complexities.

Results: In this article, we describe visualization techniques for presenting human genomic sequence data and annotations in an interactive, graphical format. These techniques include: one-dimensional, semantic zooming to show sequence data alongside gene structures; color-coding exons to indicate frame of translation; adjustable, moveable tiers to permit easier inspection of a genomic scene; and display of protein annotations alongside gene structures to show how alternative splicing impacts protein structure and function. These techniques are illustrated using examples from two genome browser applications: the Neomorphic GeneViewer annotation tool and ProtAnnot, a prototype viewer which shows protein annotations in the context of genomic sequence.

Conclusion: By presenting techniques for visualizing genomic data, we hope to provide interested software developers with a guide to what features are most likely to meet the needs of biologists as they seek to make sense of the rapidly expanding body of public genomic data and annotations.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Annotated GeneViewer screen capture showing FLJ22324, a six-exon gene inferred from a cDNA-to-genomic sequence alignment.(a) The high-level structure at low zoom. (b) A close-up view of a questionable small intron separating exons 5 and 6. Dinucleotide bases at this intron's 5' and 3' boundaries are underlined.
Figure 2
Figure 2
Hypothetical example showing how semantic zooming could be used to represent gene structure annotations based on cDNA-to-genomic sequence alignments.(a) Low zoom. (b) High zoom.
Figure 3
Figure 3
SNURF locus.(a) The full scene is shown with multiple annotation types sorted into labeled tiers. (b) A simplified scene is shown in which several tiers shown in (a) have been hidden, collapsed, or moved to new positions. The horizontal slider has been used to expand the display in the vertical direction.
Figure 4
Figure 4
Using color to represent frame of translation at the ARG1 locus. Coding regions in each exon are colored according to which frame of the genomic sequence is translated. A different color for overlapping exons from different transcripts indicates these exons are translated in different frames.
Figure 5
Figure 5
Protein motifs detected by Pfam are displayed beneath alternative transcript structures (a,b) at the PLAT locus. Alignments between genomic sequence and cDNA sequences (a) BC002795.1 and (b) NM_0009301 are shown. Regions encoding matches to Pfam motifs PF00008 (EGF-like domain), PF00051 (Kringle), PF00089 (Serine proteases, trypsin family), and PF00039 (Type I fibronectin) are shown as linked green rectangles below each alignment.

Similar articles

Cited by

References

    1. Burge C, Karlin S. Prediction of complete gene structures in human genomic DNA. J Mol Biol. 1997;268:78–94. doi: 10.1006/jmbi.1997.0951. - DOI - PubMed
    1. Kulp D, Haussler D, Reese MG, Eeckman FH. A generalized hidden Markov model for the recognition of human genes in DNA. Proc Int Conf Intell Syst Mol Biol. 1996;4:134–42. - PubMed
    1. http://www.ncbi.nlm.nih.gov
    1. Kent WJ, Sugnet CW, Furey TS, Roskin KM, Pringle TH, Zahler AM, Haussler D. The human genome browser at UCSC. Genome Res. 2002;12:996–1006. 10.1101/gr.229102. Article published online before print in May 2002. - PMC - PubMed
    1. Helt GA, Lewis S, Loraine AE, Rubin GM. BioViews: Java-based tools for genomic data visualization. Genome Res. 1998;8:291–305. - PMC - PubMed

MeSH terms

LinkOut - more resources