Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 Sep:Chapter 4:4.6.1-4.6.10.
doi: 10.1002/0471250953.bi0406s35.

Eukaryotic gene prediction using GeneMark.hmm-E and GeneMark-ES

Affiliations

Eukaryotic gene prediction using GeneMark.hmm-E and GeneMark-ES

Mark Borodovsky et al. Curr Protoc Bioinformatics. 2011 Sep.

Abstract

This unit describes how to use the gene-finding programs GeneMark.hmm-E and GeneMark-ES for finding protein-coding genes in the genomic DNA of eukaryotic organisms. These bioinformatics tools have been demonstrated to have state-of-the-art accuracy for many fungal, plant, and animal genomes, and have frequently been used for gene annotation in novel genomic sequences. An additional advantage of GeneMark-ES is that the problem of algorithm parameterization is solved automatically, with parameters estimated by iterative self-training (unsupervised training).

PubMed Disclaimer

Figures

Figure 1
Figure 1
The user interface for the eukaryotic GeneMark.hmm program. Required input includes a DNA sequence in FASTA format, either copied and pasted into the Sequence text box, or uploaded from one’s local drive using the Browse button next to the Sequence File Upload box.
Figure 2
Figure 2
The text output for the Eukaryotic GeneMark.hmm program.
Figure 3
Figure 3
The graphical output from the Eukaryotic GeneMark.hmm program for a region of the example sequence. The six different panels represent the six possible reading frames, three each on the direct and reverse strands.

References

    1. Altschul SF, Gish W, et al. Basic local alignment search tool. J. Mol. Biol. 1990;215:403–410. - PubMed
    1. Besemer J, Lomsadze A, Borodovsky M. GeneMarkS: A self-training method for prediction of gene starts in microbial genomes: Implications for finding sequence motifs in regulatory regions. Nucleic Acids Res. 2001;29:2607–2618. - PMC - PubMed
    1. Borodovsky M, McIninch J. GeneMark: Parallel gene recognition for both DNA strands. Comput. Chem. 1993;17:123–133.
    1. Lukashin AV, Borodovsky M. GeneMark.hmm: New solutions for gene finding. Nucleic Acids Res. 1998;26:1107–1115. - PMC - PubMed
    1. Lomsadze A, Ter-Hovhannisyan V, Chernoff Y, Borodovsky M. Gene identification in novel eukaryotic genomes by self-training algorithm. Nucleic Acids Research. 2005;Vol. 33(No. 20):6494–6506. - PMC - PubMed

Publication types