Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2008 Jan 22:9:31.
doi: 10.1186/1471-2105-9-31.

The comparative analysis of statistics, based on the likelihood ratio criterion, in the automated annotation problem

Affiliations

The comparative analysis of statistics, based on the likelihood ratio criterion, in the automated annotation problem

Andrey M Leontovich et al. BMC Bioinformatics. .

Abstract

Background: This paper discusses the problem of automated annotation. It is a continuation of the previous work on the A4-algorithm (Adaptive algorithm of automated annotation) developed by Leontovich and others.

Results: A number of new statistics for the automated annotation of biological sequences is introduced. All these statistics are based on the likelihood ratio criterion.

Conclusion: Some of the statistics yield a prediction quality that is significantly higher (up to 1.5 times higher) in comparison with the results obtained with the A4-procedure.

PubMed Disclaimer

Figures

Figure 1
Figure 1
A4 algorithm scheme.
Figure 2
Figure 2
ROC curves for the best variants of the statistics. The figure shows the ROC curves (1–5) for the best variants of the statistics (i.e., for variants marked in Table 2). Curve 1 corresponds to η[1,1]; curve 2 corresponds to T(1)[1,0]; curve 3 corresponds to T(2)[1,1]; curve 4 corresponds to T^(2)[0,1]; curve 5 corresponds to q.

References

    1. Fleischmann W, Moller S, Gateau A, Apweiler R. A novel method for automatic functional annotation of proteins. Bioinformatics. 1999;15:228–233. doi: 10.1093/bioinformatics/15.3.228. - DOI - PubMed
    1. Andrade MA, Brown NP, Leroy C, Hoersch S, de Daruvar A, Reich C, Franchini A, Tamames J, Valencia A, Ouzounis C, Sander C. Automated genome sequence analysis and annotation. Bioinformatics. 1999;15:391–412. doi: 10.1093/bioinformatics/15.5.391. - DOI - PubMed
    1. Kretschmann E, Fleischmann W, Apweiler R. Automatic rule generation for protein annotation with the C4.5 data mining algorithm applied on SWISS-PROT. Bioinformatics. 2001;17:920–926. doi: 10.1093/bioinformatics/17.10.920. - DOI - PubMed
    1. Hegyi H, Gerstein M. Annotation transfer for genomics: Measuring functional divergence in multi-domain proteins. Genome Research. 2001;11:1632–1640. 10.1101/gr. 183801. - PMC - PubMed
    1. Leontovich AM, Brodsky LI, Drachev VA, Nikolaev VK. Adaptive algorithm of automated annotation. Bioinformatics. 2002;18:838–846. doi: 10.1093/bioinformatics/18.6.838. - DOI - PubMed

Publication types

LinkOut - more resources