Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 May;11(3):444-9.
doi: 10.1016/j.mito.2010.12.016. Epub 2010 Dec 31.

A critical analysis of the combined usage of protein localization prediction methods: Increasing the number of independent data sets can reduce the accuracy of predicted mitochondrial localization

Affiliations

A critical analysis of the combined usage of protein localization prediction methods: Increasing the number of independent data sets can reduce the accuracy of predicted mitochondrial localization

Kieren T Lythgow et al. Mitochondrion. 2011 May.

Abstract

In the absence of a comprehensive experimentally derived mitochondrial proteome, several bioinformatic approaches have been developed to aid the identification of novel mitochondrial disease genes within mapped nuclear genetic loci. Often, many classifiers are combined to increase the sensitivity and specificity of the predictions. Here we show that the greatest sensitivity and specificity are obtained by using a combination of seven carefully selected classifiers. We also show that increasing the number of independent prediction methods can paradoxically decrease the accuracy of predicting mitochondrial localization. This approach will help to accelerate the identification of new mitochondrial disease genes by providing a principled way for the selection for combination of appropriate prediction methods of mitochondrial localization of proteins.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
Sensitivity of different combinations of prediction tools in rank order. Left hand = detail of the top 100 combinations.
Fig. 2
Fig. 2
Range of mean (a) specificity and (b) sensitivity values for different combinations (n = 2047) of the eleven different prediction tools. Horizontal bar = mean, box = standard deviation, whiskers = range.
Fig. 3
Fig. 3
The contribution of each prediction tool to mean specificity (a and b); and mean sensitivity (c and d) for 100 training and test runs of the SVM. In this figure, “probability” refers to the proportion of combinations where a specific prediction tool contributed to a result with a given specificity and sensitivity. (a) Heat contour plot showing the probability of a specific tool or data set in contributing to a given level of specificity. Red = high probability, yellow = low probability. (b) Probability of each prediction tool contributing to a given level of specificity for the top 100 combinations. (c) Heat contour plot showing the probability of a specific tool in contributing to a given level of sensitivity. Red = high probability, yellow = low probability. (d) Probability of each prediction tool contributing to a given level of sensitivity for the prediction tools generating the top 100 mean sensitivity values.
Fig. 4
Fig. 4
Relationship between sensitivity and specificity for the 2047 different combinations of the prediction tools. Color code = the number of prediction tools incorporated in the SVM prediction. Note the disproportionate axes.

References

    1. Andreoli C., Prokisch H., Hortnagel K., Mueller J.C., Munsterkotter M., Scharfe C., Meitinger T. MitoP2, an integrated database on mitochondrial proteins in yeast and man. Nucleic Acids Res. 2004;32:D459–D462. - PMC - PubMed
    1. Bannai H., Tamada Y., Maruyama O., Nakai K., Miyano S. Extensive feature detection of N-terminal protein sorting signals. Bioinformatics. 2002;18:298–305. - PubMed
    1. Baum P.A., Haussler D. What size net gives valid generalization? Neural Comput. 1989;1:150–160.
    1. Bourdon A., Minai L., Serre V., Jais J.P., Sarzi E., Aubert S., Chretien D., de Lonlay P., Paquis-Flucklinger V., Arakawa H., Nakamura Y., Munnich A., Rotig A. Mutation of RRM2B, encoding p53-controlled ribonucleotide reductase (p53R2), causes severe mitochondrial DNA depletion. Nat. Genet. 2007;39:776–780. - PubMed
    1. Bykhovskaya Y., Casas K., Mengesha E., Inbal A., Fischel-Ghodsian N. Missense mutation in pseudouridine synthase 1 (PUS1) causes mitochondrial myopathy and sideroblastic anemia (MLASA) Am. J. Hum. Genet. 2004;74:1303–1308. - PMC - PubMed

Publication types

MeSH terms

Substances

LinkOut - more resources