Comparative Study
Nat Methods. 2015 Nov;12(11):1033-8.
doi: 10.1038/nmeth.3583. Epub 2015 Sep 21.

Comparing the performance of biomedical clustering methods

Christian Wiwie et al. Nat Methods. 2015 Nov.

Abstract

Identifying groups of similar objects is a popular first step in biomedical data analysis, but it is error-prone and impossible to perform manually. Many computational methods have been developed to tackle this problem. Here we assessed 13 well-known methods using 24 data sets ranging from gene expression to protein domains. Performance was judged on the basis of 13 common cluster validity indices. We developed a clustering analysis platform, ClustEval (http://clusteval.mpi-inf.mpg.de), to promote streamlined evaluation, comparison and reproducibility of clustering results in the future. This allowed us to objectively evaluate the performance of all tools on all data sets with up to 1,000 different parameter sets each, resulting in a total of more than 4 million calculated cluster validity indices. We observed that there was no universal best performer, but on the basis of this wide-ranging comparison we were able to develop a short guideline for biomedical clustering tasks. ClustEval allows biomedical researchers to pick the appropriate tool for their data type and allows method developers to compare their tool to the state of the art.
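The evaluation loop described in the abstract (run a method under many parameter sets, score each resulting clustering with a validity index, keep the best) can be sketched in a few lines. This is a minimal illustration under stated assumptions, not ClustEval's implementation: `gap_clustering` is a hypothetical toy 1-D method, the Rand index stands in for the 13 validity indices (which the abstract does not name), and the data and gold-standard labels are invented for the example.

```python
from itertools import combinations


def rand_index(labels_a, labels_b):
    """Fraction of object pairs on which two clusterings agree
    (both put the pair together, or both keep it apart)."""
    pairs = list(combinations(range(len(labels_a)), 2))
    agree = sum(
        (labels_a[i] == labels_a[j]) == (labels_b[i] == labels_b[j])
        for i, j in pairs
    )
    return agree / len(pairs)


def gap_clustering(values, max_gap):
    """Hypothetical toy method: walk the values in sorted order and
    start a new cluster whenever the step to the next value exceeds max_gap."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    labels = [0] * len(values)
    cluster = 0
    for prev, cur in zip(order, order[1:]):
        if values[cur] - values[prev] > max_gap:
            cluster += 1
        labels[cur] = cluster
    return labels


def sweep(values, gold, max_gaps):
    """Score one method under several parameter values against a gold standard."""
    scores = {g: rand_index(gap_clustering(values, g), gold) for g in max_gaps}
    best = max(scores, key=scores.get)
    return best, scores


# Invented example data with an assumed gold-standard grouping.
values = [1.0, 1.2, 1.1, 5.0, 5.3, 9.0, 9.1]
gold = [0, 0, 0, 1, 1, 2, 2]

best_gap, scores = sweep(values, gold, [0.05, 0.5, 3.75, 10.0])
print(best_gap, scores)
```

Only the middle setting recovers the gold standard here; too small a gap splits every object into its own cluster and too large a gap merges everything, which is why a parameter sweep is needed before methods can be compared fairly. The figures in the abstract are consistent with such a full sweep: 13 methods × 24 data sets × up to 1,000 parameter sets × 13 indices gives at most 4,056,000 index values.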
