ScientificWorldJournal. 2015;2015:180749.
doi: 10.1155/2015/180749. Epub 2015 Oct 12.

Convalescing Cluster Configuration Using a Superlative Framework

R Sabitha et al. ScientificWorldJournal. 2015.

Abstract

Competent data mining methods are vital for discovering knowledge in databases that have been built up by the enormous growth of data. Various data mining techniques are applied to obtain knowledge from these databases. Data clustering is one such descriptive technique, which partitions data objects into disjoint segments. Among the approaches used in data clustering, the K-means algorithm is a versatile one, but the algorithm and its various adaptations suffer from certain performance problems. To overcome these issues, this paper proposes a superlative algorithm for data clustering. Its distinguishing features are that it discretizes the dataset, thereby improving clustering accuracy, and that it adopts the binary search initialization method to generate cluster centroids. The generated centroids are fed as input to K-means, which iteratively assigns the data objects to their respective clusters. The clustered results are measured for accuracy and validity. Experiments on datasets from the UC Irvine Machine Learning Repository show that the accuracy and validity measures of the proposed approach are higher than those of the other two approaches, namely simple K-means and the Binary Search method. The proposed approach thus indicates that discretization improves the efficacy of descriptive data mining tasks.
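The three stages named in the abstract (discretize, generate initial centroids, run K-means) can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not specify the discretization scheme or the details of the binary search initialization, so the sketch assumes equal-width binning and uses evenly spaced starting centroids as a stand-in. The function names `discretize_equal_width`, `evenly_spaced_centroids`, and `k_means` are hypothetical.

```python
from typing import List, Sequence, Tuple

Point = List[float]


def discretize_equal_width(data: Sequence[Sequence[float]], bins: int) -> List[Point]:
    """Replace each feature value with the index of its equal-width bin.

    Assumption: equal-width binning; the paper's own scheme may differ.
    """
    columns = list(zip(*data))
    lows = [min(c) for c in columns]
    highs = [max(c) for c in columns]
    result = []
    for row in data:
        new_row = []
        for value, lo, hi in zip(row, lows, highs):
            width = (hi - lo) / bins
            # Constant features go to bin 0; the maximum is clamped to the last bin.
            idx = 0 if width == 0 else min(int((value - lo) / width), bins - 1)
            new_row.append(float(idx))
        result.append(new_row)
    return result


def evenly_spaced_centroids(data: Sequence[Sequence[float]], k: int) -> List[Point]:
    """Place k centroids at even steps between each feature's min and max.

    Stand-in for the binary search initialization, whose details are assumed.
    """
    columns = list(zip(*data))
    lows = [min(c) for c in columns]
    steps = [(max(c) - min(c)) / k for c in columns]
    return [[lo + (i + 0.5) * s for lo, s in zip(lows, steps)] for i in range(k)]


def squared_distance(a: Sequence[float], b: Sequence[float]) -> float:
    return sum((x - y) ** 2 for x, y in zip(a, b))


def k_means(data: Sequence[Sequence[float]], centroids: Sequence[Sequence[float]],
            max_iter: int = 100) -> Tuple[List[int], List[Point]]:
    """Standard Lloyd iteration: assign points, recompute means, repeat."""
    centers = [list(c) for c in centroids]
    labels = [-1] * len(data)
    for _ in range(max_iter):
        new_labels = [
            min(range(len(centers)), key=lambda j: squared_distance(p, centers[j]))
            for p in data
        ]
        if new_labels == labels:  # assignments stable: converged
            break
        labels = new_labels
        for j in range(len(centers)):
            members = [p for p, lab in zip(data, labels) if lab == j]
            if members:  # an empty cluster keeps its previous centroid
                centers[j] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centers


# Toy data (made up for illustration): two well-separated groups.
data = [[1.0, 1.1], [1.2, 0.9], [0.8, 1.0], [8.0, 8.2], [8.1, 7.9], [7.9, 8.0]]
binned = discretize_equal_width(data, bins=4)
labels, centers = k_means(binned, evenly_spaced_centroids(binned, k=2))
print(labels)  # [0, 0, 0, 1, 1, 1]
```

Because the starting centroids are computed from the data rather than drawn at random, repeated runs on the same input give the same clustering. That is the usual motivation for deterministic initialization schemes.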

Figures

Figure 1. Proposed framework.
Figure 2. Discretization framework.
Figure 3. Process of discretization.
Figure 4. Accuracy of proposed method with and without Phase I.
Figure 5. Validity of proposed method with and without Phase I.
Figure 6. Comparative analysis of the algorithms based on accuracy and DB index.
Algorithm 1. Steps in discretization.
Algorithm 2. Identifying initial centroids.
Algorithm 3. K-means clustering.

