Convalescing Cluster Configuration Using a Superlative Framework

R Sabitha¹, S Karthik²

Affiliations

¹ Department of Information Technology, Info Institute of Engineering, Coimbatore 641107, India.
² Department of CSE, SNS College of Technology, Coimbatore 641035, India.

PMID: 26543895
PMCID: PMC4620246
DOI: 10.1155/2015/180749

Convalescing Cluster Configuration Using a Superlative Framework

R Sabitha et al. ScientificWorldJournal. 2015.

. 2015:2015:180749.

doi: 10.1155/2015/180749. Epub 2015 Oct 12.

Authors

R Sabitha¹, S Karthik²

Affiliations

¹ Department of Information Technology, Info Institute of Engineering, Coimbatore 641107, India.
² Department of CSE, SNS College of Technology, Coimbatore 641035, India.

PMID: 26543895
PMCID: PMC4620246
DOI: 10.1155/2015/180749

Abstract

Competent data mining methods are vital to discover knowledge from databases which are built as a result of enormous growth of data. Various techniques of data mining are applied to obtain knowledge from these databases. Data clustering is one such descriptive data mining technique which guides in partitioning data objects into disjoint segments. K-means algorithm is a versatile algorithm among the various approaches used in data clustering. The algorithm and its diverse adaptation methods suffer certain problems in their performance. To overcome these issues a superlative algorithm has been proposed in this paper to perform data clustering. The specific feature of the proposed algorithm is discretizing the dataset, thereby improving the accuracy of clustering, and also adopting the binary search initialization method to generate cluster centroids. The generated centroids are fed as input to K-means approach which iteratively segments the data objects into respective clusters. The clustered results are measured for accuracy and validity. Experiments conducted by testing the approach on datasets from the UC Irvine Machine Learning Repository evidently show that the accuracy and validity measure is higher than the other two approaches, namely, simple K-means and Binary Search method. Thus, the proposed approach proves that discretization process will improve the efficacy of descriptive data mining tasks.

PubMed Disclaimer

Figures

**Figure 4**
Accuracy of proposed method with and without Phase I.

**Figure 5**
Validity of proposed method with and without Phase I.

**Figure 6**
Comparative analysis of the algorithms based on accuracy and DB index.

**Algorithm 1**
Steps in discretization.

**Algorithm 2**
Identifying initial centroids.

See this image and copyright information in PMC

References

1. Pujari A. K. Data Mining Techniques. Hyderabad, India: University Press; 2001.
1. Tan P., Steinbach M., Kumar V. Introduction to Data Mining. Pearson Addison-Wesley; 2006.
1. Larose D. T. Discovering Knowledge in Data—An Introduction to Data Mining. New York, NY, USA: John Wiley & Sons; 2005.
1. Hegland M. Data Mining—Challenges, Models, Methods and Algorithms. ANU Data Mining Group; 2003. (Draft).
1. Freitas A. A. Data Mining and Knowledge Discovery with Evolutionary Algorithms. Springer; 2002.

LinkOut - more resources

Full Text Sources
Other Literature Sources
- scite Smart Citations

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Convalescing Cluster Configuration Using a Superlative Framework

Affiliations

Convalescing Cluster Configuration Using a Superlative Framework

Authors

Affiliations

Abstract

Figures

References

LinkOut - more resources

Full Text Sources

Other Literature Sources