
Get Better Accuracy and Quality of Clustering Using Variation of K Means

Maikal Rana, Amit Chauhan

Abstract


Cluster analysis is a challenging task with a number of associated issues, e.g. accuracy, quality, efficiency, finding clusters of different shape, size and density, and handling noise and outliers. K-means clustering algorithms are widely used in many practical applications. The original K-means algorithm selects its initial centroids randomly, which affects the accuracy and quality of the resulting clusters, and it sometimes generates poor or empty clusters, which are meaningless. Our approach to the K-means algorithm eliminates this deficiency of the existing K-means algorithm, improving accuracy and generating high-quality clusters by reducing the mean square error.
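The dependence on random initial centroids can be illustrated with a minimal sketch of the baseline K-means procedure. This is not the variation proposed in the paper, whose details are not given in this abstract. The function names (`kmeans`, `mean_square_error`), the synthetic data, and the handling of empty clusters (keeping the previous centroid) are assumptions made for illustration only.

```python
import random


def sq_dist(p, q):
    """Squared Euclidean distance between two points given as tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def assign(points, centroids):
    """Group each point with its nearest centroid (ties go to the lowest index)."""
    clusters = [[] for _ in centroids]
    for p in points:
        nearest = min(range(len(centroids)), key=lambda j: sq_dist(p, centroids[j]))
        clusters[nearest].append(p)
    return clusters


def centroid_of(cluster):
    """Coordinate-wise mean of a non-empty cluster."""
    return tuple(sum(coord) / len(cluster) for coord in zip(*cluster))


def kmeans(points, k, seed, max_iters=100):
    """Baseline K-means with randomly selected initial centroids."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)  # random initialization
    for _ in range(max_iters):
        clusters = assign(points, centroids)
        # Assumption: an empty cluster keeps its previous centroid.
        updated = [
            centroid_of(c) if c else centroids[j] for j, c in enumerate(clusters)
        ]
        if updated == centroids:  # converged
            break
        centroids = updated
    return centroids


def mean_square_error(points, centroids):
    """Mean squared distance from each point to its nearest centroid."""
    return sum(min(sq_dist(p, c) for c in centroids) for p in points) / len(points)


if __name__ == "__main__":
    # Hypothetical data: four Gaussian blobs around hand-picked centers.
    data_rng = random.Random(0)
    centers = [(0, 0), (5, 5), (0, 5), (5, 0)]
    points = [
        (cx + data_rng.gauss(0, 0.5), cy + data_rng.gauss(0, 0.5))
        for cx, cy in centers
        for _ in range(30)
    ]
    # Only the seed for the initial centroids changes between runs.
    for seed in range(5):
        error = mean_square_error(points, kmeans(points, 4, seed))
        print(f"seed={seed} mse={error:.4f}")
```

Running the loop over several seeds and comparing the reported mean square error gives a simple way to see whether different random initializations settle on different clusterings for a given data set.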


Keywords


Centroids, Cluster Analysis, K-Means, Mean Square Error






This work is licensed under a Creative Commons Attribution 3.0 License.