A New Fuzzy Based Data Clustering Using EM Algorithm and Green IT
Clustering is a fundamental problem in several fields, including pattern recognition, machine learning and statistics. The basic data clustering problem can be defined as discovering groups in data, that is, grouping related objects together. A cluster is a group of objects that are similar to one another and dissimilar to the objects in other clusters. Similarity is typically computed from the distance between two objects or two clusters: two or more objects belong to the same cluster if and only if they are close to each other under that distance. The main objective of clustering is therefore to discover collections of comparable objects with respect to a similarity metric. The similarity metric, however, is generally specified by the user according to the requirements of the application, and so far no single technique fits all applications. Existing clustering approaches have two major difficulties: they do not address the full range of requirements effectively, and they incur high time complexity when clustering large, high-dimensional data sets. The efficiency of a clustering approach depends chiefly on how distance is defined, which means that the distance between two objects in a cluster must be well defined using an effective distance measure. It is also necessary to understand the effect of constraints on how objects are clustered. Using constraints together with effective distance measures is expected to yield better clustering results.
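As a minimal illustration of why the definition of distance matters, the sketch below assigns a few points to the nearer of two fixed cluster centers under two different distance measures. It is not the method proposed in this work; the function names, points and centers are hypothetical and chosen only so that the two measures disagree on one point.

```python
import math

def euclidean(a, b):
    """Straight-line (L2) distance between two points of equal dimension."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def manhattan(a, b):
    """City-block (L1) distance between two points of equal dimension."""
    return sum(abs(x - y) for x, y in zip(a, b))

def assign_to_nearest(points, centers, distance):
    """Return, for each point, the index of the closest center under `distance`."""
    return [
        min(range(len(centers)), key=lambda k: distance(p, centers[k]))
        for p in points
    ]

# Hypothetical toy data: three 2-D points and two fixed cluster centers.
points = [(0.0, 0.0), (4.0, 4.0), (6.0, 0.0)]
centers = [(3.0, 3.0), (5.0, 0.0)]

labels_euclid = assign_to_nearest(points, centers, euclidean)
labels_manhattan = assign_to_nearest(points, centers, manhattan)

print("Euclidean assignment:", labels_euclid)
print("Manhattan assignment:", labels_manhattan)
```

The point (0, 0) is closer to the first center under the Euclidean measure (about 4.24 versus 5) but closer to the second under the Manhattan measure (6 versus 5), so the same data yields different groupings depending on the distance chosen.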
This work is licensed under a Creative Commons Attribution 3.0 License.