Clustering large data with uncertainty

Ghosh, Sampreeti ; Mitra, Sushmita (2013) Clustering large data with uncertainty Applied Soft Computing, 13 (4). pp. 1639-1645. ISSN 1568-4946

Full text not available from this repository.

Official URL: https://doi.org/10.1016/j.asoc.2012.12.036

Related URL: http://dx.doi.org/10.1016/j.asoc.2012.12.036

Abstract

A new algorithm is designed for handling fuzziness while mining large data. A new novel cost function weighted by fuzzy membership, is proposed in the framework of CLARANS. A new scalable approximation to the maximum number of neighbors, explored at each node, is developed; thus reducing the computational time for large data while eliminating the need for user-defined (heuristic) parameters in the existing equation. The goodness of the generated clusters is evaluated in terms of Xie–Beni validity index. Results demonstrate the superiority of the proposed algorithm, over both synthetic and real data sets, in terms of goodness of clustering. It is interesting to note that our algorithm always converges to the globally best values at the optimal number of partitions. Moreover compared to existing fuzzy algorithms, FCLARANS without scanning the whole dataset, searching small number of neighbors, is able to handle the uncertainty due to overlapping nature of the various partitions. This is the main motivation of fuzzification of the algorithm CLARANS.

Item Type:Article
Source:Copyright of this article belongs to Elsevier Science.
ID Code:140178
Deposited On:07 Sep 2025 06:23
Last Modified:07 Sep 2025 06:23

Repository Staff Only: item control page