Piotr Kulczycki, Małgorzata Charytanowicz
Complete Gradient Clustering Algorithm Formed with Kernel Estimators
Applied Mathematics and Computer Science, vol. 20, no. 1, 2010, to appear.

Abstract:
The aim of this paper is to provide a gradient clustering algorithm in its complete form, suitable for direct use without requiring a deeper statistical knowledge. The values of all parameters are effectively calculated using optimizing procedures. Moreover, an illustrative analysis of the meaning of particular parameters is shown, followed by the effects resulting from possible modifications with respect to their primarily assigned optimal values. The proposed algorithm does not demand strict assumptions regarding the desired number of clusters, which allows the number obtained to be better suited to a real data structure. Moreover, a feature specific to it is the possibility to influence the proportion between the number of clusters in areas where data elements are dense as opposed to their sparse regions. And finally, the algorithm - by the detection of one-element clusters - allows the identification of atypical elements, which enables their elimination or designation to more numerous clusters, thus increasing the homogeneity of the data set.

Key words:
data analysis and mining, clustering, gradient procedures, nonparametric statistical methods, kernel estimators, numerical calculations.

[DRAFT PDF]