true pt
A Brief Overview of Prototype Based Clustering Techniques
Olfa Nasraoui
Department of Computer Engineering
& Computer Science
University of Louisville,
olfa.nasraoui_AT_louisville.edu
Clustering aims at classifying the unlabeled points in a data set into different groups or clusters, such that members of the same cluster are as similar as possible, while members of different clusters are as dissimilar as possible. Because there is no a priori knowledge about the class labels, clustering is also called unsupervised classification. Several approaches to clustering exist, and probably differ because they have originated in different domains of artificial intelligence and statistics. For example, graph-theoretic and tree-based techniques are popular in the machine learning community; while objective function-driven or prototype-based clustering methods such as the
Means and Gaussian Mixture modeling have long been used in statistical pattern recongnition. We concentrate on prototype based clustering methods because they lend themselves more easily to robustifying efforts based on robust statistics. Most prototype based clustering methods are based on the
Means and its fuzzy counterpart, the Fuzzy
Means (FCM) [Bez81] algorithms.
Let
be a set of feature vectors in an
dimensional feature space with coordinate axis labels
, where
. Let
represent a
-tuple of prototypes each of which characterizes one of the
clusters. Each
consists of a set of parameters. The
Means has the following objective
 |
(1) |
where
represents the distance from a feature point
to the prototype
, and
, the
cluster, is given by
 |
(2) |
The optimal prototype parameters
of the
cluster are derived by setting
.
For instance if
is the squared Euclidean distance
, then the center
is given by
 |
(3) |
The
Means algorithm consists of alternating updates of the centers using (3) and the partition using (2), until convergence or for a maximum number of iterations.
It is known that for complex data sets containing overlapping clusters, fuzzy partitions model the data better than
their crisp counterparts. In particular, fuzzy memberships are richer than crisp memberships in describing the degrees of belongingness of data points lying in the areas of overlap. Moreover, fuzzy partitions generally make the optimization process less prone to local or sub-optimal solutions. With a fuzzy partition, a data point
belongs to each cluster,
, to a varying degree called fuzzy membership
. A fuzzy partiton, usually represented by the
matrix
is called a constrained fuzzy
partition of
if the entries of
satisfy the following constraints [Bez81],
The Fuzzy
Means (FCM) [Rus69,Dun74,Bez81] algorithm uses the following criterion:
 |
(4) |
The optimal prototype parameters
of the
cluster are derived by setting
.
For instance if
is the squared Euclidean distance, then the center
is given by
 |
(5) |
The optimal FCM memberships for (4) can be shown to be [Bez81]
 |
(6) |
Therefore, the optimization process consists of alternating updates of the memberships, as given by (6), and the cluster centers as given by (5). The fuzzy memberships allow each data point to belong to all clusters to a varying degree of membership. Hence, fuzzy partitions involve less commitment than their hard or crisp counterparts in every step of the optimization process. This results in a lower sensitivity to initialization. By changing the distance measure in the objective function of the FCM, it can be generalized to seek clusters of various shapes such as lines, curves, planar and quadric surfaces [GK79,KFN95]
This work is supported by the National Science Foundation (CAREER Award IIS-0133948 to O. Nasraoui).
Partial support of earlier stages of this work by the Office of Naval Research grant
(N000014-96-1-0439) and by the National Science Foundation Grant
IIS 9800899 is also gratefully acknowledged.
- Bez81
-
J. C. Bezdek.
Pattern Recognition with Fuzzy Objective Function Algorithms.
Plenum Press, New York, 1981.
- Dun74
-
J. C. Dunn.
A fuzzy relative of the isodata process and its use in detecting
compact, well separated clusters.
J. Cybernetics, 3:32-57, 1974.
- GK79
-
E. E. Gustafson and W. C. Kessel.
Fuzzy clustering with a fuzzy covariance matrix.
In IEEE CDC, pages 761-766, San Diego, California, 1979.
- KFN95
-
R. Krishnapuram, H. Frigui, and O. Nasraoui.
Fuzzy and possibilistic shell clustering algorithms and their
application to boundary detection and surface approximation-parts i and ii.
IEEE Trans. Fuzzy Systems, 3(1):29-60, 1995.
- Rus69
-
E. Ruspini.
A new approach to clustering.
Information Control, 15:22-32, 1969.
A Brief Overview of Prototype Based Clustering Techniques
This document was generated using the
LaTeX2HTML translator Version 2002 (1.62)
Copyright © 1993, 1994, 1995, 1996,
Nikos Drakos,
Computer Based Learning Unit, University of Leeds.
Copyright © 1997, 1998, 1999,
Ross Moore,
Mathematics Department, Macquarie University, Sydney.
The command line arguments were:
latex2html -split 0 -image_type gif PrototypeBasedClustering.tex