TY - GEN
T1 - A fast and stable incremental clustering algorithm
AU - Young, Steven
AU - Arel, Itamar
AU - Karnowski, Thomas P.
AU - Rose, Derek
PY - 2010
Y1 - 2010
N2 - Clustering is a pivotal building block in many data mining applications and in machine learning in general. Most clustering algorithms in the literature pertain to off-line (or batch) processing, in which the clustering process repeatedly sweeps through a set of data samples in an attempt to capture its underlying structure in a compact and efficient way. However, many recent applications require that the clustering algorithm be online, or incremental, in the that there is no a priori set of samples to process but rather samples are provided one iteration at a time. Accordingly, the clustering algorithm is expected to gradually improve its prototype (or centroid) constructs. Several problems emerge in this context, particularly relating to the stability of the process and its speed of convergence. In this paper, we present a fast and stable incremental clustering algorithm, which is computationally modest and imposes minimal memory requirements. Simulation results clearly demonstrate the advantages of the proposed framework in a variety of practical scenarios.
AB - Clustering is a pivotal building block in many data mining applications and in machine learning in general. Most clustering algorithms in the literature pertain to off-line (or batch) processing, in which the clustering process repeatedly sweeps through a set of data samples in an attempt to capture its underlying structure in a compact and efficient way. However, many recent applications require that the clustering algorithm be online, or incremental, in the that there is no a priori set of samples to process but rather samples are provided one iteration at a time. Accordingly, the clustering algorithm is expected to gradually improve its prototype (or centroid) constructs. Several problems emerge in this context, particularly relating to the stability of the process and its speed of convergence. In this paper, we present a fast and stable incremental clustering algorithm, which is computationally modest and imposes minimal memory requirements. Simulation results clearly demonstrate the advantages of the proposed framework in a variety of practical scenarios.
UR - http://www.scopus.com/inward/record.url?scp=77955287848&partnerID=8YFLogxK
U2 - 10.1109/ITNG.2010.148
DO - 10.1109/ITNG.2010.148
M3 - Conference contribution
AN - SCOPUS:77955287848
SN - 9780769539843
T3 - ITNG2010 - 7th International Conference on Information Technology: New Generations
SP - 204
EP - 209
BT - ITNG2010 - 7th International Conference on Information Technology
T2 - 7th International Conference on Information Technology - New Generations, ITNG 2010
Y2 - 12 April 2010 through 14 April 2010
ER -