Optimization of K-Mode Algorithm for Data Mining Using Particle Swarm Optimization

K-mode is a popular data mining algorithm because it handles categorical data effectively. Its weakness lies in the choice of initial cluster centers, which usually affects its clustering results. This research proposes a novel PSO K-mode algorithm, called PSOKM, that uses particle swarm optimization (PSO) to improve the performance of the K-mode clustering algorithm. A fitness function was defined based on the structure of the K-mode algorithm and a set of weights, and the cluster centroids were optimized using PSO. The initial cost for the PSO was taken from K-mode, the weights were picked at random, and two centroids from each class were randomly picked. The research used a University of California Irvine (UCI) data set and crime data to evaluate the performance of PSOKM against conventional K-mode using metrics such as accuracy, time, sensitivity, specificity and the ROC curve. The evaluation results reveal that PSOKM improved the accuracy of K mode algorit...
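To illustrate why the choice of initial cluster centers matters, the sketch below computes the standard K-mode clustering cost (simple matching dissimilarity summed over each record's nearest mode) for two different initializations. This is an assumption made for illustration: the abstract does not give the exact PSOKM fitness function or how its weights enter, so the unweighted cost here stands in for the kind of quantity a PSO search could minimize. The toy records and the names `matching_dissimilarity` and `kmode_cost` are hypothetical.

```python
def matching_dissimilarity(a, b):
    """Count the attributes on which two categorical records differ."""
    return sum(x != y for x, y in zip(a, b))


def kmode_cost(records, modes):
    """Sum, over all records, of the dissimilarity to the nearest mode."""
    return sum(
        min(matching_dissimilarity(r, m) for m in modes)
        for r in records
    )


# Hypothetical toy data: (colour, size, shape).
records = [
    ("red", "small", "round"),
    ("red", "small", "oval"),
    ("red", "large", "round"),
    ("blue", "large", "square"),
    ("blue", "large", "oval"),
    ("blue", "small", "square"),
]

# Two candidate sets of initial modes (k = 2).
spread_init = [records[0], records[3]]   # one mode from each visible group
crowded_init = [records[0], records[1]]  # both modes from the same group

spread_cost = kmode_cost(records, spread_init)
crowded_cost = kmode_cost(records, crowded_init)
print(spread_cost, crowded_cost)  # prints: 4 8
```

The two initializations give different starting costs on the same data, which is the sensitivity the abstract describes. A swarm-based search would evaluate many candidate mode sets with a cost of this kind and keep the best one found.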