Clustering Performance Comparison in K-Mean Clustering Variations: A Fraud Detection Study

Authors

  • Subhorn Khonthapagdee Srinakharinwirot university
  • Nattaporn Chuenjarern King Mongkut's Institute of Technology Ladkrabang

Keywords:

Clustering, K-means, Global k-means, Fast Global k-means, BankSim Dataset

Abstract

K-means clustering is a common clustering approach that is based on data partitioning. However, the k-means clustering has significant drawbacks, such as it is sensitive to deciding the initial condition. Several ways to improve the algorithm have been offered. To assess the algorithm's efficiency and correctness, the performance comparison should be evaluated. In this paper, several k-means algorithms, including random k-means, global k-means, and fast global k-means, were evaluated for their efficiency when applied to a fraud detection data set. The accuracy of each method and the Davies-Bouldin index was investigated for each algorithm to compare the clustering performance. The findings demonstrated that when a small number of groups was used, random k-means, global k-means, and fast global k-means gave similar clustering, but fast global k-means offered better errors when a big number of groups was used. Furthermore, global k-means took longer to execute than others.

Downloads

Download data is not yet available.

Downloads

Published

2022-12-30

How to Cite

Khonthapagdee, S., & Chuenjarern, N. (2022). Clustering Performance Comparison in K-Mean Clustering Variations: A Fraud Detection Study. Science Essence Journal, 38(2), 15–25. Retrieved from https://ejournals.swu.ac.th/index.php/sej/article/view/14626