Proposed Modification of K-Means Clustering Algorithm with Distance Calculation Based on Correlation

Muhammad Ibnu Choldun Rachmatullah

doi:10.26555/jiteki.v8i1.23696

Authors

Muhammad Ibnu Choldun Rachmatullah, Politeknik Pos Indonesia

DOI:

https://doi.org/10.26555/jiteki.v8i1.23696

Keywords:

Clustering, K-Means, Distance calculation, Correlation, Accuracy

Abstract

Clustering is a data mining technique that partitions a dataset into groups (clusters) of similar items. Clustering methods fall broadly into two families: hierarchical and partitional. K-Means is one of the most commonly used partitional methods and has been applied in many fields for many purposes. Much research has sought to improve the performance of K-Means, for example by modifying how the initial centroids are chosen or how the appropriate number of clusters is determined. This research instead modifies the distance calculation in the K-Means algorithm so that it takes the correlation between attributes into account. Attributes with a high correlation value are assumed to have similar characteristics and therefore to influence which cluster a data point falls into. The proposed method has four steps: calculating the correlation between attributes, determining the cluster centroids, calculating distances with the correlation values taken into account, and assigning each data point to a cluster. The first contribution of this research is a new correlation-based distance calculation for the K-Means algorithm; the second is the application of the proposed algorithm to a specific dataset, the Iris dataset. The performance of the modified algorithm was also evaluated. In experiments on the Iris dataset, the proposed method converged in 6 iterations, compared with 8 for the original K-Means method, and so required less processing time. It also achieved higher accuracy: 89.33%, compared with 82.67% for the original K-Means method.
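The four steps listed in the abstract can be sketched in code. The abstract does not give the exact distance formula, so the sketch below rests on an assumption that is not taken from the paper: each attribute is weighted by its mean absolute Pearson correlation with the other attributes, and those weights enter a weighted Euclidean distance. The function names (`pearson`, `correlation_weights`, `weighted_distance`, `kmeans`) and the toy data are hypothetical and serve only as illustration.

```python
import math
from statistics import mean


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy) if sx and sy else 0.0


def correlation_weights(data):
    """Step 1. ASSUMPTION (not the paper's formula): weight each attribute by
    its mean absolute correlation with the other attributes, normalised so
    that the weights sum to 1."""
    cols = list(zip(*data))
    n = len(cols)
    if n < 2:
        return [1.0] * n
    raw = [
        mean(abs(pearson(cols[i], cols[j])) for j in range(n) if j != i)
        for i in range(n)
    ]
    total = sum(raw)
    return [r / total for r in raw] if total else [1.0 / n] * n


def weighted_distance(p, q, w):
    """Step 3. Euclidean distance with one weight per attribute."""
    return math.sqrt(sum(wi * (a - b) ** 2 for wi, a, b in zip(w, p, q)))


def kmeans(data, centroids, weights, max_iter=100):
    """Steps 2 and 4. Lloyd-style loop; returns (labels, centroids, iterations).
    The iteration count includes the final pass that detects no change."""
    centroids = [list(c) for c in centroids]
    labels = None
    for iteration in range(1, max_iter + 1):
        # Assign every point to its nearest centroid under the weighted distance.
        new_labels = [
            min(range(len(centroids)),
                key=lambda k: weighted_distance(p, centroids[k], weights))
            for p in data
        ]
        if new_labels == labels:
            return labels, centroids, iteration
        labels = new_labels
        # Recompute each centroid as the mean of its members (empty clusters keep theirs).
        for k in range(len(centroids)):
            members = [p for p, lab in zip(data, labels) if lab == k]
            if members:
                centroids[k] = [mean(col) for col in zip(*members)]
    return labels, centroids, max_iter


# Toy data (hypothetical, not the Iris dataset): two well-separated groups.
data = [(1.0, 1.1, 5.0), (1.2, 0.9, 4.8), (0.8, 1.0, 5.2),
        (5.0, 5.1, 1.0), (5.2, 4.9, 1.2), (4.8, 5.0, 0.8)]
w = correlation_weights(data)
labels, centroids, iters = kmeans(data, centroids=[data[0], data[3]], weights=w)
print(labels, iters)
```

Passing equal weights for every attribute yields the same cluster assignments as the unweighted Euclidean distance, which gives a baseline for comparing iteration counts in the way the abstract describes.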

Published

2022-05-16

How to Cite

[1]

M. I. C. Rachmatullah, “Proposed Modification of K-Means Clustering Algorithm with Distance Calculation Based on Correlation”, J. Ilm. Tek. Elektro Komput. Dan Inform, vol. 8, no. 1, pp. 136–143, May 2022.

Issue

Vol. 8 No. 1 (2022): March

Section

Articles

License

Authors who publish with JITEKI agree to the following terms:

Authors retain copyright and grant the journal the right of first publication with the work simultaneously licensed under a Creative Commons Attribution License (CC BY-SA 4.0) that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work.

This work is licensed under a Creative Commons Attribution 4.0 International License
