This site uses cookies. By continuing to use this site you agree to our use of cookies. To find out more, see our Privacy and Cookies policy.
Paper The following article is Open access

Comparative Analysis on Dimension Reduction Algorithm of Principal Component Analysis and Singular Value Decomposition for Clustering

, , , , and

Published under licence by IOP Publishing Ltd
, , Citation Elly Muningsih et al 2020 J. Phys.: Conf. Ser. 1641 012101 DOI 10.1088/1742-6596/1641/1/012101

1742-6596/1641/1/012101

Abstract

Clustering is a method of dividing datasets into several groups that have similarity or the same characteristics. High-dimensional Datasets will influence the effectiveness of the grouping process. This study compares two dimension reduction algorithms, namely Principal Component Analysis (PCA) and Singular Value Decomposition (SVD) using K-Means clustering method to find out the best algorithm with the smallest Bouldin Davies Index evaluation. The dataset of this study involved public data from UCIMachine Learning which contains the number of weekly sales of a product. Data processing is performed by comparing the number of clusters from 3 to 10 and the dimension reduction from 2 to 10. From the data processing the RapidMiner tools, application with dimension reduction can provide better results than without dimension reduction. In particular, the PCA algorithm shows better results than the SVD, with which the best number of clusters is 5, and the number of dimensional reductions is 3 with a Bouldin Index of 0.376.

Export citation and abstract BibTeX RIS

Content from this work may be used under the terms of the Creative Commons Attribution 3.0 licence. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.

Please wait… references are loading.
10.1088/1742-6596/1641/1/012101