Optimizing K-Means Clustering with Principal Component Analysis for Grouping Districts/Cities on Kalimantan Island Based on Open Unemployment Rate Indicators

  • Muhammad Rais, Laboratorium Statistika Komputasi FMIPA Universitas Mulawarman
  • Rito Goejantoro, Laboratorium Statistika Komputasi FMIPA Universitas Mulawarman
  • Surya Prangga, Laboratorium Statistika Komputasi FMIPA Universitas Mulawarman

Abstract

Data mining, also called knowledge discovery in databases, is the process of collecting and using historical data to find regularities, patterns, or relationships in large data sets, yielding useful new information. Cluster analysis aims to group data based on similarity. This research uses the K-Means method combined with principal component analysis (PCA). The K-Means method partitions data into one or more clusters whose members share similar characteristics, while PCA is used to reduce the number of research variables. This grouping method was applied to unemployment rate indicator data for the districts/cities of Kalimantan Island in 2018. Cluster validation was carried out with the Davies-Bouldin Index (DBI). The analysis yielded 2 principal components. The optimal grouping of districts/cities in Kalimantan Island in 2018 used 2 clusters, with a DBI value of 0.507: cluster 1 consists of 51 districts/cities and cluster 2 consists of 5 districts/cities. Cluster 1 has a higher percentage of poor population and a higher labor force participation rate than cluster 2, while cluster 2 has a higher human development index, population, labor force size, number of unemployed, population density, and district/city minimum wage than cluster 1.
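
The role of the Davies-Bouldin Index in choosing the number of clusters can be illustrated with a minimal sketch. It is not the authors' implementation: the eight 2-D points are made-up toy data standing in for districts already projected onto 2 principal components (the PCA step itself is omitted), the function names `centroid`, `kmeans`, and `davies_bouldin` are hypothetical, and the initial centroids are fixed by hand for reproducibility. The DBI averages, over clusters, the worst-case ratio of within-cluster scatter to between-centroid distance, so a lower value indicates a better grouping.

```python
import math

def centroid(points):
    """Coordinate-wise mean of a list of equal-length tuples."""
    n = len(points)
    return tuple(sum(p[d] for p in points) / n for d in range(len(points[0])))

def kmeans(points, init_centroids, max_iter=100):
    """Plain Lloyd's algorithm; returns (labels, centroids)."""
    centroids = list(init_centroids)
    labels = None
    for _ in range(max_iter):
        # Assignment step: each point goes to its nearest centroid.
        new_labels = [
            min(range(len(centroids)), key=lambda j: math.dist(p, centroids[j]))
            for p in points
        ]
        if new_labels == labels:  # assignments stable, so stop
            break
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for j in range(len(centroids)):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                centroids[j] = centroid(members)
    return labels, centroids

def davies_bouldin(points, labels, centroids):
    """Davies-Bouldin Index; assumes every cluster has at least one member."""
    k = len(centroids)
    # S_i: mean distance from the members of cluster i to its centroid.
    scatter = []
    for j in range(k):
        members = [p for p, lab in zip(points, labels) if lab == j]
        scatter.append(sum(math.dist(p, centroids[j]) for p in members) / len(members))
    # For each cluster, take the worst (largest) similarity ratio to any other cluster.
    total = 0.0
    for i in range(k):
        total += max(
            (scatter[i] + scatter[j]) / math.dist(centroids[i], centroids[j])
            for j in range(k) if j != i
        )
    return total / k

# Toy data (assumption): two well-separated groups in a 2-component space.
points = [(0, 0), (0, 1), (1, 0), (1, 1), (10, 10), (10, 11), (11, 10), (11, 11)]

# Hand-picked starting centroids for each candidate number of clusters.
inits = {2: [points[0], points[4]], 3: [points[0], points[4], points[7]]}

scores = {}
for k, init in inits.items():
    labels, cents = kmeans(points, init)
    scores[k] = davies_bouldin(points, labels, cents)

best_k = min(scores, key=scores.get)  # lowest DBI wins
print(scores, best_k)
```

On real data the candidate values of k would typically be compared across several random initializations, and the input would be the principal component scores rather than hand-written points.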

Published
2021-12-30
How to Cite
RAIS, Muhammad; GOEJANTORO, Rito; PRANGGA, Surya. Optimalisasi K-Means Cluster dengan Principal Component Analysis pada Pengelompokan Kabupaten/Kota di Pulau Kalimantan Berdasarkan Indikator Tingkat Pengangguran Terbuka. EKSPONENSIAL, [S.l.], v. 12, n. 2, p. 129-136, dec. 2021. ISSN 2798-3455. Available at: <https://jurnal.fmipa.unmul.ac.id/index.php/exponensial/article/view/805>. Date accessed: 10 may 2024. doi: https://doi.org/10.30872/eksponensial.v12i2.805.
Section
Articles