Penerapan CRISP-DM Model dengan Algoritma Decision Tree, Naive Bayes, SVM, dan KNN untuk Klasifikasi Curah Hujan di Kabupaten Malang

Penulis

  • Arizki Dwi Cahyo Politeknik Statistika STIS image/svg+xml
  • Arie Wahyu Wijayanto

DOI:

https://doi.org/10.30872/eksponensial.v17i1.1649

Kata Kunci:

CRISP-DM, rainfall, Decision Tree, Naive Bayes, SVM, KNN

Abstrak

Rainfall refers to the amount of rain that falls over a specific period and is measured on a flat surface that does not absorb or channel the water. Rainfall is classified into six categories: cloudy, light rain, moderate rain, heavy rain, very heavy rain, and extreme rain. The objective of this study is to compare several classification methods in data mining, namely Decision Tree, Naïve Bayes, Support Vector Machine (SVM), and K-Nearest Neighbors (KNN) to determine the method with the highest accuracy for classifying rainfall in Malang Regency. To achieve this objective, the procedure used refers to CRISP-DM, which is a methodology for planning data mining projects. The data used in this study were obtained from NASA and consist of daily records collected from June 1, 2019, to June 1, 2025. The total number of observations in this study is 2,193. Data mining classification was employed to identify patterns and similarities in rainfall characteristics in the area. Based on the analysis results, the KNN method demonstrated the best performance with an accuracy of 87.3% and is therefore recommended as the most effective method for rainfall classification in Malang Regency.

Unduhan

Data unduhan tidak tersedia.

Referensi

Aris Gunadi, I. G., & Kusuma Dewi, A. A. (2018). Klasifikasi Curah Hujan di Provinsi Bali Berdasarkan Metode Naïve Bayesian. Wahana Matematika Dan Sains: Jurnal Matematika, Sains, Dan Pembelajarannya, 12(1), 14–25. https://doi.org/10.23887/wms.v12i1.13843

Aris Gunadi, I. G., Oka Gunawan, I. M. A., Hary Candana, P. E. W., Widyantari Arnawa, I. A., & Edo Kharisma Putra, K. A. (2022). Klasifikasi Curah Hujan Harian Menggunakan Learning Vector Quantization (Studi Kasus: Stasiun Pengamatan Ngurah Rai). Jurnal Ilmu Komputer Indonesia (JIK), 7(2), 1–7. https://doi.org/10.23887/jik.v7i2.4060

Basuki, A., & Syarif, I. (2003). Decision Tree. Politeknik Electronika Negeri.

BMKG. (2020). Buletin Meteorologi Edisi Juli 2020. Badan Meteorologi, Klimatologi, dan Geofisika.

BMKG. (2025). Catatan Iklim dan Kualitas Udara Indonesia 2024. Badan Meteorologi, Klimatologi, dan Geofisika.

Brasjö, C., & Lindovsky, M. (2019). Machine Learning Project Management: A Study of Project Requirements and Processes in Early Adoption.

Damiri, B. A., Ramadhan, W. N., & Supriyanti, K. R. (2024). Pengelompokan Faktor-Faktor yang Mempengaruhi Curah Hujan di Provinsi Sumatera Utara Menggunakan Metode Fuzzy C-Means. Jurnal Statistika Dan Komputasi, 3(1), 1–10. https://doi.org/10.32665/statkom.v3i1.2623

DataIndonesia. (2025). Data wilayah dengan curah hujan ekstrem harian tertinggi di Indonesia pada 2024. Bersumber dari BMKG. Diakses pada 29 Maret 2026, dari https://dataindonesia.id/varia/detail/data-wilayah-dengan-curah-hujan-ekstrem-harian-tertinggi-di-indonesia-pada-2024

Harahap, W. N., Yuniasih, B., & Gunawan, S. (2023). Dampak La Nina 2021-2022 terhadap Peningkatan Curah Hujan. AGROISTA : Jurnal Agroteknologi, 7(1), 26–32. https://doi.org/10.55180/agi.v7i1.364

Hasanah, M. A., Soim, S., & Handayani, A. S. (2021). Implementasi CRISP-DM Model Menggunakan Metode Decision Tree dengan Algoritma CART untuk Prediksi Curah Hujan Berpotensi Banjir. Journal of Applied Informatics and Computing (JAIC), 5(2), 103–108. https://doi.org/10.30871/jaic.v5i2.3200

Indra Pratama, A. R., Latipah, S. A., & Sari, B. N. (2022). Optimasi Klasifikasi Curah Hujan Menggunakan Support Vector Machine (SVM) dan Recursive Feature Elimination (RFE). Jurnal Ilmiah Penelitian Dan Pembelajaran Informatika, 7(2), 314–324. https://doi.org/10.29100/jipi.v7i2.2675

Kataria, A., & Singh, M. D. (2013). A Review of Data Classification Using K-Nearest Neighbour Algorithm. International Journal of Emerging Technology and Advanced Engineering, 3(6), 354–360.

Nabila, M. P., Tanjung, M. R., Abelia, & Usiono. (2024). Waspada! Curah Hujan yang Cukup Tinggi: Sumatera Utara. Jurnal Media Akademik (JMA), 2(12), 1–11.

Pebdika, A., Herdiana, R., & Solihudin, D. (2023). Klasifikasi Menggunakan Metode Naïve Bayes untuk Menentukan Calon Penerima PIP. Jurnal Mahasiswa Teknik Informatika, 7(1), 452–458.

Plotnikova, V., Dumas, M., & Milani, F. (2020). Adaptations of data mining methodologies: A systematic literature review. PeerJ Computer Science, 6, 1–43. https://doi.org/10.7717/PEERJ-CS.267

Ramadhan, M. A., Anggraeny, F. T., & Putra, C. A. (2024). Klasifikasi Curah Hujan Harian Menggunakan Metode K-Nearest Neighbor. Jurnal Mahasiswa Teknik Informatika, 8(3), 3863–3869. https://doi.org/10.36040/jati.v8i3.9817

Susilowati, & Kusumastuti, D. I. (2023). Analisa Karakteristik Curah Hujan dan Kurva Intensitas Durasi Frekuensi (IDF) di Provinsi Lampung. Jurnal Rekayasa Teknik Sipil Universitas Lampung, 14(1), 47–56.

Sutedjo, M. M., & Kartasapoetra, A. G. (2002). Pengantar Ilmu Tanah. Bina Aksara.

Ullah, R., Khan, A. H., & Emaduddin, S. M. (2019). ck-NN: A Clustered k-Nearest Neighbours Approach for Large-Scale Classification. Advances in Distributed Computing and Artificial Intelligence Journal, 8(3), 67–77. https://doi.org/10.14201/ADCAIJ2019836777

Wu, X., Kumar, V., Quinlan, J. R., Ghosh, J., Yang, Q., Motoda, H., McLachlan, G. J., Ng, A., Liu, B., Yu, P. S., Zhou, Z. H., Steinbach, M., Hand, D. J., & Steinberg, D. (2008). Top 10 algorithms in data mining. Knowledge and Information Systems, 14(1), 1–37. https://doi.org/10.1007/s10115-007-0114-2

Yatimah, N., Kumalawati, R., & Muhtar, G. A. (2024). Analisis Kerentanan Bencana Banjir Berdasarkan Data Curah Hujan Kota Samarinda. Jurnal Multidisiplin Raflesia, 3(1), 28–32.

Zhao, Y., & Cen, Y. (2014). Data Mining Applications with R. USA : Academic Press.

Diterbitkan

2026-04-28