Perbandingan Metode Imputasi Data Hilang untuk Peramalan Curah Hujan dengan ARIMA di Palangka Raya

Authors

DOI:

https://doi.org/10.30872/eksponensial.v17i1.1611

Keywords:

ARIMA, Rainfall, Missing Data Imputation, Palangka Raya, Forecasting

Abstract

Rainfall is one of the fundamental factors influencing various aspects of a region, including the city of Palangka Raya. Therefore, statistical tools are needed to forecast rainfall for mitigation and planning related to its effects. The objective of this study is to evaluate the performance of the ARIMA model in forecasting rainfall and to analyze the impact of three methods of missing data imputation on forecast accuracy. Therefore, this study proposes an evaluation of the autoregressive integrated moving average (ARIMA) model's performance for rainfall forecasting and the handling of missing data through three imputation methods: mean imputation, forward filling, and backward filling. The data used in this study consists of historical rainfall data from 2013 to 2022. The forecasting results indicate that rainfall data modelled using ARIMA and treated for missing data via mean imputation demonstrated the best forecasting accuracy in the validation test, as rainfall data in Palangka Raya City is fluctuating without a consistent seasonal pattern. Thus, the historical average value is more representative than the preceding or subsequent values, which could reinforce local bias. This is demonstrated by the smallest RMSE, MAE, and MAPE values, which are 124,17%, 103,57%, and 40,32%, respectively.

Downloads

Download data is not yet available.

References

Alabdulrazzaq, H., Alenezi, M. N., Rawajfih, Y., Alghannam, B. A., Al-Hassan, A. A., & Al-Anzi, F. S. (2021). On the accuracy of ARIMA based prediction of COVID-19 spread. Results in Physics, 27, 104509.

AlSalehy, A. S., & Bailey, M. (2025). Improving Time Series Data Quality: Identifying Outliers and Handling Missing Values in a Multilocation Gas and Weather Dataset. Smart Cities, 8(3), 82.

Appaia, L., & Palraj, S. (2023). On replacement of outliers and missing values in time series. EQA-International Journal of Environmental Quality, 53, 1–10.

Blasco, B. C., Moreno, J. J. M., Pol, A. P., & Abad, A. S. (2013). Using the R-MAPE index as a resistant measure of forecast accuracy. Psicothema, 25(4), 500–506.

Enders, C. K. (2022). Applied missing data analysis. Guilford Publications.

Hodson, T. O. (2022). Root mean square error (RMSE) or mean absolute error (MAE): When to use them or not. Geoscientific Model Development Discussions, 2022, 1–10.

Khikmah, K. N. (2024). Kajian Kinerja Multiple Long Short-Term Memory untuk Analisis Data Deret Waktu Takstasioner [Master Thesis]. IPB University.

Khikmah, K. N., Sadik, K., & Indahwati, I. (2023). Transfer Function And Arima Model For Forecasting Bi Rate In Indonesia. BAREKENG: Jurnal Ilmu Matematika Dan Terapan, 17(3), 1359–1366.

Khikmah, K. N., Sadik, K., & Notodiputro, K. A. (2025). Simulation and Empirical Studies of Long Short-Term Memory Performance to Deal with Limited Data. Jurnal Online Informatika, 10(1), 216–226.

Kobiela, D., Krefta, D., Król, W., & Weichbroth, P. (2022). ARIMA vs LSTM on NASDAQ stock exchange data. Procedia Computer Science, 207, 3836–3845.

Ospina, R., Gondim, J. A. M., Leiva, V., & Castro, C. (2023). An overview of forecast analysis with ARIMA models during the COVID-19 pandemic: Methodology and case study in Brazil. Mathematics, 11(14), 3069.

Risnayah, S. (2023). Penerapan Imputasi Locf Dan Cross Mean Dalam Pengisian Data Kosong Pada Curah Hujan Harian Arg. Megasains, 14(2), 23–31.

Tandon, H., Ranjan, P., Chakraborty, T., & Suhag, V. (2022). Coronavirus (COVID-19): ARIMA-based Time-series Analysis to Forecast near Future and the Effect of School Reopening in India. Journal of Health Management, 09720634221109087.

Wang, X., Zhang, H., Wang, P., Zhang, Y., Wang, B., Zhou, Z., & Wang, Y. (2023). An observed value consistent diffusion model for imputing missing values in multivariate time series. Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2409–2418.

Zhang, Y., & Thorburn, P. J. (2022). Handling missing data in near real-time environmental monitoring: A system and a review of selected methods. Future Generation Computer Systems, 128, 63–72.

Published

2026-04-30