A Machine Learning-Based Approach for Retail Demand Forecasting: The Impact of Spending Score and Algorithm Optimization

Authors

  • Ni Putu Erica Puspita Adriani Adriani ITB STIKOM BALI
  • Roy Rudolf Huizen ITB STIKOM BALI
  • Dadang Hermawan ITB STIKOM BALI

DOI:

https://doi.org/10.26555/jiteki.v11i2.30630

Keywords:

Machine Learning, Random Forest, Decision Tree, Support Vector Machine, Spending Score

Abstract

Demand forecasting in the retail industry remains a critical challenge, with inaccurate predictions leading to substantial inventory inefficiencies, financial losses, and reduced customer satisfaction. Traditional forecasting methods, primarily reliant on historical sales data, often lack the capacity to effectively model the complexities of dynamic consumer behavior and rapid market fluctuations. To address this, this study proposes a refined demand forecasting approach through the introduction of the Spending Score, a novel synthetic feature that synthesizes customer purchase frequency and total spending to augment predictive accuracy. We implement and optimize machine learning algorithms, specifically Random Forest, Decision Tree, and Support Vector Machine (SVM), using rigorous hyperparameter tuning techniques to determine the most effective model for retail demand prediction. Utilizing detailed customer transaction data, this research aims to identify key purchasing patterns that significantly influence demand variability. By integrating the Spending Score into our predictive models, we provide a data-driven framework enabling retailers to optimize inventory management, enhance targeted marketing strategies, and minimize operational inefficiencies. Empirical results demonstrate that the inclusion of the Spending Score leads to more stable and accurate demand forecasts, facilitating improved alignment between supply and market demand. While acknowledging potential limitations, including data scalability issues and the risk of feature-induced bias, future research will explore the integration of real-time data streams, advanced deep learning methodologies, and expanded datasets to further improve predictive capabilities and model adaptability in the continuously evolving retail landscape.

References

[1] L. L. Har, U. K. Rashid, L. Te Chuan, S. C. Sen, and L. Y. Xia, “Revolution of Retail Industry: From Perspective of Retail 1.0 to 4.0,” in Procedia Computer Science, pp. 1615–1625, 2022, https://doi.org/10.1016/j.procs.2022.01.362.

[2] M. S. Sousa, A. L. D. Loureiro, and V. L. Miguéis, “Predicting demand for new products in fashion retailing using censored data,” Expert Syst Appl, vol. 259, Jan. 2025, https://doi.org/10.1016/j.eswa.2024.125313.

[3] R. M. van Steenbergen and M. R. K. Mes, “Forecasting demand profiles of new products,” Decis Support Syst, vol. 139, Dec. 2020, https://doi.org/10.1016/j.dss.2020.113401.

[4] K. Javaid et al., “Explainable Artificial Intelligence Solution for Online Retail,” Computers, Materials and Continua, vol. 71, no. 2, pp. 4425–4442, 2022, https://doi.org/10.32604/cmc.2022.022984.

[5] A. A. Khade, “Performing Customer Behavior Analysis using Big Data Analytics,” in Procedia Computer Science, Elsevier B.V., 2016, pp. 986–992. https://doi.org/10.1016/j.procs.2016.03.125.

[6] H. S. Le et al., “Predictive model for customer satisfaction analytics in E-commerce sector using machine learning and deep learning,” International Journal of Information Management Data Insights, vol. 4, no. 2, Nov. 2024, https://doi.org/10.1016/j.jjimei.2024.100295.

[7] A. Munde and J. Kaur, “Predictive Modelling of Customer Sustainable Jewelry Purchases Using Machine Learning Algorithms,” in Procedia Computer Science, pp. 683–700, 2024, https://doi.org/10.1016/j.procs.2024.04.066.

[8] T. Falatouri, F. Darbanian, P. Brandtner, and C. Udokwu, “Predictive Analytics for Demand Forecasting - A Comparison of SARIMA and LSTM in Retail SCM,” in Procedia Computer Science, pp. 993–1003, 2022, https://doi.org/10.1016/j.procs.2022.01.298.

[9] A. ul Husna, S. H. Amin, and A. Ghasempoor, “Machine Learning Techniques and Multi-objective Programming to Select the Best Suppliers and Determine the Orders,” Machine Learning with Applications, p. 100623, Jan. 2025, https://doi.org/10.1016/j.mlwa.2025.100623.

[10] H. Chan and M. I. M. Wahab, “A machine learning framework for predicting weather impact on retail sales,” Supply Chain Analytics, vol. 5, p. 100058, Mar. 2024, https://doi.org/10.1016/j.sca.2024.100058.

[11] A. Abdulla and G. Baryannis, “A hybrid multi-criteria decision-making and machine learning approach for explainable supplier selection,” Supply Chain Analytics, vol. 7, p. 100074, Sep. 2024, https://doi/org/10.1016/j.sca.2024.100074.

[12] K. Namir, H. Labriji, and E. H. Ben Lahmar, “Decision Support Tool for Dynamic Inventory Management using Machine Learning, Time Series and Combinatorial Optimization,” in Procedia Computer Science, pp. 423–428, 2021, https://doi.org/10.1016/j.procs.2021.12.264.

[13] R. Zimmermann and P. Brandtner, “From Data to Decisions: Optimizing Supply Chain Management with Machine Learning-Infused Dashboards,” in Procedia Computer Science, pp. 955–964, 2024, https://doi.org.10.1016/j.procs.2024.05.184.

[14] S. Arena, E. Florian, F. Sgarbossa, E. Sølvsberg, and I. Zennaro, “A conceptual framework for machine learning algorithm selection for predictive maintenance,” Eng Appl Artif Intell, vol. 133, Jul. 2024, https://doi.org/10.1016/j.engappai.2024.108340.

[15] I. Jebbor, Z. Benmamoun, and H. Hachimi, “Forecasting supply chain disruptions in the textile industry using machine learning: A case study,” Ain Shams Engineering Journal, vol. 15, no. 12, pp. 103116, Dec. 2024, https://doi.org/10.1016/j.asej.2024.103116.

[16] I. F. Chen and C. J. Lu, “Demand forecasting for multichannel fashion retailers by integrating clustering and machine learning algorithms,” Processes, vol. 9, no. 9, Sep. 2021, https://doi.org/10.3390/pr9091578.

[17] S. Akhmetbek, “Forecasting Customer Future Behavior In Retail Business Using Machine Learning Models,” Scientific Journal of Astana IT University, vol. 10, pp. 45–60, Jun. 2022, https://doi.org/10.37943/ILMM7870.

[18] Y. Sun, “Sales Prediction Based on Machine Learning Approach,” International Conference on Social Sciences and Economic Development (ICSSED), pp. 1016–1023, 2024, https://doi.org/10.2991/978-94-6463-459-4_113.

[19] Z. Solver, “Consumer Behavior and Shopping Habits Dataset,” Kaggle, 2021. [Online]. https://www.kaggle.com/datasets/zeesolver/consumer-behavior-and-shopping-habits-dataset.

[20] Ç. Demirel, A. A. Tokuç, and A. T. Tekin, “Click prediction boosting via Bayesian hyperparameter optimization-based ensemble learning pipelines,” Intelligent Systems with Applications, vol. 17, Feb. 2023, https://doi.org/10.1016/j.iswa.2023.200185.

[21] H. Lu, X. Chen, C. Fang, and H. Yang, “Data analytics for optimizing extreme fast charging: a survey,” Data Science and Management, vol. 1, no. 1, pp. 23–31, Mar. 2021, https://doi.org/10.1016/j.dsm.2021.02.001.

[22] X. Xu and Y. Zhang, “A Gaussian process regression machine learning model for forecasting retail property prices with Bayesian optimizations and cross-validation,” Decision Analytics Journal, vol. 8, Sep. 2023, https://doi.org/10.1016/j.dajour.2023.100267.

[23] H. Wijaya, D. P. Hostiadi, and E. Triandini, “Optimization XGBoost Algorithm Using Parameter Tunning in Retail Sales Prediction,” Jurnal Nasional Pendidikan Teknik Informatika (JANAPATI), vol. 13, no. 3, Dec. 2024, https://doi.org/10.23887/janapati.v13i3.82214.

[24] S. Chatterjee, R. Chaudhuri, S. Gupta, U. Sivarajah, and S. Bag, “Assessing the impact of big data analytics on decision-making processes, forecasting, and performance of a firm,” Technol Forecast Soc Change, vol. 196, Nov. 2023, https://doi.org/10.1016/j.techfore.2023.122824.

[25] M. Seyedan and F. Mafakheri, “Predictive big data analytics for supply chain demand forecasting: methods, applications, and research opportunities,” J Big Data, vol. 7, no. 1, 2020, https://doi.org/10.1186/s40537-020-00329-2.

[26] O. Olubusola, N. Z. Mhlongo, D. O. Daraojimba, A. O. Ajayi-Nifise, and T. Falaiye, “Machine learning in financial forecasting: A U.S. review: Exploring the advancements, challenges, and implications of AI-driven predictions in financial markets,” World Journal of Advanced Research and Reviews, vol. 21, no. 2, pp. 1969–1984, Feb. 2024, https://doi.org/10.30574/wjarr.2024.21.2.0444.

[27] N. Zougagh, A. Charkaoui, and A. Echchatbi, “Artificial intelligence hybrid models for improving forecasting accuracy,” in Procedia Computer Science, pp. 817–822, 2021, https://doi.org/10.1016/j.procs.2021.04.013.

[28] A. Hayat, M. U. Ashraf, M. Sajawal, S. Usman, and H. S. Alshaikh, “Predictive Analysis of Retail Sales Forecasting using Machine Learning Techniques,” Lahore Garrison University Research Journal of Computer Science and Information Technology, vol. 6, no. 4, pp. 33-45, 2023, https://doi.org/10.54692/lgurjcsit.2022.06004399.

[29] X. Dairu and Z. Shilong, “Machine Learning Model for Sales Forecasting by Using XGBoost,” in 2021 IEEE International Conference on Consumer Electronics and Computer Engineering, ICCECE, pp. 480–483, Jan. 2021, https://doi.org/10.1109/ICCECE51280.2021.9342304.

[30] L. Ji and S. Li, “A Dynamic Financial Risk Prediction System for Enterprises Based on Gradient Boosting Decision Tree Algorithm,” Systems and Soft Computing, p. 200189, Jan. 2025, https://doi.org/10.1016/j.sasc.2025.200189.

[31] B. K. Meher, M. Singh, R. Birau, and A. Anand, “Forecasting stock prices of fintech companies of India using random forest with high-frequency data,” Journal of Open Innovation: Technology, Market, and Complexity, vol. 10, no. 1, Mar. 2024, https://doi.org/10.1016/j.joitmc.2023.100180.

[32] M. H. Almaspoor, A. Safaei, A. Salajegheh, B. Minaei-Bidgoli, and A. A. Safaei, “Support Vector Machines in Big Data Classiication: A Systematic Literature Review,” 2021, https://doi.org/10.21203/rs.3.rs-663359/v1.

[33] S. K. Kiangala and Z. Wang, “An effective adaptive customization framework for small manufacturing plants using extreme gradient boosting-XGBoost and random forest ensemble learning algorithms in an Industry 4.0 environment,” Machine Learning with Applications, vol. 4, p. 100024, Jun. 2021, https://doi.org/10.1016/j.mlwa.2021.100024.

[34] B. Talekar, “A Detailed Review on Decision Tree and Random Forest,” Biosci Biotechnol Res Commun, vol. 13, no. 14, pp. 245–248, Dec. 2020, https://doi.org/10.21786/bbrc/13.14/57.

[35] S. Ravi Teja Reddy and P. Malathi, “Design and Implementation of Sales Prediction Model Using Decision Tree Regressor over Linear Regression Towards Increase in Accuracy of Prediction,” in Advances in Parallel Computing, pp. 522–529, 2022, https://doi.org/10.3233/APC220074.

[36] Q. Li, X. Wang, Q. Pei, X. Chen, and K.-Y. Lam, “Consistency preserving database watermarking algorithm for decision trees,” Digital Communications and Networks, Jan. 2023, https://doi.org/10.1016/j.dcan.2022.12.015.

[37] S. Sitanskiy, L. Sebastia, and E. Onaindia, “Learning and Recognizing Human Behaviour with Relational Decision Trees,” Procedia Comput Sci, vol. 246, pp. 3168–3177, 2024, https://doi.org/10.1016/j.procs.2024.09.356.

[38] J. Zhang, “Impact of an improved random forest-based financial management model on the effectiveness of corporate sustainability decisions,” Systems and Soft Computing, vol. 6, Dec. 2024, https://doi.org/10.1016/j.sasc.2024.200102.

[39] X. Ma et al., “Predicting the utilization factor of blasthole in rock roadways by random forest,” Underground Space (China), vol. 11, pp. 232–245, Aug. 2023, https://doi.org/10.1016/j.undsp.2023.01.006.

[40] A. B. Adetunji, O. N. Akande, F. A. Ajala, O. Oyewo, Y. F. Akande, and G. Oluwadara, “House Price Prediction using Random Forest Machine Learning Technique,” in Procedia Computer Science, pp. 806–813, 2021, https://doi.org/10.1016/j.procs.2022.01.100.

[41] N. H. Putri, M. Fatekurohman, and I. M. Tirta, “Credit risk analysis using support vector machines algorithm,” in Journal of Physics: Conference Series, vol. 1836, no. 1, p. 012039, Mar. 2021, https://doi.org/10.1088/1742-6596/1836/1/012039.

[42] M. A. Almaiah et al., “Performance Investigation of Principal Component Analysis for Intrusion Detection System Using Different Support Vector Machine Kernels,” Electronics (Switzerland), vol. 11, no. 21, Nov. 2022, https://doi.org/10.3390/electronics11213571.

[43] M. M. Dakwah, A. A. Firdaus, F. Furizal, and R. Faresta, “Sentiment Analysis on Marketplace in Indonesia using Support Vector Machine and Naïve Bayes Method,” Jurnal Ilmiah Teknik Elektro Komputer dan Informatika, vol. 10, no. 1, p. 39, Feb. 2024, https://doi.org/10.26555/jiteki.v10i1.28070.

[44] A. H. Alzraiee and R. G. Niswonger, “A probabilistic approach to training machine learning models using noisy data,” Environmental Modelling and Software, vol. 179, Aug. 2024, https://doi.org/10.1016/j.envsoft.2024.106133.

[45] N. Sharma, R. Sharma, and N. Jindal, “Machine Learning and Deep Learning Applications-A Vision,” Global Transitions Proceedings, vol. 2, no. 1, pp. 24–28, Jun. 2021, https://doi.org/0.1016/j.gltp.2021.01.004.

[46] D. Raditya, E. P. Nicholas, A. S. Ferarida, and N. Hanafiah, “Predicting Sneaker Resale Prices using Machine Learning,” in Procedia Computer Science, pp. 533–540, 2021, https://doi.org/10.1016/j.procs.2021.01.037.

[47] N. Patankar, S. Dixit, A. Bhamare, A. Darpel, and R. Raina, “Customer Segmentation Using Machine Learning,” Advances in Parallel Computing, vol. 39, pp. 239–244, 2021, https://doi.org/10.3233/APC210200.

[48] P. Ganguly and I. Mukherjee, “Enhancing Retail Sales Forecasting with Optimized Machine Learning Models,” International Conference on Sustainable Expert Systems (ICSES), pp. 884-889, Oct. 2024, https://doi.org/10.1109/ICSES63445.2024.10762950.

[49] M. Syberg, N. West, D. Lenze, and J. Deuse, “Framework for predictive sales and demand planning in customer-oriented manufacturing systems using data enrichment and machine learning,” in Procedia CIRP, pp. 1107–1112, 2023, https://doi.org/10.1016/j.procir.2023.09.133.

[50] Z. N. Jawad and V. Balázs, “Machine learning-driven optimization of enterprise resource planning (ERP) systems: a comprehensive review,” Beni Suef Univ J Basic Appl Sci, vol. 13, no. 1, 2024, https://doi.org/10.1186/s43088-023-00460-y.

[51] D. Falkner, M. Bögl, A. Gattinger, R. Stainko, J. Zenisek, and M. Affenzeller, “Integrating Machine Learning into Supply Chain Management: Challenges and Opportunities,” in Procedia Computer Science, pp. 1779–1788, 2024, https://doi.org/10.1016/j.procs.2024.01.176.

Downloads

Published

2025-04-21

How to Cite

[1]
N. P. E. P. A. Adriani, R. R. Huizen, and D. Hermawan, “A Machine Learning-Based Approach for Retail Demand Forecasting: The Impact of Spending Score and Algorithm Optimization”, J. Ilm. Tek. Elektro Komput. Dan Inform, vol. 11, no. 2, pp. 169–183, Apr. 2025.

Issue

Section

Articles

Similar Articles

<< < 1 2 3 4 > >> 

You may also start an advanced similarity search for this article.