Volume 14 , Issue 2 , PP: 01–12, 2026 | Cite this article as | XML | Html | PDF | Full Length Article
Ilknur Ozturk 1 *
Doi: https://doi.org/10.54216/AJBOR.140201
The strategic importance of customer retention in small and medium-sized enterprises (SMEs) is due to the fact that the resources are limited, and the indiscriminate customer acquisition and customer retention campaigns are economically inefficient. However, the descriptive reporting used by many SMEs does not have the advantages of transactiondriven analytics that allows differentiating between high-value and low-yield customer relationships. This paper creates a repli-cable customer-analytics pipeline in SME-type retail environments, using publicly available transactional data. In con-trast to the macro-level forecasting research, the paper integrates customer value segmentation with the futureoriented repeat-purchase prediction and translates the results into retention actions explicitly. The customer-level features were based on invoices, quantities, prices, product variety, and return behavior and were derived using the public Online Retail dataset. Observation windows on a monthly were transformed into a repeat-purchase 90-day problem. Three predictive models—logistic regression, random forest, and gradient boosting—were compared after customer segmentation based on recency, frequency, and monetary behavior. The findings indicate that random forest model had the highest discrimination (ROC-AUC = 0.750; PR-AUC = 0.821), followed by logistic regression, which was only slightly less than it and more interpretable. Segment analysis also showed a very concentrated revenue base with Champions having 27.5 percent of the customers but 67.2 percent of recent revenue and 81.0 rate of repeat purchasing. The paper provides a submission-ready, transparently reproducible, and managerially understandable design that is particularly applicable in SMEs that want low-cost retention analytics, customer ranking, and allocation of marketing resources.
Customer retention , SMEs , Repeat purchase , Customer value , Business data analytics , Retail analytics
[1] UCI Machine Learning Repository (2015). Online Retail. Available at: https://archive.ics.uci.edu/dataset/352/online+retail
[2] Babalghaith, R. and Aljarallah, A. (2024). Factors Affecting Big Data Analytics Adoption in Small and Medium Enterprises. Information Systems Frontiers, 26(6), 2165–2187. https://doi.org/10.1007/s10796-024-10538-2
[3] Sagala, G. H. and Ori, D. (2024). Toward SMEs Digital Transformation Success: A Systematic Literature Review. Information Systems and e-Business Management, 22(4), 667–719. https://doi.org/10.1007/s10257-024 00682-2
[4] Trinh, H. T. (2024). An SME Approach to Data Analytics by Management Accountants in the Transition Economy of Vietnam. Journal of Science and Technology Policy Management. https://doi.org/10.1108/JSTPM 12-2023-0222
[5] Alkhasoneh, O. M., Jamaludin, H., Bin Zahar, A. R. I., and Al-Sharafi, M. A. (2025). Drivers of Social Media Use Among SMEs and Its Impact on Brand Awareness and Customer Engagement. Asia-Pacific Journal of Business Administration, 17(3), 595–615. https://doi.org/10.1108/APJBA-02-2024-0102
[6] Seo, E. and Lee, E. (2025). Linking SMEs’ Customer Strategy to Firm Growth: The Case of Manufacturing Suppliers in South Korea. Asia Pacific Journal of Marketing and Logistics, 37(3), 782–799. https://doi.org/10.1108/APJML-03-2024-0313
[7] Chou, P., Chuang, H. H.-C., Chou, Y.-C., and Liang, T.-P. (2022). Predictive Analytics for Customer Repurchase: Interdisciplinary Integration of Buy Till You Die Modeling and Machine Learning. European Journal of Operational
Research, 296(2), 635–651. https://doi.org/10.1016/j.ejor.2021.04.021
[8] Chen, S., Xu, Z., Xu, D., and Gou, X. (2024). Customer Purchase Prediction in B2C E-Business: A Systematic Review and Future Research Agenda. Expert Systems with Applications, 252, 124261. https://doi.org/10.1016/j.eswa.2024.124261
[9] Kim, S., Shin, W., and Kim, H.-W. (2024). Predicting Online Customer Purchase: The Integration of Customer Characteristics and Browsing Patterns. Decision Support Systems, 177, 114105. https://doi.org/10.1016/j.dss.2023.114105
[10] Du, J. and Chen, W. (2024). Enhancing Community E-Commerce Repurchase Prediction Through Information Entropy Analysis. International Journal of e-Collaboration, 20(1). https://doi.org/10.4018/IJeC.349897
[11] Poudel, S. S., Pokharel, S., and Timilsina, M. (2024). Explaining Customer Churn Prediction in Telecom Industry Using Tabular Machine Learning Models. Machine Learning with Applications, 17, 100567. https://doi.org/10.1016/j.mlwa.2024.100567
[12] Haddadi, S. J., Farshidvard, A., Silva, F. dos S., dos Reis, J. C., and Reis, M. M. (2024). Customer Churn Prediction in Imbalanced Datasets with Resampling Methods: A Comparative Study. Expert Systems with Applications, 246, 123086. https://doi.org/10.1016/j.eswa.2023.123086
[13] Dogan, O., Hiziroglu, A., Pisirgen, A., and Seymen, O. F. (2025). Business Analytics in Customer Lifetime Value: An Overview Analysis. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 15(1), e1571. https://doi.org/10.1002/widm.1571
[14] Roy, S. K., Nazaritehrani, A., Pandit, A., Apostolidis, C., and Ray, S. (2025). AI-Capable Relationship Marketing: Shaping the Future of Customer Relationships. Journal of Business Research, 192, 115309. https://doi.org/10.1016/j.jbusres.2025.115309
[15] Zaghloul, M., Barakat, S., and Rezk, A. (2025). Enhancing Customer Retention in Online Retail Through Churn Prediction: A Hybrid RFM, K-Means, and Deep Neural Network Approach. Expert Systems with Applications, 290, 128465. https://doi.org/10.1016/j.eswa.2025.128465
[16] Wong, A., Viloria Garcia, A., and Lim, Y.-W. (2025). A Data-Driven Approach to Customer Lifetime
Value Prediction Using Probability and Machine Learning Models. Decision Analytics Journal, 16, 100601. https://doi.org/10.1016/j.dajour.2025.100601
[17] Boozary, P., Sheykhan, S., GhorbanTanhaei, H., and Magazzino, C. (2025). Enhancing Customer Retention with Machine Learning: A Comparative Analysis of Ensemble Models for Accurate Churn Prediction. International Journal of Information Management Data Insights, 5(1), 100331. https://doi.org/10.1016/j.jjimei.2025.100331