DTOSFS–CatBoost: A Hybrid Metaheuristic Framework for
Accurate and Interpretable Unemployment Forecasting
Ghassan AL-Thabhawee1 Hussein Alkattan2
1 Sciences of Mathematics, Computer Sciences, College of Health and Medical Techniques-Kufa, Al-Furat Al-Awsat Technical
University, Kufa, Iraq
2 Department of System Programming, South Ural State University, Chelyabinsk, Russia; Directorate of Environment in Najaf,
Ministry of Environment, Najaf, Iraq
Emails: gmohammed@atu.edu.iq · alkattan.hussein92@gmail.com
Received: January 03, 2026 Revised: March 01, 2026 Accepted: May 02, 2026 ⋆ Corresponding author
ABSTRACT
The fact that educational, demographic, and macroeconomic variables interact nonlinearly has remained a thorn
in the flesh of socio-economic analytics to date, making it challenging to forecast unemployment with sufficient
precision. To address this, the current study presents a hybrid metaheuristic, Dipper Throated Optimization with
Stochastic Fractal Search (DTOSFS), coupled with the Category Boosting (CatBoost) algorithm to improve predictive
modelling. The suggested DTOSFS-CatBoost system combines the general exploratory search of DTO with SFS
refinement to stochastic local optimization of hyperparameters, and alleviates overfitting. Empirical experiments
have shown that whereas the original CatBoost gave results with a Mean Squared Error (MSE) of 0.0256 and Root
Mean Squared Error (RMSE) of 0.1601 with a correlation coefficient of 0.873, the CatBoost optimized by DTOSFS
had drastically better results with an MSE of 0.00033, RMSE of 0.00207, and a correlation coefficient of 0.930.
These results confirm an increased exploration-to-exploitation ratio in DTOSFS and yield small, powerful designs
that substantially enhance model stability, precision, and convergence speed. These results show that educational
attainment (at least tertiary and primary enrollment) and demographics (at least the birth rate) are influential factors
in unemployment variation. This addition to predictive performance is not the only one, and it provides a predictive
data-driven labor-market optimization paradigm that can be replicated and interpreted. The research observes that
hybrid metaheuristics and gradient boosting can be used to drive next-generation economic intelligence systems for
adaptive policy formulation and to enhance online, privacy-conscious, and cross-domain unemployment prediction.
Keywords: DTOSFS CatBoost Unemployment Forecasting Hybrid Metaheuristic Optimization Socio-Economic
Prediction
1. INTRODUCTION
Employment is one of the most acute problems that negatively
impacts the socioeconomic development of countries,
societies, and people in general [1]. The comparatively high
unemployment rates slow progress and development, aggravate
poverty, and reduce the quality of life of all people .
Moreover, unemployment negatively affects society through
its effects on consumer spending, the growth of dependence
on social programs, and psychological effects on both communities
and individuals. To that end, accurately predicting
unemployment rates could help policymakers and businesses
address these challenges [2]. The study of unemployment patterns,
therefore, helps governments design better approaches,