Volume 21 , Issue 1 , PP: 131-141, 2026 | Cite this article as | XML | Html | PDF | Full Length Article
Mustafa Radif 1 , Zainab Fahad alnaseri 2 , Salam saad alkafagi 3 , Ali Hakem Al-saeedi 4 , Riyadh Rahef Nuiaa Alogaili 5 * , Mazin Abed Mohammed 6
Doi: https://doi.org/10.54216/FPA.210109
As digital devices continue to process ever-increasing volumes of complex data, ensuring accurate and efficient machine learning performance has become a significant challenge. Traditional ensemble learning methods often attempt to address these issues through data sampling or partitioning; however, such approaches can introduce biases and fail to fully capture the underlying structure of the data. To address these limitations, this paper proposes a novel classification framework that integrates clustering with adaptive weighting strategies. The process begins by dividing the training data into clusters, each representing a specific subset of the overall data distribution. Separate machine learning models are then trained on these clusters, allowing each model to specialize in different areas of the data. When analyzing a test instance, its relationship to the individual clusters is evaluated using two key measures: the correlation coefficient, which assesses feature similarity, and the Mahalanobis distance, which calculates the statistical proximity to the cluster center. These values are subsequently used to generate optimized weights that determine the influence each model should have in the final ensemble prediction. By aligning model contributions with the structural similarities between the test and training data, the proposed approach enhances both the reliability and precision of classification. Experimental results demonstrate that this cluster-aware ensemble consistently outperforms both baseline and advanced classifiers on benchmark datasets.
Ensemble learning , Correlation-based model , Similarity , Data mining , Machine learning
[1] S. M. Hadi, A. H. Alsaeedi, D. Al‐Shammary, Z. A. A. Alyasseri, M. A. Mohammed, K. H. Abdulkareem, R. R. Nuiaa, and M. M. Jaber, "Trigonometric words ranking model for spam message classification," IET Networks, 2022.
[2] F. S. Alrayes, M. Maray, A. Alshuhail, K. M. Almustafa, A. A. Darem, A. M. Al-Sharafi, and S. D. Alotaibi, "Privacy-preserving approach for IoT networks using statistical learning with optimization algorithm on high-dimensional big data environment," Scientific Reports, vol. 15, no. 1, p. 3338, 2025.
[3] A. J. Thompson, B. R. Smith, and C. L. Davis, "A review of machine learning techniques for cybersecurity," Journal of Computer Virology and Hacking Techniques, vol. 17, no. 3, pp. 153-167, 2021.
[4] H. Kuchuk and E. Malokhvii, "Integration of IoT with cloud, fog, and edge computing: a review," Advanced Information Systems, vol. 8, no. 2, pp. 65-78, 2024.
[5] R. R. Nuiaa, S. Manickam, and A. S. D. Alfoudi, "Dynamic Evolving Cauchy Possibilistic Clustering Based on the Self-Similarity Principle (DECS) for Enhancing Intrusion Detection System," 2022.
[6] M. T. Nguyen and H. L. Tran, "An ensemble learning approach for detecting network intrusions," Journal of Network and Computer Applications, vol. 200, p. 103278, 2023.
[7] D. Sargiotis, "Data Quality Management: Ensuring Accuracy and Reliability," in Data Governance: A Guide, Springer, 2024, pp. 197-216.
[8] Z. Sultana, M. Foysal, S. Islam, and A. Foysal, "Lung cancer detection and classification from chest CT images using an ensemble deep learning approach," in 2024 6th International Conference on Electrical Engineering and Information & Communication Technology (ICEEICT), 2024: IEEE, pp. 364-369.
[9] W. Chen, K. Yang, Z. Yu, Y. Shi, and C. L. P. Chen, "A survey on imbalanced learning: latest research, applications and future directions," Artificial Intelligence Review, vol. 57, no. 6, p. 137, 2024.
[10] M. Sakib, S. Mustajab, and M. Alam, "Ensemble deep learning techniques for time series analysis: a comprehensive review, applications, open issues, challenges, and future directions," Cluster Computing, vol. 28, no. 1, pp. 1-44, 2025.
[11] B. H. Aubaidan, R. A. Kadir, M. T. Lajb, M. Anwar, K. N. Qureshi, B. A. Taha, and K. Ghafoor, "A review of intelligent data analysis: Machine learning approaches for addressing class imbalance in healthcare-challenges and perspectives," Intelligent Data Analysis, p. 1088467X241305509, 2024.
[12] M. O. Lawrence, R. G. Jimoh, and W. B. Yahya, "An efficient feature selection and classification system for microarray cancer data using genetic algorithm and deep belief networks," Multimedia Tools and Applications, vol. 84, no. 8, pp. 4393-4434, 2025.
[13] S. Kanthimathi, S. Venkatraman, K. S. Jayasankar, T. P. Jiljith, and R. Jashwanth, "A Novel self-attention-enabled weighted ensemble-based convolutional neural network framework for distributed denial of service attack classification," IEEE Access, 2024.
[14] P. Mamatha, S. Balaji, and S. S. Anuraghav, "Development of Hybrid Intrusion Detection System Leveraging Ensemble Stacked Feature Selectors and Learning Classifiers to Mitigate the DoS Attacks," International Journal of Computational Intelligence Systems, vol. 18, no. 1, p. 20, 2025.
[15] X. Han, R. Zhang, X. Liu, and F. Jiang, "Biologically inspired smart contract: A blockchain-based DDoS detection system," in 2020 IEEE International Conference on Networking, Sensing and Control (ICNSC), 2020: IEEE, pp. 1-6.
[16] M. Beulah and B. P. Manickam, "Detection of DDoS Attack Using Ensemble Machine Learning Techniques," in Soft Computing for Security Applications, Singapore, G. Ranganathan, X. Fernando, F. Shi, and Y. El Allioui, Eds., 2022: Springer Singapore, pp. 889-903.
[17] O. Thorat, N. Parekh, and R. Mangrulkar, "TaxoDaCML: Taxonomy based Divide and Conquer using machine learning approach for DDoS attack classification," International Journal of Information Management Data Insights, vol. 1, no. 2, p. 100048, 2021.
[18] G. Usha, M. Narang, and A. Kumar, "Detection and classification of distributed DoS attacks using machine learning," in Computer Networks and Inventive Communication Technologies: Proceedings of Third ICCNCT 2020, 2021: Springer, pp. 985-1000.
[19] D. Akgun, S. Hizal, and U. Cavusoglu, "A new DDoS attacks intrusion detection model based on deep learning for cybersecurity," Computers & Security, vol. 118, p. 102748, 2022.
[20] I. D. Mienye and Y. Sun, "A survey of ensemble learning: Concepts, algorithms, applications, and prospects," IEEE Access, vol. 10, pp. 99129-99149, 2022.
[21] S. Chen and N. M. Luc, "RRMSE Voting Regressor: A weighting function based improvement to ensemble regression," arXiv preprint arXiv: 2207.04837, 2022.
[22] S. R. Quasar, R. Sharma, A. Mittal, M. Sharma, D. Agarwal, and I. de La Torre Díez, "Ensemble methods for computed tomography scan images to improve lung cancer detection and classification," Multimedia Tools and Applications, vol. 83, no. 17, pp. 52867-52897, 2024.
[23] R. Bakır, C. Orak, and A. Yüksel, "Optimizing hydrogen evolution prediction: A unified approach using random forests, lightGBM, and Bagging Regressor ensemble model," International Journal of Hydrogen Energy, vol. 67, pp. 101-110, 2024.
[24] A. H. Alsaeedi, A. M. Al-juboori, R. R. Nuiaa, Z. A. A. Alyasseri, H. J. Mohammed, N. Sani, M. I. Esa, and B. A. Musawi, "A Hybrid Cracked Tiers Detection System Based on Adaptive Correlation Features Selection and Deep Belief Neural Networks," Symmetry, 2023.
[25] E. Yaghoubi, E. Yaghoubi, A. Khamees, and A. H. Vakili, "A systematic review and meta-analysis of artificial neural network, machine learning, deep learning, and ensemble learning approaches in field of geotechnical engineering," Neural Computing and Applications, vol. 36, no. 21, pp. 12655-12699, 2024.