Volume 13 , Issue 1 , PP: 01-09, 2026 | Cite this article as | XML | Html | PDF | Full Length Article
Hadeel Imad Naser 1 * , Wakaa Ali Hadba 2
Doi: https://doi.org/10.54216/GJMSA.130101
Breast cancer is a common type of cancers and the main reason of increased death of women universally. Recently, ML methods have become important in varying fields, such as Logistic Regression, Elastic Net Logistic, Decision Tree, Random Forest, Boosting, Naive Bayes and K Nearest Neighbor. The aim of the current study is to know and predict the type of cancerous tumor whether it is benign or malignant. These above techniques are expected to be helpful. Breast tumor type diagnosis using numerous performance metrics i.e. accuracy, classification error, sensitivity and specificity, both certified and trained models were assessed. The models were developed to determine which model would provide the best performance and comparisons were done. A separate data set from the one used to create the models was utilized to confirm every model. According to the analysis, the findings showed that elastic net logistic model had the highest performance in accurate classification rate (accuracy), classification error and sensitivity. Making it the best classifier for predicting the kind of breast cancer among all other models, privacy and it was also distinguished by reduce the high dimensionality and multicollinearity problems.
Logistic Regression , Elastic Net Logistic , Decision Tree , Random Forest , Boosting , Naive Bayes , and K- Nearest Neighbor
[1] S. Osei-Afriyie, A. K. Addae, S. Oppong, H. Amu, E. Ampofo, and E. Osei, “Breast cancer awareness, risk factors and screening practices among future health professionals in Ghana: A cross-sectional study,” PLoS One, vol. 16, no. 6, pp. 1–17, 2021, doi: 10.1371/journal.pone.0253373.
[2] S. Murgan, B. Muthu Kumar, and S. Amudha, “Classification and Prediction of Breast Cancer using Linear Regression, Decision Tree and Random Forest,” in 2017 International Conference on Current Trends in Computer, Electrical, Electronics and Communication (CTCEEC), 2017, pp. 763–766, doi: 10.1109/CTCEEC.2017.8455058.
[3] H. Rajaguru and S. R. Sannasi Chakravarthy, “Analysis of decision tree and k-nearest neighbor algorithm in the classification of breast cancer,” Asian Pacific Journal of Cancer Prevention, vol. 20, no. 12, pp. 3777–3781, 2019, doi: 10.31557/APJCP.2019.20.12.3777.
[4] M. M. Islam, M. R. Haque, H. Iqbal, M. M. Hasan, M. Hasan, and M. N. Kabir, “Breast Cancer Prediction: A Comparative Study Using Machine Learning Techniques,” SN Computer Science, vol. 1, no. 5, Sep. 2020, doi: 10.1007/s42979-020-00305-w.
[5] T. A. Assegie, “An optimized K-Nearest neighbor based breast cancer detection,” Journal of Robotics and Control (JRC), vol. 2, no. 3, pp. 115–118, May 2020, doi: 10.18196/jrc.2363.
[6] M. Alharthi, M. H. Lee, and Z. Y. Algamal, “Improving the diagnosis of breast cancer using regularized logistic regression with adaptive elastic net,” Universal Journal of Public Health, vol. 9, no. 5, pp. 317–323, Oct. 2021, doi: 10.13189/ujph.2021.090514.
[7] T. A. Assegie, R. L. Tulasi, and N. K. Kumar, “Breast cancer prediction model with decision tree and adaptive boosting,” IAES International Journal of Artificial Intelligence, vol. 10, no. 1, pp. 184–190, 2021, doi: 10.11591/ijai.v10.i1.pp184-190.
[8] F. K. Nasser and S. F. Behadili, “Breast Cancer Detection using Decision Tree and K-Nearest Neighbour Classifiers,” Iraqi Journal of Science, vol. 63, no. 11, pp. 4987–5003, 2022, doi: 10.24996/ijs.2022.63.11.34.
[9] Bokhare and P. Jha, “Machine learning models applied in analyzing breast cancer classification accuracy,” IAES International Journal of Artificial Intelligence, vol. 12, no. 3, pp. 1370–1377, Sep. 2023, doi: 10.11591/ijai.v12.i3.pp1370-1377.
[10] S. S. Ahmed, Y. Srivastava, and M. G. Khan, “Prediction and Diagnosis of Breast Cancer Using Machine Learning Algorithms,” Asian Journal of Research in Medical and Pharmaceutical Sciences, vol. 13, no. 3, pp. 54–60, 2024, Art. no. AJRIMPS.117744.
[11] S. H. Alwazy, G. Buyrukoğlu, S. Buyrukoğlu, and M. R. Baker, “Evaluating machine learning and statistical learning techniques for cancer classification and diagnosis,” Iranian Journal of Computer Science, 2025, doi: 10.1007/s42044-025-00233-z.
[12] L. I. Kework and S. A. Jalal, “Using Multinomial Logistic Regression Model to predict the most Important Factors Affecting in the students’ selection for Specialization in Secondary Level for Some Schools Erbil City,” University of Kirkuk Journal of Administrative and Economic Sciences, vol. 13, no. 2, pp. 1–22, 2023.
[13] S. Sperandei, “Understanding logistic regression analysis,” Biochemia Medica, vol. 24, no. 1, pp. 12–18, 2014, doi: 10.11613/BM.2014.003.
[14] J. C. Stoltzfus, “Logistic regression: A brief primer,” Academic Emergency Medicine, vol. 18, no. 10, pp. 1099–1104, Oct. 2011, doi: 10.1111/j.1553-2712.2011.01185.x.
[15] Diana, J. E. Griffin, J. Oberoi, and J. Yao, “Machine-Learning Methods for Insurance Applications – A Survey,” 2019.
[16] H. Zou and T. Hastie, “Regularization and variable selection via the elastic net,” Journal of the Royal Statistical Society Series B: Statistical Methodology, vol. 67, no. 2, pp. 301–320, 2005.
[17] D. Shree and J. M. Lincy, “Prediction of chronic kidney disease with supervised machine learning algorithms,” International Journal of Creative Research Thoughts, vol. 11, no. 3, pp. 2320–2882, 2023.
[18] H. Shi, S. Liu, J. Chen, X. Li, Q. Ma, and B. Yu, “Predicting drug-target interactions using Lasso with random forest based on evolutionary information and chemical structure,” Genomics, vol. 111, no. 6, pp. 1839–1852, Dec. 2019, doi: 10.1016/j.ygeno.2018.12.007.
[19] H. Ahmed, M. N. A. Al-Hamadani, and I. A. Satam, “Prediction of COVID-19 disease severity using machine learning techniques,” Bulletin of Electrical Engineering and Informatics, vol. 11, no. 2, pp. 1069–1074, 2022, doi: 10.11591/eei.v11i2.3272.
[20] D. Zhang and J. J.-P. Tsai, Advances in Machine Learning Applications in Software Engineering. Idea Group Publishing, 2007.
[21] Parlina et al., “Naive Bayes Algorithm Analysis to Determine the Percentage Level of visitors the Most Dominant Zoo Visit by Age Category,” in Journal of Physics: Conference Series, vol. 1255, no. 1, Art. no. 012031, Sep. 2019, doi: 10.1088/1742-6596/1255/1/012031.
[22] D. Berrar, “Bayes’ theorem and naive bayes classifier,” in Encyclopedia of Bioinformatics and Computational Biology, vol. 1, Elsevier, 2018, pp. 403–412, doi: 10.1016/B978-0-12-809633-8.20473-1.
[23] S. F. Khorshid and A. M. Abdulazeez, “Breast Cancer Diagnosis Based on K-Nearest Neighbors: A Review,” Journal of Applied Science and Technology Trends, vol. 2, no. 01, pp. 1927–1951, 2021, ISSN: 1567-214X.
[24] Korotcov, V. Tkachenko, D. P. Russo, and S. Ekins, “Comparison of Deep Learning with Multiple Machine Learning Methods and Metrics Using Diverse Drug Discovery Data Sets,” Molecular Pharmaceutics, vol. 14, no. 12, pp. 4462–4475, Dec. 2017, doi: 10.1021/acs.molpharmaceut.7b00578.
[25] F. H. Anad, “Comparison between logistic regression model and Vector machine to classify observations of COVID-19 patients at Al-Hussein General Hospital,” University of Kirkuk Journal of Administrative and Economic Sciences, vol. 14, no. 3, pp. 261–273, 2024.