ASPG Menu
search

American Scientific Publishing Group

verified Journal

Fusion: Practice and Applications

ISSN
Online: 2692-4048 Print: 2770-0070
Frequency

Continuous publication

Publication Model

Open access · Articles freely available online · APC applies after acceptance

Fusion: Practice and Applications
Full Length Article

Volume 19Issue 2PP: 28-44 • 2025

Innovations in Health Anomaly Detection: A Comparative Review of Machine Learning and Statistical Approaches

Nada M. Sallam 1* ,
Eman Ben Salah 2
1Faculty of Computer Studies, Arab Open University, Riyadh, Saudi Arabia
2Faculty of Business Studies, Arab Open University, Riyadh, Saudi Arabia
* Corresponding Author.
Received: January 07, 2025 Revised: February 17, 2025 Accepted: March 06, 2025

Abstract

One of the significant challenges in modern healthcare is the early and accurate detection of health anomalies, especially in the case of life-threatening diseases such as breast cancer. This paper investigates the comparative efficacy of ML models and statistical methods for the classification of breast tumors as benign or malignant using the Breast Cancer Wisconsin (Diagnostic) Dataset. The dataset, comprising various tumor cell attributes, was preprocessed with Principal Component Analysis (PCA) to enhance model training efficiency. The first 11 principal components retained 95% of the total variance, ensuring minimal information loss while reducing dimensionality. We compared the performance of several machine learning algorithms, including Logistic Regression, Support Vector Machines (SVM), Artificial Neural Networks (ANN), Decision Trees (DT), Random Forests (RF), Naïve Bayes (NB), and K-Nearest Neighbors (KNN). Among them, Logistic Regression, SVM, and ANN achieved near-perfect classification accuracy with balanced precision-recall metrics, where the accuracy rates were all more than 98%. XGBoost and Random Forest were also very impressive as advanced models, while simple models like Decision Trees and Naïve Bayes proved to be less potent and were unable to manage class imbalances and complex data patterns. Our main findings are essentially reflective of the transformative role machine learning would play in healthcare; for instance, enhancing the accuracy of diagnosis, optimizing clinical workflow, and promoting decision-making. These insights are made actionable for practitioners in healthcare to promote the adoption of reliable ML solutions for breast cancer detection. In the future, real-time data integration, external validation, and hybrid modeling approaches must be considered to further enhance the practical utility of these findings.

Keywords

Breast Cancer Classification Machine Learning Artificial intelligence Statistical Methods Dimensionality Reduction

References

[1] P. Esmaeilzadeh, “Challenges and strategies for wide-scale artificial intelligence (AI) deployment in healthcare practices: A perspective for healthcare organizations,” Artif. Intell. Med., vol. 151, p. 102861, 2024. 

[2] R. G. L. da Silva, “The advancement of artificial intelligence in biomedical research and health innovation: challenges and opportunities in emerging economies,” Global Health, vol. 20, no. 1, p. 44, 2024. 

[3] A. Alanazi, “Using machine learning for healthcare challenges and opportunities,” Inform. Med. Unlocked, vol. 30, p. 100924, 2022. 

[4] A. A. Soomro et al., “Insights into modern machine learning approaches for bearing fault classification: a systematic literature review,” Results Eng., p. 102700, 2024. 

[5] Z. Zong and Y. Guan, “AI-driven intelligent data analytics and predictive analysis in Industry 4.0: Transforming knowledge, innovation, and efficiency,” J. Knowl. Econ., pp. 1–40, 2024. 

[6] M. Javaid et al., “Significance of machine learning in healthcare: Features, pillars and applications,” Int. J. Intell. Netw., vol. 3, pp. 58–73, 2022. 

[7] R. Krishnamoorthi et al., “[Retracted] A novel diabetes healthcare disease prediction framework using machine learning techniques,” J. Healthc. Eng., vol. 2022, no. 1, p. 1684017, 2022. 

[8] D. A. Debal and T. M. Sitote, “Chronic kidney disease prediction using machine learning techniques,” J. Big Data, vol. 9, no. 1, p. 109, 2022. 

[9] H. Lu et al., “A patient network-based machine learning model for disease prediction: The case of type 2 diabetes mellitus,” Appl. Intell., vol. 52, no. 3, pp. 2411–2422, 2022. 

[10] S. I. Ayon et al., “Coronary artery heart disease prediction: A comparative study of computational intelligence techniques,” IETE J. Res., vol. 68, no. 4, pp. 2488–2507, 2022. 

[11] Ç. Oğuz and M. Yağanoğlu, “Detection of COVID-19 using deep learning techniques and classification methods,” Inf. Process. Manag., vol. 59, no. 5, p. 103025, 2022. 

[12] J. Chung and J. Teo, “Mental health prediction using machine learning: taxonomy, applications, and challenges,” Appl. Comput. Intell. Soft Comput., vol. 2022, no. 1, p. 9970363, 2022. 

[13] V. Chang et al., “An artificial intelligence model for heart disease detection using machine learning algorithms,” Healthc. Anal., vol. 2, p. 100016, 2022. 

[14] K. Zeberga et al., “[Retracted] A novel text mining approach for mental health prediction using Bi-LSTM and BERT model,” Comput. Intell. Neurosci., vol. 2022, no. 1, p. 7893775, 2022. 

[15] M. Rezapour and L. Hansen, “A machine learning analysis of COVID-19 mental health data,” Sci. Rep., vol. 12, no. 1, p. 14965, 2022. 

[16] A. L. Yadav et al., “Heart diseases prediction using machine learning,” in Proc. 14th Int. Conf. Comput. Commun. Netw. Technol. (ICCCNT), 2023, pp. 1–7. 

[17] A. S. Kwekha-Rashid et al., “Coronavirus disease (COVID-19) cases analysis using machine-learning applications,” Appl. Nanosci., vol. 13, no. 3, pp. 2013–2025, 2023. 

[18] K. Arumugam et al., “Multiple disease prediction using machine learning algorithms,” Mater. Today Proc., vol. 80, pp. 3682–3685, 2023. 

[19] C. M. Bhatt et al., “Effective heart disease prediction using machine learning techniques,” Algorithms, vol. 16, no. 2, p. 88, 2023. 

[20] S. Goyal and R. Singh, “Detection and classification of lung diseases for pneumonia and COVID-19 using machine and deep learning techniques,” J. Ambient Intell. Humaniz. Comput., vol. 14, no. 4, pp. 3239–3259, 2023. 

[21] M. Ajith et al., “A deep learning approach for mental health quality prediction using functional network connectivity and assessment data,” Brain Imaging Behav., pp. 1–16, 2024. 

[22] S. Jayanthi et al., “Mental health status monitoring for people with autism spectrum disorder using machine learning,” Int. J. Inf. Technol., vol. 16, no. 1, pp. 43–51, 2024. 

[23] A. Ahmad, “Breast cancer statistics: Recent trends,” in Breast Cancer Metastasis and Drug Resistance: Challenges and Progress, pp. 1–7, 2019. 

[24] R. D. Joshi and C. K. Dhakal, “Predicting type 2 diabetes using logistic regression and machine learning approaches,” Int. J. Environ. Res. Public Health, vol. 18, no. 14, p. 7346, 2021. 

[25] O. Kramer, “K-nearest neighbors,” in Dimensionality Reduction with Unsupervised Nearest Neighbors, pp. 13–23, 2013. 

[26] S. Zhang, “Nearest neighbor selection for iteratively kNN imputation,” J. Syst. Softw., vol. 85, no. 11, pp. 2541–2552, 2012. 

[27] M. Awad and R. Khanna, “Support vector machines for classification,” in Efficient Learning Machines: Theories, Concepts, and Applications for Engineers and System Designers, pp. 39–66, 2015. 

[28] V. G. Costa and C. E. Pedreira, “Recent advances in decision trees: An updated survey,” Artif. Intell. Rev., vol. 56, no. 5, pp. 4765–4800, 2023. 

[29] K. Fawagreh et al., “Random forests: From early developments to recent advancements,” Syst. Sci. Control Eng., vol. 2, no. 1, pp. 602–609, 2014. 

[30] K. Larsen, “Generalized naive Bayes classifiers,” ACM SIGKDD Explor. Newsl., vol. 7, no. 1, pp. 76–81, 2005.

Cite This Article

Choose your preferred format

format_quote
Sallam, Nada M., Salah, Eman Ben. "Innovations in Health Anomaly Detection: A Comparative Review of Machine Learning and Statistical Approaches." Fusion: Practice and Applications, vol. Volume 19, no. Issue 2, 2025, pp. 28-44. DOI: https://doi.org/10.54216/FPA.190203
Sallam, N., Salah, E. (2025). Innovations in Health Anomaly Detection: A Comparative Review of Machine Learning and Statistical Approaches. Fusion: Practice and Applications, Volume 19(Issue 2), 28-44. DOI: https://doi.org/10.54216/FPA.190203
Sallam, Nada M., Salah, Eman Ben. "Innovations in Health Anomaly Detection: A Comparative Review of Machine Learning and Statistical Approaches." Fusion: Practice and Applications Volume 19, no. Issue 2 (2025): 28-44. DOI: https://doi.org/10.54216/FPA.190203
Sallam, N., Salah, E. (2025) 'Innovations in Health Anomaly Detection: A Comparative Review of Machine Learning and Statistical Approaches', Fusion: Practice and Applications, Volume 19(Issue 2), pp. 28-44. DOI: https://doi.org/10.54216/FPA.190203
Sallam N, Salah E. Innovations in Health Anomaly Detection: A Comparative Review of Machine Learning and Statistical Approaches. Fusion: Practice and Applications. 2025;Volume 19(Issue 2):28-44. DOI: https://doi.org/10.54216/FPA.190203
N. Sallam, E. Salah, "Innovations in Health Anomaly Detection: A Comparative Review of Machine Learning and Statistical Approaches," Fusion: Practice and Applications, vol. Volume 19, no. Issue 2, pp. 28-44, 2025. DOI: https://doi.org/10.54216/FPA.190203
Digital Archive Ready