ASPG Menu
search

American Scientific Publishing Group

verified Journal

Fusion: Practice and Applications

ISSN
Online: 2692-4048 Print: 2770-0070
Frequency

Continuous publication

Publication Model

Open access · Articles freely available online · APC applies after acceptance

Fusion: Practice and Applications

Volume 18 / Issue 1 ( 21 Articles)

Full Length Article DOI: https://doi.org/10.54216/FPA.180121

Compiler Sequence Optimization Using Machine Learning Prediction Method

Compiler optimization is crucial in improving program performance by improving execution speed, reducing memory usage, and minimizing energy consumption. Nevertheless, modern compilers, such as LLVM, with their numerous optimization passes, present a significant challenge in identifying the most effective sequence for optimizing a program. This study addresses the complex problem of determining optimal compiler optimization sequences within the LLVM framework, which encompasses 64 optimization passes, causing in an immense search space of 264264. Identifying the ideal sequence for even simple code can be an arduous task, as the interactions between passes are intricate and unpredictable. The primary objective of this research is to utilize machine-learning techniques to predict effective optimization sequences that outperform the default -O2 and -O3 optimization flags. The methodology involves generating 2,000 sequences per program and picking the one that achieves the shortest execution time. Three machine learning models—K-Nearest Neighbor (KNN), Decision Tree (DT), and Feedforward Neural Network (FFNN)—were employed to predict the optimization sequences based on features extracted from programs during execution. The study used benchmarks from Polybench, Shootout, and Stanford suites, each with varying problem sizes, to validate the proposed technique. The results demonstrate that the KNN model produced optimization sequences with superior performance compared to DT and FFNN. On average, KNN achieved execution times that were 2.5 times faster than those achieved using the O3 optimization flag. This research contributes to the field by programming the process of selecting optimal compiler sequences, which significantly reduces execution time and eliminates the need for manual tuning. It highlights the potential of machine learning in compiler optimization, offering a robust and scalable approach to improving program performance and setting the foundation for future advancements in the domain.
Diyar Mohammed, Esraa Hadi Alwan, Ahmed Fanfakh
visibility 2568
download 3077
Full Length Article DOI: https://doi.org/10.54216/FPA.180120

Integrating Deep Learning Architecture with Pufferfish Optimization Algorithm for Real-Time Deepfake Video Detection and Classification Model

Deepfake is a technology employed in making definite videos, which are operated utilizing an artificial intelligence (AI) model named deep learning (DL). Deepfake videos were normally videos that cover activities grabbed by definite people but with another individual's face. Substitute of people appearances in videos utilizing the DL model. The technology of Deepfake permits humans to operate videos and images utilizing DL. The outcomes from deepfakes are challenging to differentiate utilizing normal vision. It is a combination of the words DL and fake, and it mostly denotes material shaped by deep neural networks (DNNs), which is a subclass of machine learning (ML). Deepfake denotes numerous modifications of face models, and integrates innovative technologies, with computer vision and DL. The detection of a deepfake model can be assumed as a dual classification procedure that can be categorized as the original or deepfake class. It works by removing features from the videos or images that is employed to distinguish between original and deepfake content. Therefore, this study proposes Leveraging Pufferfish Optimization and Deep Belief Network for an Enhanced Deepfake Video Detection (LPODBN-EDVD) technique. The LPODBN-EDVD technique intends to detect fake videos utilizing the DL model. In the presented LPODBN-EDVD technique, the data preprocessing stages include splitting the video into frames, face detection, and face cropping. For the process of feature extraction, the EfficientNet model is exploited. Besides, the deep belief network (DBN) classifier can be executed for deepfake video detection. Finally, the pufferfish optimization algorithm (POA) is employed for the optimal hyperparameter selection of the DBN classifier. A wide range of simulations was involved in exhibiting the promising results of the LPODBN-EDVD method. The experimental analysis pointed out the enhanced performance of the LPODBN-EDVD technique compared to recent approaches
Sameer Nooh
visibility 2477
download 3654
Full Length Article DOI: https://doi.org/10.54216/FPA.180119

Fusion Data Framework for Enhanced Outlier Detection Integrating Statistical and Machine Learning Techniques for Retail Analytics

This paper aims at presenting an overview of the most popular outlier detection methods that can be used in the retail sector to solve such important problems as fraud, inventory issues, and untypical customer behavior. The techniques discussed in this paper include the conventional statistical methods such as Z-score, Mahalanobis Distance, and Elliptic Envelope and the advanced machine learning methods such as Local Outlier Factor (LOF), Isolation Forest, and DBSCAN. Each method is discussed in detail and the advantages and disadvantages of each are evaluated in relation to different retail scenarios. The primary contribution of this study is the new approach to use Artificial Neural Networks (ANN) for tuning contamination parameters in the Elliptic Envelope model, which makes the anomaly detection more accurate and efficient. Furthermore, the study also depicts the application of min-max scaling for normalizing the features where it helps in reducing the effect of outliers and thus improves the model performance. The results show that the integration of the statistical and machine learning methods is very useful for the real-time detection of anomalies particularly in the ever-changing environment of the retail industry. This research presents a practical insight and new methodological approaches that may be useful for researchers and practitioners who develop outlier detection systems. The outcomes of this study have the potential of enhancing data fusion quality, workflow, and decision-making in the context of retailing.
Botirjon Karimov, Murodjon Sultanov, Jasurbek Nematullaev
visibility 2403
download 3217
Full Length Article DOI: https://doi.org/10.54216/FPA.180118

Enhanced Clustering Techniques for Marine Ad-Hoc Networks Using Dynamic PFCM

Ship Ad Hoc Networks (SANETs) are an integral part of modern maritime communication and shipping, characterized by dynamic topology and heavy traffic. Accurate node localization in SANETs is of great importance to ensure effective communication, security, and operational decisions. Traditional clustering algorithms, such as Fuzzy C-Means (FCM) and Possibilistic Fuzzy C-Means (PFCM), struggle with the dynamic and collaborative nature of SANETs, being sensitive to noise, outliers, and node distribution of rapidly changing. In this paper, a new clustering algorithm, the Dynamic Weighted Gradient-Based Possibilistic using Fuzzy C-Means (DWGB-PFCM), is specially designed to address the limitations of traditional methods in dynamic SANETs. The DWGB-PFCM contains dynamic weighted distances, flexible membership and uniqueness functions, and enhanced objective functions to improve robustness, adaptability, and efficiency of the cluster. Detailed data processing from the National Buoy Data Center (NDBC) combines spatial environmental parameters such as wind speed, atmospheric pressure, and wave characteristics to simulate real-world ocean challenges. Experimental results show that DWGB-PFCM outperforms traditional methods and separation measurements, with PFCM improving by 15.8%, decreasing by 22.2% in separation entropy, and decreasing by 32.1% in RMSE. In addition, DWGB-PFCM achieves a 15.0% improvement in computational efficiency over FCM. This research lays the foundation for further innovations in clustering algorithms designed for dynamic environments.
Ghufran Abdulameer, Yossra H. Ali
visibility 2061
download 3332
Full Length Article DOI: https://doi.org/10.54216/FPA.180117

Enhanced Entity Recognition of Islamic Hadiths based-on Hybrid LSTM and AraBERT Model

This paper focuses on the training, evaluation and development of named entity recognition (NER) models designed for Islamic hadiths in Arabic Utilizing the Hadith Noor dataset, the study uses the BIO (Basic, In, Out) tagging scheme to classify words or tokens in NER tasks and the segmentation of the text into individual tokens. The right-skewed distribution revealed by examining the lengths of the Islamic hadiths revealed a right-skewed distribution, indicating that shorter texts are more common. Texts less than 100 words were most prevalent, followed by texts between 100 and 200 words, while texts longer than 200 words were rare. The dataset identifies eight types of entities, such as common names among narrators and locations. The study by training the three models AraBERT, LSTM and the hybrid model AraBERT-LSTM on Arabic text processing respectively, the hybrid model showed a performance, efficiency and accuracy of 0.981, outperforming the rest of the models, confirming its worth and reliability in NER tasks for natural language in Arabic, especially Islamic hadiths, which opens the way for exploring further investigations for future research in natural language processing.
Wessam Lahmod Nados, Behrooz Minaei Bidgoli, Sayyed Sauleh Eetemadi et al.
visibility 2320
download 3827
Full Length Article DOI: https://doi.org/10.54216/FPA.180116

Intelligent Enhancement of Biometric Verification Using Deep Learning Technology

Biometric verification has grown into critical to privacy across areas such as finance and safe accessing services. The present study addresses the utilization of techniques for deep learning, namely convolutional neural networks (CNNs), to boost both the precision and dependability of biometric authentication. Researchers explore the effectiveness of these algorithms on collections containing genuine and forged banknote photos, taking into account information collecting obstacles such as operator condition changes and ambient conditions. The novelty shows an incredible proficiency in classification of 100%, with clarity, recall, and F1-scores of 1.00 across the two categories, demonstrating that the representation is excellent at discerning amongst legitimate and replica materials. Further, researchers investigate the effects of different design variables on efficiency and precision. This investigation provides important insights into merging deep learning with biometric data, laying the basis for future safe authorization developments.
Maha A. Al-Bayati
visibility 2207
download 4099
Full Length Article DOI: https://doi.org/10.54216/FPA.180115

A Comprehensive Review of Arabic and English Sentiment Analysis in BBC and SANAD News

News agencies connect global events to local communities. It plays a pivotal role in influencing public opinion. Thus, the necessity arises to recognize news article’s sentiment. The purpose of this paper is to analyze sentiment for English and Arabic news articles in terms of positivity, negativity, or neutrality. Analyzing the articles of Arabic and English news can be challenging from the perspective of morphology. In this paper, we introduce 4 Machine Learning methods, including Logistic Regression (LR), k Nearest Neighbors (KNN), Random Forests (RF) and Naive Bayes (NB), with the TF-IDF as the feature extraction. The study was validated using 2 data sets (BBC, SANAD Arabic news), and two learning models (Hold out and 10-fold cross-validation). The evaluation was based on; Accuracy (ACC), Precision (PREC), Recall (REC), F1-score (F1), and The Matthews Correlation Coefficient (MCC) where it shows an outstanding performance for ML on a 10-fold strategy. The experiments provided in the paper indicated that the proposed ML models achieved the best results.
Hassan Al-Sukhni, Qusay Bsoul, Sharaf Alzoubi et al.
visibility 2619
download 4258
Full Length Article DOI: https://doi.org/10.54216/FPA.180114

An Approach to Develop a Model to Detect the Phosphorus and Potassium Deficiency in Paddy Crop on Agriculture Farm Using DIP & ML

Excessive use of fertilizers harms the environment and disrupts plant habitats, while also raising costs for farmers. Proper timing and amounts of nutrients are crucial for plant health and environmental balance. The greenness of rice leaves indicates their chlorophyll and nutrient levels. Agronomy studies show rice plants need 10 nutrients, including primary ones like Nitrogen (N), Phosphorus (P), and Potassium (K), and secondary ones like Iron (Fe), Manganese (Mn), Copper (Cu), Zinc (Zn), Boron (B), Molybdenum (Mo), and Chlorine (Cl). Leaf nitrogen concentration (LNC) is highly correlated with chlorophyll content. There are several tools on LEAF+ to measure it, such as leaf color (LCC), SPAD, chlorophyll or nitrogen. Since these tools are cost-effective and not available to all farmers, LCC offers farmers the ability to estimate plant nitrogen needs in real-time for efficient fertilizer use and increased rice yield. Notable innovation in agriculture is the Leaf Color Chart (LCC), developed by Japanese experts. It measures chlorophyll levels in rice plants and aids in nitrogen management without harming the plant. Today, LCC is used globally to improve production efficiency and optimize nitrogen application rates. The remaining 2 major nutrients potassium and phosphorus can also be measured by experimentally expanding the available database of LCC, as has been done in the two models developed in this research paper.
Mohammad Arif Ali Usmani, Ausaf Ahmad
visibility 2573
download 3427
Full Length Article DOI: https://doi.org/10.54216/FPA.180113

Heart Failure Early Prediction Using Machine And Deep Learning Algorithm

In this article, we use machine learning approaches to give a thorough investigation into the prediction of cardiac illnesses and strokes. The Stroke Prediction Dataset and the Heart Failure Prediction Dataset are the two datasets that we use. Our objective is to maximize accuracy and minimize Mean Absolute Error (MAE) and Mean Squared Error (MSE) in order to enhance predictive performance. We use a variety of machine learning methods, such as Random Forests, Naive Bayes, Decision Trees, and k-Nearest Neighbors (KNN). We also use Artificial Neural Networks (ANN) and Multi-Layer Perceptrons (MLP) as deep learning models. We use oversampling approaches to rectify the imbalance in classes. For hyperparameter tweaking, we also use Grid Search and k-Fold Cross Validation. Our goal is to deliver valuable insights into early detection and preventive measures through comprehensive testing and assessment for prevention of strokes and heart diseases.
Lamis F. Al-Qora’n, Qusay Bsoul, Firas Zawaideh et al.
visibility 2348
download 4731
Full Length Article DOI: https://doi.org/10.54216/FPA.180112

Optimized Machine Learning Framework for SMS Spam Detection and Classification:A Comparative Evaluation

This paper presents an optimized framework for detecting SMS spam using advanced machine learning algorithms and natural language processing (NLP) techniques. Two datasets, the Filtering Mobile Phone Spam Dataset and the SMS Spam Collection Dataset, were utilized to evaluate the performance of various classifiers, including Multinomial Naive Bayes, K-Nearest Neighbors, Support Vector Classifier, Decision Trees, and AdaBoost. The methodology encompasses comprehensive data preprocessing steps, such as tokenization, stopword removal, and text normalization, followed by feature extraction using TF-IDF and Bag-of-Words models. The classifiers’ performances were evaluated using accuracy, precision, recall, and F1-score, alongside cross-validation techniques. Results indicate that Support Vector Classifier and AdaBoost consistently achieved superior accuracy in distinguishing between spam and ham messages. The study underscores the importance of data preprocessing and model optimization in enhancing spam detection accuracy, offering valuable insights for improving SMS filtering systems in cybersecurity applications.
Firas Zawaideh, Qusay Bsoul, Ala Alzoubi et al.
visibility 2211
download 2717
Full Length Article DOI: https://doi.org/10.54216/FPA.180111

A Review of Online Signature Recognition system

Biometrics has reached an important place in the field of authentication for both financial transactions and document verification. Signatures can be broadly classified into online and offline types, depending on how they are acquired. Captured through devices like tablets and digital pens, online signatures contain rich features concerning position, velocity, and acceleration; hence, they offer a better resistance to forgery compared to offline, more traditionally taken signatures. The review summarized the current research in online signature verification systems. There are methodologies and techniques deployed for feature extraction, data pre-processing, and classification. The main stages reviewed within the verification process are about data acquisition, including the use of several publicly available databases like DEEPSIGN, SVC2004 and MCYT-100. Wavelet transforms and Fourier analysis are discussed as a number of methods employed for feature extraction, showing good results about signature dynamics. This review follows the SLR approach for analysing and synthesizing relevant studies published between 2017 and 2024. This review uses PRISMA guidelines for the selection of studies, hence making the results methodologically rigorous and unbiased. The paper identifies commonly used algorithms, including CNN, RNN, and DTW, and examines popular signature databases by outlining their characteristics and relevance to system performance. The insights from this review will help in pointing towards the future ahead in online signature verification systems through emphasizing deep learning-based techniques along with realistic challenges.
Ibtisam Ghazi Nsaif, Sharifah Mumtazah Syed Ahmad, Syamsiah Bt. Mashohor et al.
visibility 2197
download 5848
Full Length Article DOI: https://doi.org/10.54216/FPA.180110

Sentiment Analysis on Amazon Reviews of Mobile Phones using Machine Learning

The world is witnessing a boom in the digital age. Digital shops have literally landed into our homes. Almost any required product can now be purchased online via websites or mobile apps without having to step out. Due to online shopping, many customers rely on online reviews from other customers before making a purchase. Customer reviews are gaining more and more importance as they play a probably vital role in the sale and purchase of a product. Customer reviews also provide firsthand feedback coming directly from the customers themselves; this can benefit even the sellers in improving future sales. Analyzing the reviews can provide probable causes for failure or success of a product. Henceforth, the current paper presents the sentiment analysis of the reviews to better understand the feelings expressed by the customers. The very popular and widely used mobile phones were chosen as the product and Amazon was chosen as the digital seller for the current study. Initially, this work began with data preprocessing. Followed by data preprocessing, Bow and n-grams word embedding have been used to represent the clean reviews in vector representation, and then the features were derived. Finally, the performance of supervised machine learning classifiers such as Decision Tree, Naive Bayes, Random Forest, and SVM was empirically evaluated through accuracy, recall, f1-score, and precision. The results of empirical evaluation revealed that the Random Forest Classifier shows best performance with 97.48% accuracy.
Shweta Singhal, Huda Lafta Majeed, Hassan Muayad Ibrahim et al.
visibility 3230
download 6739
Full Length Article DOI: https://doi.org/10.54216/FPA.180109

Efficient Data Processing Techniques for Structured Data Analysis Using Stream Pipeline Parallelism

 This research illustrates how dynamic task balancing and data sharing may improve distributed data processing. The technology handles parallel processing system difficulties with huge datasets by minimizing resource utilization, time complexity, and output. We modify the workload on the fly after splitting to ensure that all processing units receive equal work. One last optimization phase optimizes job distribution to maximize system efficiency. We test the solution for latency, speed, scalability, resource utilization, fault tolerance, and synchronization overhead. Results reveal that the new strategy outperforms existing ones in every regard. It features the lowest latency, quickest production, and highest growth potential. The approach handles mistakes well, divides data effectively, and syncs everything at a cheap cost. These properties make it ideal for real-time data processing and fast-growing applications. Future study will concentrate on flexible splitting strategies, fault tolerance mechanisms, and predictive analytics machine learning models. These modifications will improve real-time data handling.
Sampath Kini K., D. K. Sreekantha
visibility 2113
download 3292
Full Length Article DOI: https://doi.org/10.54216/FPA.180108

Real-Time Electric Vehicle Battery SOC Estimation Using Advanced Optimization Filtering Techniques

Improving the Extended Kalman Filter's (EKF) State of Charge (SOC) prediction for EV battery packs is the primary goal of this section. Optimised batteries management procedures rely on SOC estimate that is both accurate and reliable. The EKF is a popular tool for estimating nonlinear states, but how well it works relies heavily on which noise coefficient matrices are used (Q and R). Experimental testing and other conventional approaches of calibrating these matrix systems are extremely costly and time-consuming. In order to tackle this, the section delves into the integration of four state-of-the-art metaheuristic optimisation methods: GA, PSO, SFO, and HHO. By minimising the mean square error (MSE) among the real and expected SOC, these techniques optimise the Q and R matrices. When looking at preciseness, converging speed, and resilience, SFO-EKF comes out on top in both static and dynamic comparisons. By greatly improving the reliability of SOC estimations, the numerical results show that SFO-EKF obtains the lowest MSE & RMSE. This study advances electric car batteries by providing a realistic scheme for combining optimisation methods with EKF to offer highly effective and exact SOC estimates. When as opposed to TR-EKF, GA-EKF, PSO-EKF, and HHO-EKF, the SFO-EKF approach shows the best accuracy, with an improvement of over 94%. This is a result of the suggested model's exceptional efficiency in SOC estimates.
Hari Prasad Bhupathi, Srikiran Chinta, Vijayalaxmi Biradar et al.
visibility 2628
download 3284
Full Length Article DOI: https://doi.org/10.54216/FPA.180107

The Imperative Necessity of Erbil-Koya Highway Stretch

Human civilization encompasses all that humans have created, both materially and morally, within a specific time and place. Thus, building highway extensions represents a significant addition to the material aspects of civilization. Highways are a crucial component of human development, affecting societies in social, economic, environmental, urban, and cultural ways. Connecting Erbil with Koya via a highway is expected to affect the populations of both cities and their surrounding areas. This paper examines the role of highways in societal development, with a particular focus on Koya. We have demonstrated the importance of highway design through mathematical models using modern speed parameters, fuzzy logic, and control methods. Additionally, we proposed a method for managing highway speeds through radar and remote sensing technologies. The paper highlights the inevitable societal progress resulting from the Koya-Erbil highway connection.
Abdulqader Othman Hamadameen
visibility 2488
download 3498