386 239
Full Length Article
Fusion: Practice and Applications
Volume 13 , Issue 2, PP: 136-144 , 2023 | Cite this article as | XML | Html |PDF

Title

A Tagging Model using Segmentation Proposal Network

  Suha Dh. Athab 1 * ,   Abdulamir A. Karim 2

1  Department of Computer Science, University of Technology, Bagdad, Iraq
    (suha.athab@gmail.com)

2  Department of Computer Science, University of Technology, Bagdad, Iraq
    (110004@uotechnology.edu.iq)


Doi   :   https://doi.org/10.54216/FPA.130212

Received: April 15, 2023 Revised: July 26, 2023 Accepted: October 08, 2023

Abstract :

This paper presents a tagging model used the Segmentation map as reference regions. The suggested model leverages an encoder-decoder architecture combined with a proposal layer and dense layers for accurate object tagging and segmentation. The proposed model utilizes a pre-trained VGG16 encoder to extract high-level features from input images, followed by a decoder network that reconstructs the image. A proposal layer generates a binary map indicating the presence or absence of objects at each location in the image. The proposal layer is integrated with the decoder output and further refined by a convolutional layer to produce the final segmentation. Two dense layers are employed to predict object classes and bounding box coordinates. The model is trained using a custom loss function that combines categorical cross-entropy loss and means squared error loss. Experimental results demonstrate the effectiveness of the proposed model in achieving accurate object tagging and segmentation.

Keywords :

Tagging; Encoder decoder; Semantic segmentation; Object detection

References :

[1]    Mohamed Saber, El-Sayed M. El-Kenawy, Abdelhameed Ibrahim,Marwa M. Eid, Watermarking System for Medical Images Using Optimization Algorithm, Fusion: Practice and Applications, Vol. 10 , No. 1 , (2023) : 89-99 (Doi   :  https://doi.org/10.54216/FPA.100105)

[2]    Khder Alakkari, Alhumaima Ali Subhi, Hussein Alkattan, Ammar Kadi, Artem Malinin, Irina Potoroko, Mostafa Abotaleb, El-Sayed M El-kenawy,  Forecasting COVID-19 Infection Using Encoder-Decoder LSTM and Attention LSTM Algorithms,  Journal of Intelligent Systems and Internet of Things,  Vol. 8 ,  No. 2 ,  (2023) : 20-33 (Doi   :  https://doi.org/10.54216/JISIoT.080202)

[3]    F. Lateef and Y. J. N. Ruichek, "Survey on semantic segmentation using deep learning techniques," vol. 338, pp. 321-348, 2019.

[4]    J. M. Stokes et al., "A deep learning approach to antibiotic discovery," vol. 180, no. 4, pp. 688-702. e13, 2020.

[5]    J. Long, E. Shelhamer, and T. Darrell, "Fully convolutional networks for semantic segmentation," in Proceedings of the IEEE conference on computer vision and pattern recognition, 2015, pp. 3431-3440.

[6]    T. Kong, A. Yao, Y. Chen, and F. Sun, "Hypernet: Towards accurate region proposal generation and joint object detection," in Proceedings of the IEEE conference on computer vision and pattern recognition, 2016, pp. 845-853.

[7]    Luis Freire Lescano, Marcos Lalama Flores, Maria Pico Pico,  Distributed Facial Recognition Facial Recognition in Visual Internet of Things (VIoT) An Intelligent Approach,  Journal of Intelligent Systems and Internet of Things,  Vol. 10 ,  No. 2 ,  (2023) : 18-23 (Doi   :  https://doi.org/10.54216/JISIoT.100202)

[8]    J. Fu et al., "Dual attention network for scene segmentation," in Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2019, pp. 3146-3154.

[9]    Enas Kh. Hassan, Jamila Harbi S.,  3D Hand Pose and Shape Estimation from Single RGB Image for Augmented Reality,  Journal of Intelligent Systems and Internet of Things,  Vol. 10 ,  No. 2 ,  (2023) : 90-101 (Doi   :  https://doi.org/10.54216/JISIoT.100208)

[10]  M. Sandler, A. Howard, M. Zhu, A. Zhmoginov, and L.-C. Chen, "Mobilenetv2: Inverted residuals and linear bottlenecks," in Proceedings of the IEEE conference on computer vision and pattern recognition, 2018, pp. 4510-4520.

[11]  H. Zhao, J. Shi, X. Qi, X. Wang, and J. Jia, "Pyramid scene parsing network," in Proceedings of the IEEE conference on computer vision and pattern recognition, 2017, pp. 2881-2890.

[12]  J. Dai, K. He, and J. Sun, "Instance-aware semantic segmentation via multi-task network cascades," in Proceedings of the IEEE conference on computer vision and pattern recognition, 2016, pp. 3150-3158.

[13]  E. Temlioglu, I. Erer, and D. Kumlu, "A least mean square approach to buried object detection in ground penetrating radar," in 2017 IEEE International Geoscience and Remote Sensing Symposium (IGARSS), 2017, pp. 4833-4836: IEEE.

[14]  Z. Zhang and M. J. A. i. n. i. p. s. Sabuncu, "Generalized cross entropy loss for training deep neural networks with noisy labels," vol. 31, 2018.

[15]  T.-Y. Lin et al., "Microsoft coco: Common objects in context," in Computer Vision–ECCV 2014: 13th European Conference, Zurich, Switzerland, September 6-12, 2014, Proceedings, Part V 13, 2014, pp. 740-755: Springer.

[16]  Z. Zheng, P. Wang, W. Liu, J. Li, R. Ye, and D. Ren, "Distance-IoU loss: Faster and better learning for bounding box regression," in Proceedings of the AAAI conference on artificial intelligence, 2020, vol. 34, no. 07, pp. 12993-13000.

[17]  Z. Hao et al., "Automated tree-crown and height detection in a young forest plantation using mask region-based convolutional neural network (Mask R-CNN)," vol. 178, pp. 112-123, 2021.

[18]       Charchekhandra, B., " The Reading and Analyzing Of The Brain Electrical Signals To Execute a    Control Command and Move an Automatic Arm", Pure Mathematics for Theoretical Computer Science, Vol 1, 2023.

 [19] S. Liu, L. Qi, H. Qin, J. Shi, and J. Jia, "Path aggregation network for instance segmentation," in Proceedings of the IEEE conference on computer vision and pattern recognition, 2018, pp. 8759-8768.

[20]  K. Chen et al., "Hybrid task cascade for instance segmentation," in Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2019, pp. 4974-4983.

[21]  P. Sun et al., "Sparse r-cnn: End-to-end object detection with learnable proposals," in Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2021, pp. 14454-14463.

[22]        Nadweh, R., " On The Fusion of Neural Networks and Fuzzy Logic, Membership Functions and Weights", Galoitica Journal Of Mathematical Structures and Applications, Vol 7, 2023.

 [23] N. Muhammed Noori,Omar Saber Qasim, Deep Features Selections with Binary Marine Predators Algorithm for Effective Classification of Image Datasets, Fusion: Practice and Applications, Vol. 10 , No. 2 , (2023) : 86-94 (Doi   :  https://doi.org/10.54216/FPA.100208)

[24]  T. Kong, F. Sun, H. Liu, Y. Jiang, and J. J. a. p. a. Shi, "FoveaBox: Beyond anchor-based object detector. arXiv 2019," vol. 2, no. 5.

[25]  Entisar Y. Abd Al-Jabbar,Marwa M. Mohamedsheet Al-Hatab,Maysaloon Abed Qasim,Wameedh Raad Fathel,Maan Ahmed Fadhil, Clinical Fusion for Real-Time Complex QRS Pattern Detection in Wearable ECG Using the Pan-Tompkins Algorithm, Fusion: Practice and Applications, Vol. 12 , No. 2 , (2023) : 172-184 (Doi   :  https://doi.org/10.54216/FPA.120214)


Cite this Article as :
Style #
MLA Suha Dh. Athab, Abdulamir A. Karim. "A Tagging Model using Segmentation Proposal Network." Fusion: Practice and Applications, Vol. 13, No. 2, 2023 ,PP. 136-144 (Doi   :  https://doi.org/10.54216/FPA.130212)
APA Suha Dh. Athab, Abdulamir A. Karim. (2023). A Tagging Model using Segmentation Proposal Network. Journal of Fusion: Practice and Applications, 13 ( 2 ), 136-144 (Doi   :  https://doi.org/10.54216/FPA.130212)
Chicago Suha Dh. Athab, Abdulamir A. Karim. "A Tagging Model using Segmentation Proposal Network." Journal of Fusion: Practice and Applications, 13 no. 2 (2023): 136-144 (Doi   :  https://doi.org/10.54216/FPA.130212)
Harvard Suha Dh. Athab, Abdulamir A. Karim. (2023). A Tagging Model using Segmentation Proposal Network. Journal of Fusion: Practice and Applications, 13 ( 2 ), 136-144 (Doi   :  https://doi.org/10.54216/FPA.130212)
Vancouver Suha Dh. Athab, Abdulamir A. Karim. A Tagging Model using Segmentation Proposal Network. Journal of Fusion: Practice and Applications, (2023); 13 ( 2 ): 136-144 (Doi   :  https://doi.org/10.54216/FPA.130212)
IEEE Suha Dh. Athab, Abdulamir A. Karim, A Tagging Model using Segmentation Proposal Network, Journal of Fusion: Practice and Applications, Vol. 13 , No. 2 , (2023) : 136-144 (Doi   :  https://doi.org/10.54216/FPA.130212)