64 71
Full Length Article
Journal of Cognitive Human-Computer Interaction
Volume 7 , Issue 2, PP: 17-26 , 2024 | Cite this article as | XML | Html |PDF

Title

Extraction of signal features in Voice Signals to train Machine Learning-based Classifier algorithms for Emotion Detection

  Simran Somani 1 * ,   Bhagyashree Shah 2 ,   Bhisaji C. Surve 3

1  Student of MBA TECH (IT), MPSTME, NMIMS University, Mumbai, India
    (simran.somani@nmims.in)

2  Student of MBA TECH (IT), MPSTME, NMIMS University, Mumbai, India
    (bhagyashree.shah@nmims.in)

3  Asst. Professor, Dept. of IT, MPSTME, NMIMS University, Mumbai, India
    (bhisaji.surve@nmims.edu)


Doi   :   https://doi.org/10.54216/JCHCI.070202

Received: October 28, 2023 Revised: January 24, 2024 Accepted: March 21, 2024

Abstract :

This research aims to detect human emotions using speech signals through the development and implementation of methodologies, namely the frequency domain synthesis. To achieve improved results, various machine learning and deep learning models were applied for implementation and their resulting model performance was analyzed. The research findings revealed that each model exhibited different accuracy rates for different emotions but weighted accuracy is best for deep learning based model. This study provides valuable insights into the feasibility and effectiveness of utilizing different methodologies and models for emotion detection through voice signals synthesis. The audio signals are synthesized for Mel-Frequency Cestrum Coefficients (MFCC), Chroma, and MEL characteristics, which are then used as features to train the various machine learning-based classifiers. Python libraries like Librosa, Sklearn, Pyaudio, Numpy, and sound files are used to analyze voice modulations and identify emotions.

Keywords :

MFCC; Emotion Detection; Machine Learning; Neural network.

References :

[1] Girija Deshmukh, Apurva Gaonkar, Gauri Golwalkar, Sukanya Kulkarni, “Speech based Emotion Recognition using Machine Learning”, Institute Of Electrical And Electronics Engineers, Mar. 2019.

[2] Peng Shi, "Speech Emotion Recognition Based on Deep Belief Network", Institute Of Electrical And Electronics Engineers, March 2018.

[3] Ajith Krishna R,Ankit Kumar,Vijay K. "An Automated Optimize Utilization of Water and Crop Monitoring in Agriculture Using IoT." Journal of Cognitive Human-Computer Interaction, Vol. 1, No. 1, 2021 ,PP. 37-45.

[4]  J. Uma Maheswari, A. Akila, "An Enhanced Human Speech Emotion Recognition Using Hybrid of PRNN and KNN", Institute of Electrical and Electronics Engineers, Feb 2019.

[5] Sri Raksha R. Gupta, M.S. Likitha, A. Upendra Raju and K. Hasitha “Speech Based Human Emotion Recognition Using MFCC”, Institute Of Electrical And Electronics Engineers, March 2017.

[6] Parvesh K,Tharun C,Prakash M. "Apparel Recommendation Engine Using Inverse Document Frequency and Weighted Average Word2vec." Journal of Cognitive Human-Computer Interaction, Vol. 1, No. 2, 2021 ,PP. 46-56.

[7] Tian Kexin, Huang Yongming, Zhang Guobao, Zhang Lin, "Research on Emergency Parking Instruction Recognition Based on Speech Recognition and Speech Emotion Recognition", Institute Of Electrical And Electronics Engineers, Nov. 2019.

[8] Ye Sim Ülgen Sonmez, Asaf Varol, "New Trends in Speech Emotion Recognition", Institute of Electrical and Electronics Engineers, June 2019.

[9] T. Giannakopoulos, A. Pikrakis and S. Theodoridis, "A dimensional approach to emotion recognition of speech from movies", IEEE Int'l Conf. Acoustics Speech and Signal Processing (ICASSP), 2009.

[10] Vijay K. "Collaborating The Textual Reviews Of The Merchandise and Foretelling The Rating Supported Social Sentiment." Journal of Cognitive Human-Computer Interaction, Vol. 1, No. 2, 2021 ,PP. 63 - 72.

[11]  A. Hanjalic, "Extracting moods from pictures and sounds: Towards truly personalized TV", IEEE Signal Proc. Magazine, vol. 23, no. 2, pp. 90-100, 2006.

[12]   L. Lu, D. Liu and H.J. Zhang, "Automatic mood detection and tracking of music audio signals", IEEE Trans. on audio speech and language processing, vol. 14, no. 1, pp. 5-18, 2006.

[13] A. Nogueiras, A. Moreno, A. Bonafonte and J.B. Marino, "Speech emotion recognition using hidden Markov models", INTERSPEECH, 2001.

[14]  R. Venkatesan ,Althaaf Shaik ,Suraj Kumar,Vipul Guria ,Abhishek Raj. "Intelligent Smart Dustbin System using Internet of Things (IoT) for Health Care." Journal of Cognitive Human-Computer Interaction, Vol. 1, No. 2, 2021 ,PP. 73 - 80.

[15] R. Plutchik and H. Kellerman, "Emotion: theory research and experience" in Theories of emotion, Academic Press, vol. 1, 1980.

[16] Y. Wang and L. Guan, "Recognizing human emotional state from audiovisual signals", IEEE Trans. on Multimedia, vol. 10, no. 5, pp. 936-946, 2008.

[17] P. Kavitha,R. Subha Shini,R. Priya. "An Implementation Of Statistical Feature Algorithms For The Detection Of Brain Tumor." Journal of Cognitive Human-Computer Interaction, Vol. 1, No. 2, 2021 ,PP. 57 - 62.

[18] Ling Cen, Fei Wu, Zhu Liang Yu, Fengye Hu, “A Real-Time Speech Emotion Recognition System and its Application in Online Learning”, Emotions, Technology, Design and Learning, ScienceDirect, 2016

[19] Mehmet Cen Sezgin,Bilg Gunsel & Gunes Kurt, Perceptual audio features for emotion detection, EURASIP Journal on Audio,Speech and Music Processing, 2012,Article number 16,Springer.

 

[20] Renu Taneja;Aman Bhatia;Javesh Monga;Purva Marwaha, Emotion detection of audio files,2016 3rd International Conference on Computing for Sustainable Global Development (INDIACom), IEEE Publication.


Cite this Article as :
Style #
MLA Simran Somani, Bhagyashree Shah, Bhisaji C. Surve. "Extraction of signal features in Voice Signals to train Machine Learning-based Classifier algorithms for Emotion Detection." Journal of Cognitive Human-Computer Interaction, Vol. 7, No. 2, 2024 ,PP. 17-26 (Doi   :  https://doi.org/10.54216/JCHCI.070202)
APA Simran Somani, Bhagyashree Shah, Bhisaji C. Surve. (2024). Extraction of signal features in Voice Signals to train Machine Learning-based Classifier algorithms for Emotion Detection. Journal of Journal of Cognitive Human-Computer Interaction, 7 ( 2 ), 17-26 (Doi   :  https://doi.org/10.54216/JCHCI.070202)
Chicago Simran Somani, Bhagyashree Shah, Bhisaji C. Surve. "Extraction of signal features in Voice Signals to train Machine Learning-based Classifier algorithms for Emotion Detection." Journal of Journal of Cognitive Human-Computer Interaction, 7 no. 2 (2024): 17-26 (Doi   :  https://doi.org/10.54216/JCHCI.070202)
Harvard Simran Somani, Bhagyashree Shah, Bhisaji C. Surve. (2024). Extraction of signal features in Voice Signals to train Machine Learning-based Classifier algorithms for Emotion Detection. Journal of Journal of Cognitive Human-Computer Interaction, 7 ( 2 ), 17-26 (Doi   :  https://doi.org/10.54216/JCHCI.070202)
Vancouver Simran Somani, Bhagyashree Shah, Bhisaji C. Surve. Extraction of signal features in Voice Signals to train Machine Learning-based Classifier algorithms for Emotion Detection. Journal of Journal of Cognitive Human-Computer Interaction, (2024); 7 ( 2 ): 17-26 (Doi   :  https://doi.org/10.54216/JCHCI.070202)
IEEE Simran Somani, Bhagyashree Shah, Bhisaji C. Surve, Extraction of signal features in Voice Signals to train Machine Learning-based Classifier algorithms for Emotion Detection, Journal of Journal of Cognitive Human-Computer Interaction, Vol. 7 , No. 2 , (2024) : 17-26 (Doi   :  https://doi.org/10.54216/JCHCI.070202)