Opinion mining for Arabic dialect in social media data fusion platforms: A systematic review

 

Hani D. Hejazi, Ahmed A. Khamees

 

Faculty of Engineering &IT, The British University in Dubai, UAE

 

Emails: hani.hejazi@gmail.com; khamisos@gmail.com

 

 

 

Abstract

The huge text generated on social media in Arabic, especially the Arabic dialect becomes more attractive for Natural Language Processing (NLP) to extract useful and structured information that benefits many domains. The more challenging point is that this content is mostly written in an Arabic dialect with a big data fusion challenge, and the problem with these dialects it has no written rules like Modern Standard Arabic (MSA) or traditional Arabic, and it is changing slowly but unexpectedly. One of the ways to benefit from this huge data fusion is opinion mining, so we introduce this systematic review for opinion mining from Arabic text dialect for the years from 2016 until 2019. We have found that Saudi, Egyptian, Algerian, and Jordanian are the most studied dialects even if it is still under development and need a bit more effort, nevertheless, dialects like Mauritanian, Yemeni, Libyan, and somalin have not been studied in this period. Many data fusion models that show a good result is the last four years have been discussed.

 

 

Keywords: Data Fusion; Arabic dialect; Natural language processing; Opinion mining; Systematic Review.