DSpace Repository

ArabiziVec: A Set of ArabiziWord Embeddings for Informal Arabic Sentiment Analysis

Show simple item record

dc.contributor.author Sharma, Yashvardhan
dc.date.accessioned 2024-11-12T09:52:21Z
dc.date.available 2024-11-12T09:52:21Z
dc.date.issued 2023
dc.identifier.uri https://www.sentic.net/arabizivec.pdf
dc.identifier.uri http://dspace.bits-pilani.ac.in:8080/jspui/handle/123456789/16350
dc.description.abstract The current circumstances of the Arab world have provided bloggers and commenters with various subjects to discuss. Therefore, Arabic-generated content in social media is ramping up continuously. An informal written form of spoken Arabic called Arabizi has recently emerged as a commonly used language in the Arabic space, attracting great interest for sentiment analysis tasks. However, only a few sentiment resources exist, and state-of-the-art language models such as BERT and FastText do not consider Arabizi yet. This paper presents the first version of ArabiziVec, a set of pre-trained distributed word representations. ArabiziVec provides six different word embedding models to deal with Arabizi sentiment analysis challenges. The presented work surpasses all of the baseline sets for each experiment, regardless of whether the test set is from a previously published dataset or an extracted one. To the best of our knowledge, this is one of the first few resources that deals with Arabizi content and semantics in the context of sentiment analysis en_US
dc.language.iso en en_US
dc.publisher Sentic en_US
dc.subject Computer Science en_US
dc.subject Sentiment analysis en_US
dc.subject Arabizi en_US
dc.subject Semantic models en_US
dc.subject Deep Learning (DL) en_US
dc.title ArabiziVec: A Set of ArabiziWord Embeddings for Informal Arabic Sentiment Analysis en_US
dc.type Article en_US


Files in this item

Files Size Format View

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account