SDCT: Multi-Dialects Corpus Classification for Saudi Tweets

Afnan Bayazed; Ola Torabah; Redha AlSulami; Dimah Alahmadi; Amal Babour; Kawther Saeedi

doi:10.14569/IJACSA.2020.0111128

Back

SDCT: Multi-Dialects Corpus Classification for Saudi Tweets

Journal article

Open access

SDCT: Multi-Dialects Corpus Classification for Saudi Tweets

Afnan Bayazed, Ola Torabah, Redha AlSulami, Dimah Alahmadi, Amal Babour and Kawther Saeedi

International journal of advanced computer science & applications, Vol.11(11), pp.216-223

2020

DOI: https://doi.org/10.14569/IJACSA.2020.0111128

Abstract

Computer Science

Computer Science, Theory & Methods

Science & Technology

Technology

There is an increasing demand for analyzing the contents of social media. However, the process of sentiment analysis in Arabic language especially Arabic dialects can be very complex and challenging. This paper presents details of collecting and constructing a classified corpus of 4180 multi-dialectal Saudi tweets (SDCT). The tweets were annotated manually by five native speakers in two stages. The first stage annotated the tweets as Hijazi, Najdi, and Eastern based on some Saudi regions. The second stage annotated the sentiment as positive, negative, and natural. The annotation process was evaluated using Kappa Score. The validation process used cross validation technique through eight baseline experiments for training different classifier models. The results present that the 10-folds validation provides greater accuracy than 5-folds across the eight experiments and the classification of the Eastern dialects achieved the best accuracy compared to the other dialects with an accuracy of 91.48%.

Files and links (1)

url

https://doi.org/10.14569/IJACSA.2020.0111128View

Published (Version of record) Open

Metrics

3 Record Views

Details

Title: SDCT: Multi-Dialects Corpus Classification for Saudi Tweets
Creators - without role: Afnan Bayazed - King Abdulaziz University
Ola Torabah - King Abdulaziz University
Redha AlSulami - King Abdulaziz University
Dimah Alahmadi - King Abdulaziz University
Amal Babour - King Abdulaziz University
Kawther Saeedi - King Abdulaziz University
Publication Details: International journal of advanced computer science & applications, Vol.11(11), pp.216-223
Publisher: Science & Information Sai Organization Ltd
Number of pages: 8
Identifiers: 9939029608331
Academic Unit: King Abdulaziz University
Language: English
Resource Type: Journal article