Developing Lexicon-based Algorithms and Sentiment Lexicon for Sentiment Analysis of Saudi Dialect Tweets

Waleed Al-Ghaith

doi:10.14569/IJACSA.2019.0101112

Journal article

Open access

International journal of advanced computer science & applications, Vol.10(11), pp.83-88

2019

DOI: https://doi.org/10.14569/IJACSA.2019.0101112

Abstract

Computer Science

Computer Science, Theory & Methods

Science & Technology

Technology

Most studies on sentiment analysis, particularly those taking an Arabic lexicon-based approach, focus on preprocessing the target text or textual data collected from Twitter (the Twitter dataset) rather than on the lexicon itself. This study proposes a new method: we concentrate first on building a new sentiment lexicon with a reasonable number of words, and then apply adequate preprocessing to the lexicon's words as well as to the Twitter dataset. The study presents a Saudi dialect sentiment lexicon called SaudiSentiPlus, which contains 7139 words, mostly generated from Saudi tweets and other dictionaries. It also presents two lexicon-based algorithms for the Saudi dialect that handle prefix and suffix letters in order to increase the performance of the proposed lexicon. The experiment conducted to evaluate the performance of SaudiSentiPlus comprises four phases, and precision, recall, accuracy, and F-score are measured in every phase. We built our testing dataset from Twitter by focusing on Saudi dialect hashtags (971 thousand tweets from 162 hashtags). The results show that the accuracy of SaudiSentiPlus with the two lexicon-based algorithms reached 81%.
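The abstract does not spell out the two algorithms, so the following is only a minimal sketch of the general idea of affix-aware lexicon lookup and of the four reported measures. Everything in it is an assumption made for illustration: the toy lexicon entries, the prefix and suffix lists, the minimum stem length, and the function names are hypothetical and are not taken from SaudiSentiPlus or from the paper.

```python
from typing import Dict, List, Sequence

# Hypothetical toy data for illustration only (not SaudiSentiPlus entries).
LEXICON: Dict[str, int] = {"جميل": 1, "سيئ": -1}
PREFIXES = ("وال", "بال", "ال", "و", "ب", "ل")  # assumed prefix letters
SUFFIXES = ("ها", "ين", "ون", "ة")              # assumed suffix letters
MIN_STEM = 2  # assumed guard against stripping a word down to nothing


def candidate_forms(word: str) -> List[str]:
    """Return the word plus forms with one prefix and/or one suffix removed."""
    forms = [word]
    forms += [word[len(p):] for p in PREFIXES if word.startswith(p)]
    for form in list(forms):
        forms += [form[:-len(s)] for s in SUFFIXES if form.endswith(s)]
    return [f for f in forms if len(f) >= MIN_STEM]


def word_polarity(word: str) -> int:
    """Look up the first candidate form found in the lexicon; 0 if none."""
    for form in candidate_forms(word):
        if form in LEXICON:
            return LEXICON[form]
    return 0


def classify(tweet: str) -> str:
    """Sum word polarities over whitespace tokens and map the sign to a label."""
    score = sum(word_polarity(w) for w in tweet.split())
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


def binary_metrics(gold: Sequence[str], pred: Sequence[str],
                   positive: str = "positive") -> Dict[str, float]:
    """Precision, recall, and F-score for one class, plus overall accuracy."""
    pairs = list(zip(gold, pred))
    tp = sum(g == positive and p == positive for g, p in pairs)
    fp = sum(g != positive and p == positive for g, p in pairs)
    fn = sum(g == positive and p != positive for g, p in pairs)
    correct = sum(g == p for g, p in pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f_score = (2 * precision * recall / (precision + recall)
               if precision + recall else 0.0)
    accuracy = correct / len(pairs) if pairs else 0.0
    return {"precision": precision, "recall": recall,
            "f_score": f_score, "accuracy": accuracy}
```

In this sketch a word that is not in the lexicon in its surface form can still match after one prefix and one suffix are removed, which is the kind of gain the abstract attributes to handling prefix and suffix letters. How the published algorithms actually choose and order the affixes is not stated in this record.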

Files and links (1)

url

https://doi.org/10.14569/IJACSA.2019.0101112

Published (Version of record) Open

Details

Title: Developing Lexicon-based Algorithms and Sentiment Lexicon for Sentiment Analysis of Saudi Dialect Tweets
Creators - without role: Waleed Al-Ghaith - Imam Mohammad ibn Saud Islamic University
Publication Details: International journal of advanced computer science & applications, Vol.10(11), pp.83-88
Publisher: Science & Information (SAI) Organization Ltd
Number of pages: 6
Identifiers: 9916197708331
Academic Unit: Imam Mohammad Ibn Saud Islamic University (IMSIU)
Language: English
Resource Type: Journal article