The Effect of using Light Stemming for Arabic Text Classification

Jaffar Atwan; Mohammad Wedyan; Qusay Bsoul; Ahmad Hamadeen; Ryan Alturki; Mohammed Ikram

doi:10.14569/IJACSA.2021.0120589

Back

The Effect of using Light Stemming for Arabic Text Classification

Journal article

Open access

The Effect of using Light Stemming for Arabic Text Classification

Jaffar Atwan, Mohammad Wedyan, Qusay Bsoul, Ahmad Hamadeen, Ryan Alturki and Mohammed Ikram

International journal of advanced computer science & applications, Vol.12(5), pp.768-773

2021

DOI: https://doi.org/10.14569/IJACSA.2021.0120589

Abstract

Computer Science

Computer Science, Theory & Methods

Science & Technology

Technology

Arabic is one of the Semitic languages in antiquity and one of the six official languages of the UN. Also, Arabic classification plays a significant and essential role in modern applications. There is a big difference between handling English text and Arabic text classification; preprocessing is also challenging for Arabic text. This paper presents the implementation of a Naive Bayes classifier for Arabic text with and without stemmer. A set of four categories and 800 documents were used from the Text Retrieval Conference (TREC) 2001 dataset. The results showed that Naive Bayes with light stemmer achieves better results than Naive Bayes without stemmer. The findings of the classifier accuracy by employing stemmer and without stemmer are as preprocessing. It reveals that the accuracy resulted from the light stemmer was better than the classifier without stemmer detection, which Naive Bayes Classification with light stemmer got 35.0745 higher than the Naive Bayes Classification 33.831% without stemmer. After contrasting them, the stemmer got better accuracy than the classifier.

Files and links (1)

url

https://doi.org/10.14569/IJACSA.2021.0120589View

Published (Version of record) Open

Metrics

1 Record Views

Details

Title: The Effect of using Light Stemming for Arabic Text Classification
Creators - without role: Jaffar Atwan - Al Balqa Appl Univ, Dept Comp Informat Syst, Al Salt, Jordan
Mohammad Wedyan - Al Balqa Appl Univ, Fac Artificial Intelligence, Al Salt, Jordan
Qusay Bsoul - Univ Sains Islam Malaysia, Fac Sci & Technol, Bandar Baru Nilai, Malaysia
Ahmad Hamadeen - Al Balqa Appl Univ, Dept Comp Sci, Al Salt, Jordan
Ryan Alturki - Umm al-Qura University
Mohammed Ikram - Umm al-Qura University
Publication Details: International journal of advanced computer science & applications, Vol.12(5), pp.768-773
Publisher: Science & Information Sai Organization Ltd
Number of pages: 6
Identifiers: 9931270008331
Academic Unit: Umm Al Qura University
Language: English
Resource Type: Journal article