Subjectivity and Sentiment Analysis of Arabic Twitter Feeds with Limited Resources

Eshrag Refaee; Verena Rieser

Back

Conference proceeding

Subjectivity and Sentiment Analysis of Arabic Twitter Feeds with Limited Resources

Eshrag Refaee and Verena Rieser

LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION

01/01/2014

Abstract

Language & Linguistics

Linguistics

Social Sciences

This paper addresses the task of automatic Subjectivity and Sentiment Analysis (SSA) for Arabic tweets. This is a challenging task because, first, there are no freely available annotated corpora available for this task, and second, most natural language processing (NLP) tools for Arabic are developed for Modern Standard Arabic (MSA) only and fail to capture the wide range of dialects used in Arabic micro-blogs. In the following paper we show that, despite these challenges, we are able to learn a SSA classifier from limited amounts of manually annotated data, which reaches performance levels of up to 87.7% accuracy using cross-validation. However, an evaluation on a independent test set shows that these static models do not transfer well to new data sets, collected at a later point in time. An error analysis confirms that this drop in performance is due to topic-shifts in the twitter stream. Our next step is to extend our current models to perform semi-supervised online learning in order to continuously adapt to the dynamic nature of online media.

Metrics

1 Record Views

Details

Title: Subjectivity and Sentiment Analysis of Arabic Twitter Feeds with Limited Resources
Creators - without role: Eshrag Refaee - Heriot-Watt University
Verena Rieser - Heriot Watt Univ, Interact Lab, Edinburgh EH14 4AS, Midlothian, Scotland
Contributors - without role: N Calzolari
K Choukri
T Declerck
H Loftsson
B Maegaard
J Mariani
A Moreno
J Odijk
S Piperidis
Publication Details: LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION
Publisher: European Language Resources Assoc-Elra
Number of pages: 6
Identifiers: 9917984708331
Academic Unit: Jazan University
Language: English
Resource Type: Conference proceeding