Generating a Lexicon for the Hijazi Dialect in Arabic

Fatimah Abdullah Alqahtani; Mark Sanderson

doi:10.1007/978-3-030-32959-4_1

Conference proceeding

Peer reviewed

ARABIC LANGUAGE PROCESSING: FROM THEORY TO PRACTICE, ICALP 2019, Vol.1108, pp.3-17

Communications in Computer and Information Science

01/01/2019

Subjects: Computer Science; Computer Science, Artificial Intelligence; Computer Science, Theory & Methods; Linguistics; Science & Technology; Social Sciences; Technology

Abstract

We present a methodology for creating a lexicon for Hijazi, a low-resource Arabic dialect spoken in Saudi Arabia. We show the differences between the Hijazi dialect and Modern Standard Arabic. Recruited native speakers annotate articles and tweets. We create a Hijazi lexicon adapted from two resources: Sebawai and the Quranic Arabic Corpus. The lexicon is created both manually and automatically, using Hijazi morphology. We detail the methodology used to build this lexicon and present results of an evaluation of the corpus formation process.

Details

Title: Generating a Lexicon for the Hijazi Dialect in Arabic
Creators: Fatimah Abdullah Alqahtani (RMIT University); Mark Sanderson (RMIT University)
Contributors: K Smaili
Publication Details: ARABIC LANGUAGE PROCESSING: FROM THEORY TO PRACTICE, ICALP 2019, Vol.1108, pp.3-17
Series: Communications in Computer and Information Science
Publisher: Springer Nature
Number of pages: 15
Identifiers: 9922368108331
Academic Unit: King Khalid University
Language: English
Resource Type: Conference proceeding