Word Sense Disambiguation for Arabic Exploiting Arabic WordNet and Word Embedding

Ali Alkhatlan; Jugal Kalita; Ahmed Alhaddad

doi:10.1016/j.procs.2018.10.460

Back

Word Sense Disambiguation for Arabic Exploiting Arabic WordNet and Word Embedding

Conference proceeding

Open access

Peer reviewed

Word Sense Disambiguation for Arabic Exploiting Arabic WordNet and Word Embedding

Ali Alkhatlan, Jugal Kalita and Ahmed Alhaddad

ARABIC COMPUTATIONAL LINGUISTICS, Vol.142, pp.50-60

Procedia Computer Science

01/01/2018

DOI: https://doi.org/10.1016/j.procs.2018.10.460

Abstract

Computer Science

Computer Science, Artificial Intelligence

Computer Science, Theory & Methods

Science & Technology

Technology

Word Sense Disambiguation (WSD) is a task which aims to identify the meaning of a word given its context. This problem has been investigated and analyzed in depth in English. However, work in Arabic has been limited despite the fact that there are half a billion native Arabic speakers. In this work, we present multiple approaches for the problem of WSD in Arabic utilizing recent developments and successes in learning word embeddings with approaches such as GloVe, and Word2vec. The primary shortcoming of word embeddings is the single vector representation of a word's meaning, although many words are polysemous. Our main contribution in this work is to computationally obtain an embedding for each sense, using an Arabic WordNet (AWN) to overcome the problem of WSD. We also compute word semantic similarity giving thought to multiple Arabic stemming algorithms. Finally, we make available a large pre-processed corpus that is ready to be used for further experiments and a WSD test data based on AWN,' seeking to fill gaps in Arabic NLP (ANLP) compared to English. (C) 2018 The Authors. Published by Elsevier B.V.

Files and links (1)

url

https://doi.org/10.1016/j.procs.2018.10.460View

Published (Version of record) Open

Metrics

1 Record Views

Details

Title: Word Sense Disambiguation for Arabic Exploiting Arabic WordNet and Word Embedding
Creators - without role: Ali Alkhatlan - King Abdulaziz University
Jugal Kalita - University of Colorado Colorado Springs
Ahmed Alhaddad - University of Denver
Contributors - without role: K Shaalan
ElBeltagy
Publication Details: ARABIC COMPUTATIONAL LINGUISTICS, Vol.142, pp.50-60
Series: Procedia Computer Science
Publisher: Elsevier
Number of pages: 11
Identifiers: 9939724308331
Academic Unit: King Abdulaziz University
Language: English
Resource Type: Conference proceeding