Constructing a Lexicon of Arabic-English Named Entity using SMT and Semantic Linked Data

Emna Hkiri; Souheyl Mallat; Mounir Zrigui; Mourad Mars

Back

Constructing a Lexicon of Arabic-English Named Entity using SMT and Semantic Linked Data

Journal article

Peer reviewed

Constructing a Lexicon of Arabic-English Named Entity using SMT and Semantic Linked Data

Emna Hkiri, Souheyl Mallat, Mounir Zrigui and Mourad Mars

International arab journal of information technology, Vol.14(6), pp.820-825

01/11/2017

Abstract

Computer Science

Computer Science, Artificial Intelligence

Computer Science, Information Systems

Engineering

Engineering, Electrical & Electronic

Science & Technology

Technology

Named Entity Recognition (NER) is the problem of locating and categorizing atomic entities in a given text. In this work, we used DBpedia Linked datasets and combined existing open source tools to generate from a parallel corpus a bilingual lexicon of Named Entities (NE). To annotate NE in the monolingual English corpus, we used linked data entities by mapping them to Gate Gazetteers. In order to translate entities identified by the gate tool from the English corpus, we used moses, a Statistical Machine Translation (SMT) system. The construction of the Arabic-English NE lexicon is based on the results of moses translation. Our method is fully automatic and aims to help Natural Language Processing (NLP) tasks such as, Machine Translation (MT) information retrieval, text mining and question answering. Our lexicon contains 48753 pairs of Arabic-English NE, it is freely available for use by other researchers.

Metrics

1 Record Views

Details

Title: Constructing a Lexicon of Arabic-English Named Entity using SMT and Semantic Linked Data
Creators - without role: Emna Hkiri - University of Monastir
Souheyl Mallat - University of Monastir
Mounir Zrigui - University of Monastir
Mourad Mars - University of Monastir
Publication Details: International arab journal of information technology, Vol.14(6), pp.820-825
Publisher: Zarka Private Univ
Number of pages: 6
Identifiers: 9931656708331
Academic Unit: Umm Al Qura University
Language: English
Resource Type: Journal article