Pre-indexing Techniques in Arabic Information Retrieval

Souheila Ben Guirat; Ibrahim Bounhas; Yahia Slimani

doi:10.5220/0007393402370246

Back

Pre-indexing Techniques in Arabic Information Retrieval

Conference proceeding

Open access

Pre-indexing Techniques in Arabic Information Retrieval

Souheila Ben Guirat, Ibrahim Bounhas and Yahia Slimani

PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE (ICAART), VOL 2, Vol.2, pp.237-246

01/01/2019

DOI: https://doi.org/10.5220/0007393402370246

Abstract

Computer Science

Computer Science, Artificial Intelligence

Science & Technology

Technology

Arabic document indexing is yet challenging given the morphological specificities of this language. Although there has been much effort in the field, developing more efficient indexing approaches is more and more demanding. One of the most important issues concerns the choice of the indexing units (e. g. stems, roots, lemmas, etc.) which both enhances retrieval efficiency and optimizes the indexing process. The question is how to process Arabic texts to retrieve the basic forms which better reflect the meaning of words and documents? In the literature several indexing units have been compared, while combining multiple indexes seems to be promising. In our previous works, we showed that hybrid indexes based on stems, patterns and roots enhances results. However, we need to find the optimal weight of each indexing unit. Therefore, this paper proposes to contribute in optimizing hybrid indexing. We compare and evaluate four pre-indexing methods.

Files and links (1)

url

https://doi.org/10.5220/0007393402370246View

Published (Version of record) Open

Metrics

1 Record Views

Details

Title: Pre-indexing Techniques in Arabic Information Retrieval
Creators - without role: Souheila Ben Guirat - Prince Sattam Bin Abdulaziz University
Ibrahim Bounhas - University of Carthage
Yahia Slimani - Carthage College
Contributors - without role: A P Rocha
L Steels
J VanDenHerik
Publication Details: PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE (ICAART), VOL 2, Vol.2, pp.237-246
Publisher: Scitepress
Number of pages: 10
Identifiers: 9915513908331
Academic Unit: Imam Abdulrahman Bin Faisal University
Language: English
Resource Type: Conference proceeding