Abstract | ||
---|---|---|
This paper proposes a new method of indexing Arabic-language text that contributes to improving the quality of information retrieval systems (IRS). The proposed method belongs to the semi-automatic category of indexing and consists of two types. The first type performs online indexing, whose output is a Partial index. The second type performs offline indexing, whose output is a General index. We illustrate the application and performance of this method using an Arabic text editor and an information retrieval tool designed and developed for this purpose. We also illustrate the process of building a new form of Arabic corpus suitable for conducting the necessary experiments. Our findings show that the online indexing model successfully identifies the descriptors most relevant to the document. In addition, this model is more efficient because it helps minimize index storage size and consequently improves the response time of requests. Finally, the paper proposes a solution to issues and deficiencies that Arabic language processing suffers from, especially regarding corpus building and the evaluation of information retrieval systems. |
Year | DOI | Venue |
---|---|---|
2018 | 10.31449/inf.v42i4.2297 | INFORMATICA-JOURNAL OF COMPUTING AND INFORMATICS |
Keywords | Field | DocType |
online indexing, offline indexing, semi-automatic indexing, Arabic keywords extraction, Arabic information retrieval system | Partial index, Arabic, Information retrieval, Computer science, Response time, Search engine indexing, Automatic indexing, Retrieval algorithm | Journal |
Volume | Issue | ISSN |
42 | 4 | 0350-5596 |
Citations | PageRank | References |
0 | 0.34 | 0 |
Authors | ||
3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Tahar Dilekh | 1 | 0 | 0.34 |
Benharzallah Saber | 2 | 6 | 6.58 |
Ali Behloul | 3 | 8 | 2.56 |