Abstract |
---|
Given the steady increase in published and stored information in the form of unstructured Arabic texts, current Information Retrieval (IR) systems must adapt to the nature and requirements of this language to support accurate and efficient search. This paper sheds light on the challenges of Arabic IR (AIR) and proposes an approach for enhancing AIR by transforming these texts into structured XML documents using a document ontology and a set of linguistic grammars. Retrieval is then performed on the XML documents. The aim of such a system is to incorporate knowledge of the document structure and of specific content elements when computing the relevance of an information element. A query expansion module, based mainly on a domain ontology and a user profile, is proposed to improve the search results. |
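
The query expansion idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the ontology contents, the profile weights, the `expand_query` function, and the `min_weight` threshold are all hypothetical, and English placeholder terms stand in for Arabic ones. The sketch assumes that the domain ontology proposes related concepts and the user profile decides which of them are kept.

```python
from typing import Dict, List

# Hypothetical toy domain ontology: each term maps to related concepts.
DOMAIN_ONTOLOGY: Dict[str, List[str]] = {
    "contract": ["agreement", "obligation"],
    "court": ["tribunal", "judge"],
}

# Hypothetical user profile: interest weight per term, in [0, 1].
USER_PROFILE: Dict[str, float] = {"tribunal": 0.9, "obligation": 0.2}


def expand_query(
    terms: List[str],
    ontology: Dict[str, List[str]],
    profile: Dict[str, float],
    min_weight: float = 0.5,
) -> List[str]:
    """Append ontology neighbours of each query term, keeping only those
    whose user-profile weight reaches min_weight (an assumed policy)."""
    expanded = list(terms)
    seen = set(terms)
    for term in terms:
        for related in ontology.get(term, []):
            if related in seen:
                continue
            if profile.get(related, 0.0) >= min_weight:
                expanded.append(related)
                seen.add(related)
    return expanded


expanded_terms = expand_query(["court", "contract"], DOMAIN_ONTOLOGY, USER_PROFILE)
```
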
Year | DOI | Venue |
---|---|---|
2016 | 10.1007/978-3-319-42911-3_4 | PRICAI |
Keywords | Field | DocType |
Information retrieval, Arabic information retrieval, Unstructured data, Structured data | Human–computer information retrieval, Information retrieval, Query expansion, Computer science, Document Structure Description, Unstructured data, Information extraction, Natural language processing, Relevance (information retrieval), Artificial intelligence, Document retrieval, Concept search | Conference |
Volume | ISSN | Citations |
9810 | 0302-9743 | 0 |
PageRank | References | Authors |
0.34 | 5 | 2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Imen Bouaziz Mezghanni | 1 | 7 | 2.27 |
Faïez Gargouri | 2 | 244 | 92.29 |