Title
Towards an automatic extraction of synonyms for Quranic Arabic WordNet
Abstract
In this paper, we developed an automatic extraction model of synonyms, which is used to construct our Quranic Arabic WordNet (QAWN) that depends on traditional Arabic dictionaries. In this work, we rely on three resources. First, the Boundary Annotated Quran Corpus that contains Quran words, Part-of-Speech, root and other related information. Second, the lexicon resources that was used to collect a set of derived words for Quranic words. Third, traditional Arabic dictionaries, which were used to extract the meaning of words with distinction of different senses. The objective of this work is to link the Quranic words of similar meanings in order to generate synonym sets (synsets). To accomplish that, we used term frequency and inverse document frequency in vector space model, and we then computed cosine similarities between Quranic words based on textual definitions that are extracted from traditional Arabic dictionaries. Words of highest similarity were grouped together to form a synset. Our QAWN consists of 6918 synsets that were constructed from about 8400 unique word senses, on average of 5 senses for each word. Based on our experimental evaluation, the average recall of the baseline system was 7.01 %, whereas the average recall of the QAWN was 34.13 % which improved the recall of semantic search for Quran concepts by 27 %.
Year
DOI
Venue
2016
10.1007/s10772-015-9301-9
I. J. Speech Technology
Keywords
Field
DocType
Quranic WordNet, Arabic dictionaries, Cosine similarity, Vector space model, Synonymy, Semantic relations
tf–idf,Semantic search,Cosine similarity,Computer science,Synonym,Speech recognition,Lexicon,Natural language processing,Artificial intelligence,Vector space model,WordNet,Recall
Journal
Volume
Issue
ISSN
19
2
1572-8110
Citations 
PageRank 
References 
0
0.34
9
Authors
3
Name
Order
Citations
PageRank
manal almaayah100.34
Majdi Sawalha2384.25
Mohammad A. M. Abushariah3476.02