Benchmarking and assessing the performance of Arabic stemmers - Citegraph

Paper Info

Title
Benchmarking and assessing the performance of Arabic stemmers

Abstract
Previous studies on the stemming of the Arabic language lack fair evaluation, full description of algorithms used or access to the source code of the stemmers and the datasets used to evaluate such stemmers. Freeing source codes and datasets is an essential step to enable researchers to enhance stemmers currently in use and to verify the results of these studies. This study laid the foundation of establishing a benchmark for Arabic stemmers and presents an evaluation of four heavy (root-based) stemmers for the Arabic language. The evaluation aims to assess the accuracy of each of the four stemmers and to show the strength of each. The four algorithms are: Al-Mustafa stemmer, Al-Sarhan stemmer, Rabab芒聙聶ah stemmer and Taghva stemmer. The accuracy and strength tests used in this study ranked Rabab芒聙聶ah stemmer as the first followed by Al-Sarhan, Al-Mustafa, and Taghva stemmers respectively.

Year	DOI	Venue
2011	10.1177/0165551510392305	J. Information Science
Keywords	DocType	Volume
previous study,ah stemmer,Arabic language lack,Taghva stemmer,Freeing source code,fair evaluation,Arabic language,Al-Mustafa stemmer,Al-Sarhan stemmer,Arabic stemmers	Journal	37
Issue	ISSN	Citations
2	0165-5515	10
PageRank	References	Authors
0.99	10	3

Authors (3 rows)

Cited by (10 rows)

References (10 rows)

Name	Order	Citations	PageRank
Mohammed N. Al-Kabi	1	52	5.74
Qasem A. Al-Radaideh	2	77	8.64
Khalid W. Akkawi	3	10	0.99

1