Abstract | ||
---|---|---|
Arabic is one of the fastest growing languages on the Web, with an increasing amount of user generated content being published by both native and non-native speakers all over the world. Despite the great linguistic differences between Arabic and western languages such as English, most Arabic keyphrase extraction systems rely on approaches designed for western languages, thus ignoring its rich morphology and syntax. In this paper we present a new approach leveraging the Arabic morphology and syntax to generate a restricted set of meaningful candidates among which keyphrases are selected. Though employing a small set of well-known features to select the final keyphrases, our system consistently outperforms the well-known and established systems. |
Year | DOI | Venue |
---|---|---|
2016 | 10.1109/IALP.2016.7876001 | 2016 International Conference on Asian Language Processing (IALP) |
Keywords | Field | DocType |
Ar&hic Natural Language Processing,Keyphrase Extraction,Lemmatization,Stemming | User-generated content,Arabic,Digital subscriber line,Computer science,Artificial intelligence,Natural language processing,Open system (systems theory),Syntax | Conference |
ISSN | ISBN | Citations |
2159-1962 | 978-1-5090-0923-7 | 0 |
PageRank | References | Authors |
0.34 | 0 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Muhammad Helmy | 1 | 1 | 1.05 |
Dario De Nart | 2 | 34 | 7.70 |
Dante Degl'Innocenti | 3 | 13 | 4.45 |
Carlo Tasso | 4 | 511 | 84.98 |