Abstract | ||
---|---|---|
Morphological analysis is a fundamental task in natural-language processing, which is used in other NLP applications such as part-of-speech tagging, syntactic parsing, information retrieval, machine translation, etc. In this paper, we present our work on the development of free/open-source finite-state morphological analyser for Sindhi. We have used Apertium's lttoolbox as our finite-state toolkit to implement the transducer. The system is developed using a paradigm-based approach, wherein a paradigm defines all the word forms and their morphological features for a given stem (lemma). We have evaluated our system on the Sindhi Wikipedia, which is a freely-available large corpus of Sindhi and achieved a reasonable coverage of about 81% and a precision of over 97%. |
Year | Venue | Keywords |
---|---|---|
2016 | LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION | Sindhi,Morphological Analysis,Finite-State Machines |
Field | DocType | Citations |
Analyser,Computer science,Speech recognition,Finite state,Natural language processing,Sindhi,Artificial intelligence | Conference | 1 |
PageRank | References | Authors |
0.36 | 0 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Raveesh Motlani | 1 | 1 | 0.36 |
Francis M. Tyers | 2 | 128 | 21.76 |
Dipti Misra Sharma | 3 | 262 | 45.90 |