Abstract | ||
---|---|---|
Language identification has become a prerequisite for all kinds of automated text processing systems. In this paper, we present a rule-based language identifier tool for two closely related Indo-Aryan languages: Hindi and Magahi. The system currently achieves an accuracy of approximately 86.34%, which we hope to improve in future work. Automatic language identification is expected to contribute significantly to the accuracy of web crawler output. |
Year | Venue | Field |
---|---|---|
2018 | arXiv: Computation and Language | Automatic language identification, Identifier, Computer science, Hindi, Language identification, Artificial intelligence, Natural language processing, Web crawler, Text processing |

DocType | Volume | Citations |
---|---|---|
Journal | abs/1804.05095 | 0 |

PageRank | References | Authors |
---|---|---|
0.34 | 3 | 3 |

Name | Order | Citations | PageRank |
---|---|---|---|
Priya Rani | 1 | 0 | 2.03 |
Atul Kr. Ojha | 2 | 2 | 4.38 |
Girish Nath Jha | 3 | 53 | 11.43 |