Title
Arabic Script Documents Language Identifications Using Fuzzy ART
Abstract
The volume of information available on the internet, digital libraries and newsgroup has increased dramatically in recent years. Therefore, there is a growing interest in helping user better find, filter, and manage these resources. Language identification is the first step of understanding text documents which is written in. It is usually a module within multilingual application. In this paper, we introduce language identification of Arabic script documents by letter frequency. Technique used for identification is fuzzy adaptive resonance theory (ART), which is belong to the neural network architectures that perform incremental unsupervised learning. Arabic script documents such as Arabic, Persian and Urdu were used for performing language identification. From the experiments, we have found that fuzzy ART is particularly promising in terms of accuracy on language identification.
Year
DOI
Venue
2008
10.1109/AMS.2008.47
Asia International Conference on Modelling and Simulation
Keywords
Field
DocType
fuzzy adaptive resonance theory,letter frequency,digital library,language identifications,multilingual application,incremental unsupervised learning,fuzzy art,arabic script documents,arabic script document,recent year,neural network architecture,language identification,digital libraries,unsupervised learning,natural language processing,internet,text analysis,neural network,resonance,adaptive resonance theory,resource management,frequency,neural networks
Adaptive resonance theory,Computer science,Persian,Intranet,Unsupervised learning,Urdu,Natural language processing,Language identification,Artificial intelligence,The Internet,Arabic script
Conference
Citations 
PageRank 
References 
1
0.37
14
Authors
2
Name
Order
Citations
PageRank
Ali Selamat171777.40
Choon-Ching Ng2396.64