Title
Large Scale Arabic Error Annotation: Guidelines and Framework.
Abstract
We present annotation guidelines and a web-based annotation framework developed as part of an effort to create a manually annotated Arabic corpus of errors and corrections for various text types. Such a corpus will be invaluable for developing Arabic error correction tools, both for training models and as a gold standard for evaluating error correction algorithms. We summarize the guidelines we created. We also describe issues encountered during the training of the annotators, as well as problems that are specific to the Arabic language that arose during the annotation process. Finally, we present the annotation tool that was developed as part of this project, the annotation pipeline, and the quality of the resulting annotations.
Year
Venue
Keywords
2014
LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION
Error Annotation,Arabic,Guidelines
Field
DocType
Citations 
Annotation,Arabic,Computer science,Text types,Speech recognition,Error detection and correction,Artificial intelligence,Natural language processing
Conference
31
PageRank 
References 
Authors
1.50
13
9
Name
Order
Citations
PageRank
Wajdi Zaghouani119721.27
Behrang Mohit218816.06
Nizar Habash31833145.59
Ossama Obeid4706.43
Nadi Tomeh5351.93
Alla Rozovskaya635022.71
Noura Farra7351.93
Sarah Alkuhlani8643.56
Kemal Oflazer978198.46