Title
Linguistic Resources for Handwriting Recognition and Translation Evaluation.
Abstract
We describe efforts to create corpora to support development and evaluation of handwriting recognition and translation technology. LDC has developed a stable pipeline and infrastructures for collecting and annotating handwriting linguistic resources to support the evaluation of MADCAT and OpenHaRT. We collect handwritten samples of pre-processed Arabic and Chinese data that has been already translated in English that is used in the GALE program. To date, LDC has recruited more than 600 scribes and collected, annotated and released more than 225,000 handwriting images. Most linguistic resources created for these programs will be made available to the larger research community by publishing in LDC's catalog. The phase 1 MADCAT corpus is now available.
Year
Venue
Keywords
2012
LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION
handwriting image,recognition,translation
DocType
Citations 
PageRank 
Conference
1
0.40
References 
Authors
3
5
Name
Order
Citations
PageRank
Zhiyi Song1236.94
Safa Ismael291.37
Stephen Grimes3274.56
David Doermann44313312.70
Stephanie Strassel551258.41