Title
Combining handwriting and speech recognition for transcribing historical handwritten documents
Abstract
Transcription of historical documents is a task of interest for libraries that want to make their holdings available. In recent years, Handwritten Text Recognition has allowed paleographers to speed up the manual transcription process, since they can work by correcting a draft transcription. An alternative is to obtain the draft transcription by dictating the contents to an Automatic Speech Recognition system. When both sources (image and speech) are available, a multimodal combination is possible, and an iterative process can be used to refine the final hypothesis. In this work, a multimodal combination based on confusion networks is presented. Results on two data sets of different difficulty levels show that the proposed technique provides draft transcriptions that are similar to or better than those of a previously proposed approach, allowing for a faster transcription process.
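As a rough illustration of what combining two confusion networks can look like, the sketch below merges word posteriors from a handwriting recognizer and a speech recognizer slot by slot. It is not the method of the paper: the list-of-dictionaries representation, the function names, the interpolation weight `alpha`, and the assumption that both networks are already aligned to the same number of slots are all hypothetical choices made for this example.

```python
from typing import Dict, List

# Hypothetical representation: a confusion network is a list of slots,
# and each slot maps candidate words to posterior probabilities.
ConfusionNetwork = List[Dict[str, float]]


def combine_slot(htr: Dict[str, float], asr: Dict[str, float], alpha: float) -> Dict[str, float]:
    """Linearly interpolate two word-posterior distributions and renormalise."""
    words = set(htr) | set(asr)
    mixed = {w: alpha * htr.get(w, 0.0) + (1.0 - alpha) * asr.get(w, 0.0) for w in words}
    total = sum(mixed.values())
    return {w: p / total for w, p in mixed.items()} if total > 0 else mixed


def combine_networks(htr_cn: ConfusionNetwork, asr_cn: ConfusionNetwork, alpha: float = 0.5) -> ConfusionNetwork:
    """Combine two networks slot by slot (assumes they are already aligned)."""
    if len(htr_cn) != len(asr_cn):
        raise ValueError("this sketch assumes the networks are already aligned")
    return [combine_slot(h, a, alpha) for h, a in zip(htr_cn, asr_cn)]


def best_path(cn: ConfusionNetwork) -> List[str]:
    """Pick the most probable word in every slot."""
    return [max(slot, key=slot.get) for slot in cn]


# Toy, made-up posteriors: the handwriting source alone prefers "ring".
htr_cn = [{"the": 0.6, "she": 0.4}, {"king": 0.45, "ring": 0.55}]
asr_cn = [{"the": 0.9, "a": 0.1}, {"king": 0.7, "thing": 0.3}]

combined = combine_networks(htr_cn, asr_cn, alpha=0.5)
print(best_path(combined))  # ['the', 'king']
```

In this toy case the speech evidence overturns the handwriting recognizer's second-slot choice, which is the kind of effect a multimodal combination aims for. A real system would also need to align networks of different lengths, which this sketch deliberately leaves out.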
Year
2015
DOI
10.1109/ICDAR.2015.7333739
Venue
International Conference on Document Analysis and Recognition
Keywords
Historical handwritten transcription, multimodal combination, confusion networks combination
Field
Transcription (linguistics), Confusion, Handwriting, Iterative and incremental development, Pattern recognition, Computer science, Speech recognition, Artificial intelligence, Text recognition, Speedup
DocType
Conference
ISSN
1520-5363
Citations
4
PageRank
0.43
References
12
Authors
2
Name, Order, Citations, PageRank
Emilio Granell, 1, 42, 6.80
Carlos D. Martínez-Hinarejos, 2, 38, 10.86