Title
A Novel Procedure to Speed up the Transcription of Historical Handwritten Documents by Interleaving Keyword Spotting and user Validation.
Abstract
We propose a novel procedure to speed up the transcription of handwritten documents in digital historical archives when a keyword spotting system is used for the purpose. Instead of validating the system outputs in a single step, as is customary, the proposed methodology envisages a multi-step validation process embedded in a human-in-the-loop approach. At each step, the system outputs are validated and, whenever the system mistakenly returns a word image that does not correspond to any entry of the keyword list, its correct transcription is entered and used to query the system in the next step. The performance of our approach has been experimentally evaluated in terms of the total time needed to achieve the complete transcription of a subset of documents from the Bentham dataset. The results confirm that interleaving keyword spotting by the system and validation by the user significantly reduces the time required to transcribe the document content with respect to both manual transcription and the traditional end-of-the-loop validation process.
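The multi-step loop described in the abstract can be illustrated with a minimal Python sketch. It is a toy simulation and not the authors' implementation: `toy_spotter`, `interleaved_transcription`, the first-or-last-letter matching rule, and the sample words are all assumptions made for illustration, and the "user" is simulated by looking up the ground-truth text of each returned word image.

```python
def toy_spotter(query, images, transcribed):
    """Hypothetical stand-in for a keyword spotting system.

    Returns the ids of not-yet-transcribed word images whose text shares
    its first or last letter with the query, so it produces both correct
    hits and false positives.
    """
    return [
        image_id
        for image_id, text in images.items()
        if image_id not in transcribed
        and (text[0] == query[0] or text[-1] == query[-1])
    ]


def interleaved_transcription(images, keywords, spotter=toy_spotter):
    """Interleave spotting and validation until no new queries remain.

    images:   dict mapping image id -> ground-truth text (simulates the user)
    keywords: initial keyword list used to query the spotter
    Returns (validated transcriptions, number of validation steps).
    """
    transcribed = {}             # image id -> validated transcription
    known = set(keywords)        # every word that has been or will be queried
    current = list(keywords)     # queries for the current step
    steps = 0
    while current:
        steps += 1
        next_queries = []
        for query in current:
            for image_id in spotter(query, images, transcribed):
                truth = images[image_id]      # user reads the word image
                transcribed[image_id] = truth
                # A mistakenly returned word outside the keyword list is
                # typed in by the user and becomes a query for the next step.
                if truth not in known:
                    known.add(truth)
                    next_queries.append(truth)
        current = next_queries
    return transcribed, steps


# Illustrative data only (not taken from the Bentham dataset).
images = {0: "the", 1: "then", 2: "man", 3: "men", 4: "the", 5: "court"}
transcribed, steps = interleaved_transcription(images, ["the"])
print(steps, sorted(transcribed))
```

In this toy run the single keyword "the" also retrieves "then", which in turn retrieves "man" and "men" at the next step; the word image "court" is never returned and would still have to be transcribed manually.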
Year: 2019
DOI: 10.1109/ICDAR.2019.00198
Venue: ICDAR
Field: Pattern recognition, Computer science, Keyword spotting, Natural language processing, Artificial intelligence, Interleaving, Speedup
DocType: Conference
Citations: 0
PageRank: 0.34
References: 0
Authors: 2
Name
Order
Citations
PageRank
Adolfo Santoro172.72
Angelo Marcelli213932.42