Title
A Transcription Is All You Need: Learning to Align Through Attention
Abstract
Historical ciphered manuscripts are a type of document where graphical symbols are used to encrypt their content instead of regular text. Nowadays, expert transcriptions can be found in libraries alongside the corresponding manuscript images. However, those transcriptions are not aligned, so these are barely usable for training deep learning-based recognition methods. To solve this issue, we propose a method to align each symbol in the transcript of an image with its visual representation by using an attention-based Sequence to Sequence (Seq2Seq) model. The core idea is that, by learning to recognise symbols sequence within a cipher line image, the model also identifies their position implicitly through an attention mechanism. Thus, the resulting symbol segmentation can be later used for training algorithms. The experimental evaluation shows that this method is promising, especially taking into account the small size of the cipher dataset.
Year
DOI
Venue
2021
10.1007/978-3-030-86198-8_11
DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021 WORKSHOPS, PT I
Keywords
DocType
Volume
Handwritten symbol alignment, Hand-drawn symbol recognition, Sequence to Sequence, Attention models
Conference
12916
ISSN
Citations 
PageRank 
0302-9743
0
0.34
References 
Authors
0
4
Name
Order
Citations
PageRank
Pau Torras100.34
Mohamed Ali Souibgui201.35
Jialuo Chen300.34
Alicia Fornés456348.56