Title
Learning-Free Pattern Detection For Manuscript Research: An Efficient Approach Toward Making Manuscript Images Searchable
Abstract
Automatic pattern detection has become increasingly important for scholars in the humanities as the number of manuscripts that have been digitised has grown. Most of the state-of-the-art methods used for pattern detection depend on the availability of a large number of training samples, which are typically not available in the humanities as they involve tedious manual annotation by researchers (e.g. marking the location and size of words, drawings, seals and so on). This makes the applicability of such methods very limited within the field of manuscript research. We propose a learning-free approach based on a state-of-the-art Naive Bayes Nearest-Neighbour classifier for the task of pattern detection in manuscript images. The method has already been successfully applied to an actual research question from South Asian studies about palm-leaf manuscripts. Furthermore, state-of-the-art results have been achieved on two extremely challenging datasets, namely the AMADI_LontarSet dataset of handwriting on palm leaves for word-spotting and the DocExplore dataset of medieval manuscripts for pattern detection. A performance analysis is provided as well in order to facilitate later comparisons by other researchers. Finally, an easy-to-use implementation of the proposed method is developed as a software tool and made freely available.
Year
2021
DOI
10.1007/s10032-021-00371-7
Venue
International Journal on Document Analysis and Recognition
Keywords
Pattern detection, Document analysis, Learning-free, Historical manuscripts, NBNN classifier
DocType
Journal
Volume
24
Issue
3
ISSN
1433-2833
Citations
0
PageRank
0.34
References
0
Authors
3
Name | Order | Citations | PageRank
Hussein Adnan Mohammed | 1 | 0 | 0.34
Volker Märgner | 2 | 295 | 29.02
Giovanni Ciotti | 3 | 0 | 0.34