Title
Towards an omnilingual word retrieval system for ancient manuscripts
Abstract
In this article, we introduce the first method that allows the indexation of ancient manuscripts of any language and alphabet. We describe a word retrieval engine inspired by recent word-spotting advances on ancient manuscripts. Our approach does not need any layout segmentation and makes use of features fitted to any type of alphabet (Latin, Arabic, Chinese, etc.) and writing. The engine is tested on numerous documents and in several use-cases.
Year
DOI
Venue
2009
10.1016/j.patcog.2009.01.026
Pattern Recognition
Keywords
Field
DocType
numerous document,layout segmentation,ancient manuscript,omnilingual word retrieval system,ancient documents,word retrieval,word retrieval engine,recent word-spotting advance,document indexing,omnilingual,word-spotting,segmentation-free,use case
Arabic,Information retrieval,Segmentation,Search engine indexing,Natural language processing,Artificial intelligence,Mathematics,Word processing,Alphabet
Journal
Volume
Issue
ISSN
42
9
Pattern Recognition
Citations 
PageRank 
References 
58
2.12
20
Authors
4
Name
Order
Citations
PageRank
Yann Leydier117410.13
Asma Ouji2693.47
Frank Lebourgeois325623.94
Hubert Emptoz438338.09