Title
Connecting Data For Digital Libraries: The Library, The Dictionary And The Corpus
Abstract
The paper presents two experiments on enhancing the content of a digital library with data from external repositories. The concept involves three related resources: a digital library of Middle Polish prints where items are stored in image form, the same items in textual form in a linguistically annotated corpus, and a dictionary of Middle Polish. The first experiment demonstrates how the results of automated OCR obtained with open source tools can be replaced with transcribed content from the corpus, enabling the user to search within individual prints. The second experiment links the print content with the electronic dictionary, filtering relevant entries against a dictionary of modern Polish to eliminate redundant results. Interconnecting all relevant resources in a digital library-centered platform creates new possibilities both for researchers involved in the development of these resources and for scholars studying the Polish language of the 17th and 18th centuries.
Year
2019
DOI
10.1007/978-3-030-34058-2_13
Venue
DIGITAL LIBRARIES AT THE CROSSROADS OF DIGITAL INFORMATION FOR THE FUTURE, ICADL 2019
Keywords
Digital library, Linguistic corpus, Electronic dictionary, Middle Polish
Field
Information retrieval, Computer science, Polish, Electronic dictionary, Digital library
DocType
Conference
Volume
11853
ISSN
0302-9743
Citations
0
PageRank
0.34
References
0
Authors
2
Name, Order, Citations, PageRank
Maciej Ogrodniczuk, 1, 3811.22
Wlodzimierz Gruszczynski, 2, 0, 0.68