Abstract | ||
---|---|---|
The paper presents two experiments related to enhancing the content of a digital library with data from external repositories. The concept involves three related resources: a digital library of Middle Polish prints where items are stored in image form, the same items in textual form in a linguistically annotated corpus, and a dictionary of Middle Polish. The first experiment demonstrates how the results of automated OCR obtained with open source tools can be replaced with transcribed content from the corpus, enabling the user to search within individual prints. The second experiment links the print content with the electronic dictionary, filtering relevant entries with the dictionary of modern Polish to eliminate redundant results. Interconnecting all relevant resources in a digital library-centered platform creates new possibilities both for researchers involved in development of these resources as well as for scholars studying the Polish language of the 17th and 18th centuries. |
Year | DOI | Venue |
---|---|---|
2019 | 10.1007/978-3-030-34058-2_13 | DIGITAL LIBRARIES AT THE CROSSROADS OF DIGITAL INFORMATION FOR THE FUTURE, ICADL 2019 |
Keywords | Field | DocType |
Digital library, Linguistic corpus, Electronic dictionary, Middle Polish | Information retrieval,Computer science,Polish,Electronic dictionary,Digital library | Conference |
Volume | ISSN | Citations |
11853 | 0302-9743 | 0 |
PageRank | References | Authors |
0.34 | 0 | 2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Maciej Ogrodniczuk | 1 | 38 | 11.22 |
Wlodzimierz Gruszczynski | 2 | 0 | 0.68 |