Title
Scalable Digital Libraries Based on NCSTRL/Dienst
Abstract
NCSTRL (The Networked Computer Science Technical Report Library) is a successful digital library for scientific and technical information. It uses the Dienst protocol that was developed by ARPA-funded CS-TR project. We encountered several problems while implementing NCSTRL based large-scale libraries: UPS for Los Alamos and JDL for JTASC. The document collection for these libraries can range from several hundred thousands to few millions. The first problem we found that the native Dienst implementation does not scale beyond approximately 30,000 records. Secondly we found that the implementation is tightly coupled to the Unix platform. Finally, for a large number of hits the NCSTRL search interface support is limited in terms of usability. To address these problems, we replaced the Dienst repository service implementation with an Oracle-based implementation using servlet technology. The Oracle database stores the index information (metadata) and is partitioned horizontally to speed searching through different archives. Furthermore, indexes were built in order to speed the search by different key items such as the author name, the title and the abstract. Our implementation significantly reduced the average wait time for a user for searches that resulted in a large number of hits. In addition, we get all the other benefits of using servlet technology such as efficiency and portability. In this paper, we present the performance results of the new implementation and compare it with that of the implementation of the Dienst protocol in NCSTRL.
Year
DOI
Venue
2000
10.1007/3-540-45268-0_16
ECDL3
Keywords
DocType
Volume
oracle-based implementation,new implementation,ncstrl search interface support,different key item,servlet technology,different archives,large number,dienst protocol,dienst repository service implementation,native dienst implementation,scalable digital,indexation,digital library
Conference
1923
ISSN
ISBN
Citations 
0302-9743
3-540-41023-6
7
PageRank 
References 
Authors
1.21
10
5
Name
Order
Citations
PageRank
Kurt Maly1567139.93
Mohammad Zubair258789.90
Hesham Anan3173.11
Dun Tan4111.94
Yunchuan Zhang581.56