Title
Exploring large-scale small file storage for search engines
Abstract
Large-scale small file storage for original pages degrades performance of search engines. In this paper, we first analyze the disadvantages of the existing EXT3 file system in accessing small files. Then, the rate and speed of compression algorithms are verified to choose a proper storage compression algorithm. Meanwhile, we design an original page oriented file organization structure and a read---write query tree to store the large-scale small files which need no modification. The accessing response time and disk space waste are remarkably decreased when search engines use these techniques to store original-page small files.
Year
DOI
Venue
2016
10.1007/s11227-015-1394-z
The Journal of Supercomputing
Keywords
Field
DocType
Search engine,Small file storage,Storage time,Storage space
File format,File Control Block,File system,Stub file,Computer science,Unix file types,Versioning file system,File system fragmentation,Operating system,Database,Computer file
Journal
Volume
Issue
ISSN
72
8
0920-8542
Citations 
PageRank 
References 
6
0.60
3
Authors
5
Name
Order
Citations
PageRank
Weizhe Zhang128753.07
gangzhao lu260.60
Hui He38016.45
Qizhen Zhang4234.74
chuanliang yu560.60