Title
Towards a Semantic, Deep Archival File System
Abstract
In essence, computers are tools to help us with our daily lives. CPUs are extensions of our reasoning capability, whereas disks are extensions of our memory. But the simple hierarchical namespace of existing file systems is inadequate for managing today's files, which have rich semantics. In this paper, we advocate the need for integrating semantic information into a storage system. We propose "Sedar", a deep archival file system. Sedar is one of the first archival file systems that integrates semantic storage and retrieval capabilities. In addition, Sedar introduces several novel features: the notion of "semantic-hashing", which reduces storage consumption and is robust against misalignment of documents; "virtual snapshots" of the namespace; and "conceptual deletions" of files and directories. It exposes a semantic catalog that allows other semantic-based tools (e.g., visualization and statistical analysis) to be built. It uses a decentralized peer-to-peer storage utility, enabling horizontal scalability.
Year
2003
DOI
10.1109/FTDCS.2003.1204321
Venue
FTDCS
Keywords
file system,semantic storage,semantic catalog,decentralized peer-to-peer storage utility,storage consumption,storage system,deep archival file system,first archival file system,simple hierarchical namespace,semantic information,distributed databases,statistical analysis,visualization,robustness,data structures,information retrieval systems,computer science,p2p
Field
File system,World Wide Web,Global Namespace,Information retrieval,Computer data storage,Computer science,Namespace,Distributed database,Snapshot (computer storage),Semantics,Scalability
DocType
Conference
ISBN
0-7695-1910-5
Citations
20
PageRank
1.07
References
16
Authors
3
Name                 Order    Citations    PageRank
Mallik Mahalingam    1        194          14.52
Chunqiang Tang       2        1287         75.09
Zhichen Xu           3        1057         66.72