Abstract | ||
---|---|---|
In essence, computers are tools to help us with our daily lives. CPUs are extension to our reasoning capability whereas disks are extensions to our memory. But the simple hierarchical namespace of existing file systems is inadequate in managing files today that have rich semantics. In this paper, we advocate the need for integrating semantic information into a storage system. We propose "Sedar", a deep archival file system. Sedar is one of the the firstarchival file systems that integrates semantic storage and retrieval capabilities. In addition, Sedar introduces several novel features: the notion of "semantic-hashing" to reduce the storage consumption that is robust against misalignment of documents; "virtual snapshot" of namespace, and "conceptual deletions" of files and directories. It exposes a semantic catalog that allows other semantic-based tools (e.g., visualization and statistical analysis) to be built. It uses a decentralized peer-to-peer storage utility enabling horizontal scalability. |
Year | DOI | Venue |
---|---|---|
2003 | 10.1109/FTDCS.2003.1204321 | FTDCS |
Keywords | Field | DocType |
file system,semantic storage,semantic catalog,decentralized peer-to-peer storage utility,storage consumption,storage system,deep archival file system,firstarchival file system,simple hierarchical namespace,semantic information,distributed databases,statistical analysis,visualization,robustness,data structures,information retrieval systems,computer science,p2p | File system,World Wide Web,Global Namespace,Information retrieval,Computer data storage,Computer science,Namespace,Distributed database,Snapshot (computer storage),Semantics,Scalability | Conference |
ISBN | Citations | PageRank |
0-7695-1910-5 | 20 | 1.07 |
References | Authors | |
16 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Mallik Mahalingam | 1 | 194 | 14.52 |
Chunqiang Tang | 2 | 1287 | 75.09 |
Zhichen Xu | 3 | 1057 | 66.72 |