DocumentCode :
3516869
Title :
Towards a semantic, deep archival file system
Author :
Mahalingam, Mallik ; Tang, Chunqiang ; Xu, Zhichen
Author_Institution :
HP Labs., Palo Alto, CA, USA
fYear :
2003
fDate :
28-30 May 2003
Firstpage :
115
Lastpage :
121
Abstract :
In essence, computers are tools to help us with our daily lives. CPUs are extension to our reasoning capability whereas disks are extensions to our memory. But the simple hierarchical namespace of existing file systems is inadequate in managing files today that have rich semantics. In this paper, we advocate the need for integrating semantic information into a storage system. We propose "Sedar", a deep archival file system. Sedar is one of the the first archival file systems that integrates semantic storage and retrieval capabilities. In addition, Sedar introduces several novel features: the notion of "semantic-hashing" to reduce the storage consumption that is robust against misalignment of documents; "virtual snapshot" of namespace, and "conceptual deletions" of files and directories. It exposes a semantic catalog that allows other semantic-based tools (e.g., visualization and statistical analysis) to be built. It uses a decentralized peer-to-peer storage utility enabling horizontal scalability.
Keywords :
data structures; distributed databases; information retrieval systems; network operating systems; Sedar archival file system; decentralized peer-to-peer storage; horizontal scalability; semantic information integration; semantic-based tools; semantic-hashing; Computer science; Content based retrieval; File systems; Humans; Laboratories; Milling machines; Peer to peer computing; Robustness; Statistical analysis; Visualization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Distributed Computing Systems, 2003. FTDCS 2003. Proceedings. The Ninth IEEE Workshop on Future Trends of
ISSN :
1071-0485
Print_ISBN :
0-7695-1910-5
Type :
conf
DOI :
10.1109/FTDCS.2003.1204321
Filename :
1204321
Link To Document :
بازگشت