Title
MyStore: A High Available Distributed Storage System for Unstructured Data
Abstract
Although some NoSQL systems such as Dynamo, Cassandra, MongoDB have provided different advantages for unstructured data management, no one can provide flexible query functions like MongoDB, while guarantee the availability and scalability as Cassandra simultaneously. This paper introduces a new methodology and implementation for improving the availability of unstructured data by presenting a new distributed storage system called MyStore, based on the combination of MongoDB and some advantages from other NoSQL systems. Consistent hash is used to distribute data on multiple MongoDB nodes, NWR mode is applied to provide automatic backup operation and guarantee data consistency. Gossip protocol is taken for exchanging information of failures in the system. Based on above strategies, a high available and scalable system for storing unstructured data is realized, which can also provide complex query functions like rational database. Moreover, this system is applied in a multi-discipline virtual experiment platform named VeePalms that requires high availability and high access efficiency for its unstructured data such as XML scene, video guideline. Experimental evaluation shows that the methodology is powerful enough not only to enhance the data availability, but also to improve the server's scalability.
Year
DOI
Venue
2012
10.1109/HPCC.2012.39
HPCC-ICESS
Keywords
Field
DocType
unstructured data management,data availability,storage system,multiple mongodb node,scalable system,high available,guarantee data consistency,nosql system,high access efficiency,unstructured data,high availability,data consistency,gossip protocol,protocols,data integrity,information exchange,writing,distributed data storage,failure analysis,distributed processing,scalability,memory,distributed databases,sql,availability,generators
Computer science,Parallel computing,Distributed data store,Unstructured data,NoSQL,Data integrity,Distributed database,High availability,Backup,Database,Distributed computing,Scalability
Conference
ISSN
Citations 
PageRank 
2576-3504
0
0.34
References 
Authors
10
5
Name
Order
Citations
PageRank
Wenbin Jiang135536.55
Lei Zhang281.59
Weizhong Qiang313727.22
Hai Jin46544644.63
Yaqiong Peng572.58