Title
BestPeer++: A Peer-to-Peer Based Large-Scale Data Processing Platform
Abstract
A corporate network is often used to share information among participating companies and to facilitate collaboration within an industry sector where companies share a common interest. It can help the companies reduce their operational costs and increase revenues. However, inter-company data sharing and processing pose unique challenges to such a data management system, including scalability, performance, throughput, and security. In this paper, we present BestPeer++, a system that delivers elastic data sharing services for corporate network applications in the cloud. It is based on BestPeer, a peer-to-peer (P2P) based data management platform. By integrating cloud computing, database, and P2P technologies into one system, BestPeer++ provides an economical, flexible, and scalable platform for corporate network applications and delivers data sharing services to participants based on the widely accepted pay-as-you-go business model. We evaluate BestPeer++ on the Amazon EC2 cloud platform. The benchmarking results show that BestPeer++ outperforms HadoopDB, a recently proposed large-scale data processing system, when both systems handle typical corporate network workloads. The results also demonstrate that BestPeer++ achieves near-linear scalability in throughput with respect to the number of peer nodes.
Year: 2012
DOI: 10.1109/TKDE.2012.236
Venue: Data Engineering
Keywords: pay-as-you-go business model, public domain software, peer-to-peer systems, large-scale data, linear scalability, amazon ec2 cloud platform, mapreduce, inter-company data, operational cost reduction, peer-to-peer based large-scale data processing platform, inter-company data sharing, corporate network application, elastic data sharing services, corporate network, large-scale data processing platform, hadoopdb, elastic data, typical corporate network workloads, data handling, benchmarking result, industry sector, peer-to-peer computing, business data processing, cloud computing, data management platform, p2p technology, data management system, best peer, intranets, p2p based data management platform, query processing, index, bestpeer++, corporate network workloads, scalability, database, database management systems, indexes, groupware, indexation, collaboration, data processing, servers, security, p2p, data management, business model
Field: Data mining, Peer-to-peer, Computer science, Data sharing, Server, Data processing system, Data management, Database, Benchmarking, Cloud computing, Scalability
DocType: Conference
Volume: 26
Issue: 6
ISSN: 1041-4347
ISBN: 978-1-4673-0042-1
Citations: 4
PageRank: 0.41
References: 15
Authors: 7
Name            Order   Citations   PageRank
Gang Chen       1       793         75.07
Tianlei Hu      2       30          9.56
Dawei Jiang     3       380         21.67
Peng Lu         4       24          2.54
Kian-Lee Tan    5       6962        776.65
Hoang Tam Vo    6       170         11.91
Sai Wu          7       954         59.08