Title
Parameter Database : Data-centric Synchronization for Scalable Machine Learning
Abstract
We propose a new data-centric synchronization framework for carrying out of machine learning (ML) tasks in a distributed environment. Our framework exploits the iterative nature of ML algorithms and relaxes the application agnostic bulk synchronization parallel (BSP) paradigm that has previously been used for distributed machine learning. Data-centric synchronization complements function-centric synchronization based on using stale updates to increase the throughput of distributed ML computations. Experiments to validate our framework suggest that we can attain substantial improvement over BSP while guaranteeing sequential correctness of ML tasks.
Year
Venue
Field
2015
CoRR
Distributed Computing Environment,Computer science,Correctness,Artificial intelligence,Throughput,Distributed computing,Database-centric architecture,Synchronization,Data synchronization,Synchronization (computer science),Machine learning,Database,Scalability
DocType
Volume
Citations 
Journal
abs/1508.00703
0
PageRank 
References 
Authors
0.34
7
4
Name
Order
Citations
PageRank
Naman Goel1113.60
Divyakant Agrawal282011674.75
Sanjay Chawla31372105.09
Ahmed K. Elmagarmid43720626.92