Title
Flexible Transformations For Learning Big Data
Abstract
This paper proposes a domain-specific solution for iterative learning of big and dense (non-sparse) datasets. A large host of learning algorithms, including linear and regularized regression techniques, rely on iterative updates on the data connectivity matrix in order to converge to a solution. The performance of such algorithms often severely degrade when it comes to large and dense data. Massive dense datasets not only induce obligatory large number of arithmetics, but they also incur unwanted message passing cost across the processing nodes. Our key observation is that despite the seemingly dense structures, in many applications, data can be transformed into a new space where sparse structures become revealed. We propose a scalable data transformation scheme that enables creating versatile sparse representations of the data. The transformation can be tuned to benefit the underlying platform's cost and constraints. Our evaluations demonstrate significant improvement in energy usage, runtime, and mem
Year
DOI
Venue
2015
10.1145/2745844.2745889
International Conference on Measurement and Modeling of Computer Systems
Keywords
Field
DocType
big and dense data,performance optimization,sparse factorization,subspace sampling
Computer science,Matrix (mathematics),Theoretical computer science,Iterative learning control,Big data,Message passing,Distributed computing,Scalability
Conference
Volume
Issue
ISSN
43
1
0163-5999
Citations 
PageRank 
References 
2
0.36
3
Authors
4
Name
Order
Citations
PageRank
Azalia Mirhoseini123818.68
Ebrahim M. Songhori21068.05
Bita Darvish Rouhani39913.53
Farinaz Koushanfar43055268.84