Title
A Data Structure To Speed-Up Machine Learning Algorithms On Massive Datasets
Abstract
Data processing in a fast and efficient way is an important functionality in machine learning, especially with the growing interest in data storage. This exponential increment in data size has hampered traditional techniques for data analysis and data processing, giving rise to a new set of methodologies under the term Big Data. Many efficient algorithms for machine learning have been proposed, facing up time and main memory requirements. Nevertheless, this process could still become hard when the number of features or records is extremely high. In this paper, the goal is not to propose new efficient algorithms but a new data structure that could be used by a variety of existing algorithms without modifying their original schemata. Moreover, the proposed data structure enables sparse datasets to be massively reduced, efficiently processing the data input into a new data structure output. The results demonstrate that the proposed data structure is highly promising, reducing the amount of storage and improving query performance.
Year
DOI
Venue
2016
10.1007/978-3-319-32034-2_31
Hybrid Artificial Intelligent Systems
Keywords
Field
DocType
Machine learning, Data structure, Massive data
Data structure,Data mining,Data processing,Computer science,Computer data storage,Algorithm,Artificial intelligence,Computational learning theory,Big data,Schema (psychology),Machine learning,Speedup
Conference
Volume
ISSN
Citations 
9648
0302-9743
2
PageRank 
References 
Authors
0.38
0
4
Name
Order
Citations
PageRank
Francisco Padillo120.38
José M. Luna236623.59
Alberto Cano313011.20
S. Ventura482534.87