Abstract | ||
---|---|---|
In recent years, real-time processing and analytics systems for big data--in the context of Business Intelligence (BI)--have received a growing attention. The traditional BI platforms that perform regular updates on daily, weekly or monthly basis are no longer adequate to satisfy the fast-changing business environments. However, due to the nature of big data, it has become a challenge to achieve the real-time capability using the traditional technologies. The recent distributed computing technology, MapReduce, provides off-the-shelf high scalability that can significantly shorten the processing time for big data; Its open-source implementation such as Hadoop has become the de-facto standard for processing big data, however, Hadoop has the limitation of supporting real-time updates. The improvements in Hadoop for the real-time capability, and the other alternative real-time frameworks have been emerging in recent years. This paper presents a survey of the open source technologies that support big data processing in a real-time/near real-time fashion, including their system architectures and platforms. |
Year | DOI | Venue |
---|---|---|
2014 | 10.1145/2628194.2628251 | IDEAS |
Keywords | Field | DocType |
big data,systems,miscellaneous,database administration,real-time,survey,architectures,real time | Big data processing,Data science,Data mining,Computer science,Business intelligence,Analytics,Big data,Database,Scalability | Conference |
Citations | PageRank | References |
20 | 0.83 | 18 |
Authors | ||
3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Xiufeng Liu | 1 | 108 | 14.69 |
Nadeem Iftikhar | 2 | 80 | 11.50 |
Xike Xie | 3 | 87 | 5.83 |