Title
Developing an Open Source 'Big Data' Cognitive Computing Platform: Big Data (Ubiquity symposium)
Abstract
The ability to leverage diverse data types requires a robust and dynamic approach to systems design. The needs of a data scientist are as varied as the questions being explored. Compute systems have focused on the management and analysis of structured data as the driving force of analytics in business. As open source platforms have evolved, the ability to apply compute to unstructured information has exposed an array of platforms and tools available to the business and technical community. We have developed a platform that meets the needs of the analytics user requirements of both structured and unstructured data. This analytics workbench is based on acquisition, transformation, and analysis using open source tools such as Nutch, Tika, Elastic, Python, PostgreSQL, and Django to implement a cognitive compute environment that can handle widely diverse data, and can leverage the ever-expanding capabilities of infrastructure in order to provide intelligence augmentation.
Year
DOI
Venue
2018
10.1145/3158344
Ubiquity
Field
DocType
Volume
Data science,Computer science,Systems design,Knowledge management,Unstructured data,Analytics,Data model,Big data,User requirements document,Cognitive computing,Python (programming language)
Journal
2018
Issue
ISSN
Citations 
March
1530-2180
0
PageRank 
References 
Authors
0.34
4
2
Name
Order
Citations
PageRank
Michael Kowolenko100.34
Mladen A. Vouk245249.92