Abstract | ||
---|---|---|
We explain our experiences in designing, building and running a large corporate Datalake. Our platform has been running for over two years and makes a wide variety of corporate data assets, such as sales, marketing, customer information, as well as data from less conventional sources such as weather, news and social media available for analytics purposes to many teams across the company. We focus on describing the management of data and in particular how it is transferred and ingested into the platform. |
Year | DOI | Venue |
---|---|---|
2019 | 10.1109/CIC48465.2019.00021 | 2019 IEEE 5th International Conference on Collaboration and Internet Computing (CIC) |
Keywords | DocType | ISBN |
Data Lake,data ingestion,Hadoop,Enterprise | Conference | 978-1-7281-6740-4 |
Citations | PageRank | References |
1 | 0.40 | 4 |
Authors | ||
6 |
Name | Order | Citations | PageRank |
---|---|---|---|
Sean Rooney | 1 | 81 | 12.50 |
Daniel Bauer | 2 | 136 | 13.44 |
Luis Garcés-Erice | 3 | 1 | 0.40 |
Peter Urbanetz | 4 | 3 | 1.47 |
Florian Froese | 5 | 1 | 0.40 |
Sasa Tomic | 6 | 1 | 0.40 |