Title
Experiences with Managing Data Ingestion into a Corporate Datalake
Abstract
We explain our experiences in designing, building and running a large corporate Datalake. Our platform has been running for over two years and makes a wide variety of corporate data assets, such as sales, marketing, customer information, as well as data from less conventional sources such as weather, news and social media available for analytics purposes to many teams across the company. We focus on describing the management of data and in particular how it is transferred and ingested into the platform.
Year
DOI
Venue
2019
10.1109/CIC48465.2019.00021
2019 IEEE 5th International Conference on Collaboration and Internet Computing (CIC)
Keywords
DocType
ISBN
Data Lake,data ingestion,Hadoop,Enterprise
Conference
978-1-7281-6740-4
Citations 
PageRank 
References 
1
0.40
4
Authors
6
Name
Order
Citations
PageRank
Sean Rooney18112.50
Daniel Bauer213613.44
Luis Garcés-Erice310.40
Peter Urbanetz431.47
Florian Froese510.40
Sasa Tomic610.40