Data lakes can deliver substantial cost savings and open up new sources of revenue.
But they also raise some tricky questions:
- Is it better to transform data into a format optimised for retrieval, or to keep it in raw form?
- How do you control access so that Chinese walls (information barriers) are not breached?
- How do you choose between streaming data in and loading it in batches?
- How do you ensure your data is labelled so it is understood during retrieval?
This white paper addresses these challenges and suggests pragmatic solutions for optimising the performance of your data lake.