Data Lake
A data lake is a large, centralised repository that stores raw, semi-structured and structured data at scale, typically on low-cost object storage. It is designed for flexible analytics, data science and machine learning. The consensus is that resilient data lakes require governance, quality controls and clear access patterns; otherwise they devolve into unmanaged, risky ‘data swamps’ that are hard to secure and recover.
