The phrase “data lake” has become popular for describing the moving of data into Hadoop to create a repository for large quantities of structured and unstructured data in native formats. Yet, many are unfamiliar with the concept of an operational data lake, which enables the structured content of the data lake to be stored in […]
Category: Data lake
10 Amazing Things to Do With a Hadoop-Based Data Lake
10 Amazing Things to Do With a Hadoop-Based Data Lake The following is a summary of a talk I gave at Strata NY that is proving popular among a lot of people who are still trying to understand use cases for Hadoop and big data. In this talk, I introduce the concept of a Big […]
Learning to Swim in the Data Lake
The data lake approach is increasingly being championed as a way to realize the promise of big data. This allows organizations to avoid transforming and loading data into a purpose-built data store, and instead, move data in its raw form in a data lake until they need it. The goal is to eliminate data silos […]