Skip to content
Home
  • Contact
  • About

Category: Data lake

Making Data Lakes Real-Time with Transactional Hadoop

Posted on February 15, 2015 by admin

The phrase “data lake” has become popular for describing the moving of data into Hadoop to create a repository for large quantities of structured and unstructured data in native formats. Yet, many are unfamiliar with the concept of an operational data lake, which enables the structured content of the data lake to be stored in […]

Read More…

Posted in Data lakeLeave a comment

10 Amazing Things to Do With a Hadoop-Based Data Lake

Posted on December 24, 2014 by admin

10 Amazing Things to Do With a Hadoop-Based Data Lake The following is a summary of a talk I gave at Strata NY that is proving popular among a lot of people who are still trying to understand use cases for Hadoop and big data. In this talk, I introduce the concept of a Big […]

Read More…

Posted in Data lake, HadoopLeave a comment

Learning to Swim in the Data Lake

Posted on October 30, 2014 by admin

The data lake approach is increasingly being championed as a way to realize the promise of big data. This allows organizations to avoid transforming and loading data into a purpose-built data store, and instead, move data in its raw form in a data lake until they need it. The goal is to eliminate data silos […]

Read More…

Posted in Data lakeLeave a comment

Subscribe


Recent Posts

  • Sqoop CMD to export MYSQL table to Hive
  • Hive commands at your fingertips
  • Spark – Word count
  • Spark Getting started – Local development using eclipse
  • Easy Analysis on BigData with Bigsheets

Recent Comments

  • yuvraj on Apache Spark-Difference between reduceByKey, groupByKey and combineByKey
  • Vidya on Spark – Word count
  • Sanchita on Spark – Word count
  • Subhashree on Spark – Word count

Archives

  • January 2018
  • November 2016
  • July 2016
  • June 2016
  • May 2016
  • April 2016
  • March 2016
  • February 2016
  • July 2015
  • April 2015
  • March 2015
  • February 2015
  • January 2015
  • December 2014
  • November 2014
  • October 2014
  • September 2014
  • August 2014
  • July 2014
  • June 2014
  • May 2014
  • April 2014
  • March 2014
  • February 2014
  • January 2014
  • December 2013
  • November 2013

Categories

  • Big Data
  • Big Data
  • Big Data
  • Data lake
  • Data Warehouse
  • Developers
  • Hadoop
  • HDFS
  • RDBMS
  • RDBMS
  • Spark
  • Uncategorized