MapR and Mesosphere are announcing a new open source big data framework, called Myriad, that allows Apache YARN jobs to run alongside other applications and services in enterprise and cloud datacentres. What is Apache YARN? Apache Hadoop YARN (Yet Another Resource Negotiator) is a cluster management technology described as part of the ‘second-generation’ Hadoop family. […]
Month: February 2015
Making Data Lakes Real-Time with Transactional Hadoop
The phrase “data lake” has become popular for describing the practice of moving data into Hadoop to create a repository for large quantities of structured and unstructured data in native formats. Yet many are unfamiliar with the concept of an operational data lake, which enables the structured content of the data lake to be stored in […]
A guide to data mining with Hadoop
How businesses can realise and capitalise on the opportunities that Hadoop offers. The idea of gaining knowledge through specialised analysis of mass data started with data collection in the 1960s, and has steadily grown in both the amount of data processed and the sophistication of the questions businesses try to answer. Through this progression from static […]