Blog

Running Enterprise Workloads in the Cloud with Cloudbreak

This article is based on Peter Darvasi and Richard Doktorics’ talk Running Enterprise Workloads in the Cloud at the DataWorks Summit 2018 in Berlin. It presents Cloudbreak, Hortonworks’ automated deployment tool for cloud environments, describes and comments on the features that Peter and Richard explained in their talk, and gives some personal guidelines on when and why [...]

May 28th, 2018 | Categories: Big Data, DataWorks Summit 2018 | 1 Comment

Omid: Scalable and highly available transaction processing for Apache Phoenix

Apache Omid provides a transactional layer on top of key/value NoSQL databases. In practice, it is usually used on top of Apache HBase. […]

May 24th, 2018 | Categories: Big Data, DataWorks Summit 2018, Events | 1 Comment

Apache Beam: a unified programming model for data processing pipelines

In this article, we will review the concepts, the history and the future of Apache Beam, which may well become the new standard for defining data processing pipelines. […]

Present and future of Hadoop workflow scheduling: Oozie 5.x

During the DataWorks Summit Europe 2018 in Berlin, I had the opportunity to attend a breakout session on Apache Oozie. It covered the new features released in Oozie 5.0, including future features of Oozie 5.x, which is the main subject of this article. The speakers also spent some time discussing Apache Ambari’s Workflow Scheduler and its way [...]

May 23rd, 2018 | Categories: Big Data, DataWorks Summit 2018 | 2 Comments

Essential questions about Time Series

Today, the bulk of Big Data is temporal. We see it in the media and among our customers: smart meters, banking transactions, smart factories, connected vehicles … IoT and Big Data go hand in hand. […]

March 19th, 2018 | Categories: Big Data, Data Engineering | 0 Comments

Notes after Katacoda Training on Kubernetes Container Orchestration

A few weeks ago, I dedicated two days to following the tutorials available on Katacoda, the interactive learning platform for Kubernetes or any other container orchestration platform. I’m sharing my notes, which I happen to use regularly as a cheat sheet. […]

December 14th, 2017 | Categories: Container | 0 Comments

Open Source Summit 2017 – a week in Prague

The Adaltas crew went to the Open Source Summit 2017 as well as the Mesos Summit 2017, held in Prague about three weeks ago. On this occasion, we compiled a series of articles about the conferences that impressed us most. Over the 3-day period of the Open Source Summit, there is no doubt [...]

November 23rd, 2017 | Categories: Events | 0 Comments