Blog, last published articles

From Dockerfile to Ansible Containers

A presentation by Tomas Tomecek from Red Hat’s containerization team. The talk introduced the Dockerfile format and the Ansible Container tool, then compared the two. […]

October 25th, 2017 | Categories: Open Source Summit Europe 2017

Multi-Repo, Multi-Node Gating at Massive Scale

This is a recap and personal review of Monty Taylor’s presentation of OpenStack’s continuous integration tool “Zuul” (not to be confused with Netflix’s Zuul project) at the Open Source Summit 2017 in Prague. […]

October 24th, 2017 | Categories: Open Source Summit Europe 2017

Kubernetes 1.8

The 1.8 release of Kubernetes brings a lot of new things. With 2500+ pull requests, 2000+ commits and 400+ committers, Kubernetes added 39 new features in this version. This is the richest release in terms of features; for comparison, the 1.7 version contained only 28. For this release the Kubernetes team focused on three main [...]

October 24th, 2017 | Categories: Open Source Summit Europe 2017

Yahoo’s Vespa Engine

Vespa is Yahoo’s fully autonomous and self-sufficient big data processing and serving engine. It aims at serving results of queries on huge amounts of data in real time. An example of this would be serving search results or recommendations to a user. Yahoo – or Oath – recently made Vespa open source on GitHub. At [...]

October 16th, 2017 | Categories: Tech Radar

Exposing Kafka on two different networks

The setup described in this article uses CDH 5.7.1 with Kafka 2.0.1.5, installed using parcels. One of the clusters we are working on has the following network configuration: a "data" network exposing our edge, Kafka and master nodes to the outside world, and an "internal" network dedicated to the cluster for our worker nodes. We use Kafka for data [...]

July 22nd, 2017 | Categories: Blog

Change Ambari’s topbar color

We recently had a client with multiple environments (Production, Integration, Testing, ...) running on HDP, each managed by its own Ambari instance. One of the questions that came up was the following: we need a way to distinguish our environments in Ambari, and the cluster name alone is not visually distinctive enough, so how can [...]

July 9th, 2017 | Categories: Hack

MiNiFi: Data at Scales & the Values of Starting Small

This post is part of the series on the Dataworks Summit 2017 (ex-Hadoop Summit). The speaker is Aldrin Piri from Hortonworks. The talk briefly presented Apache NiFi and explained where MiNiFi came from: basically, it's a minimal NiFi agent deployed on small devices (e.g. IoT) to bring data to a cluster's NiFi pipeline. Here are [...]

July 8th, 2017 | Categories: Blog, Events

HDP cluster supervision

With the current growth of Big Data technologies, more and more companies are building their own clusters in the hope of getting some value out of their data. One main concern while building these infrastructures is the ability to continuously monitor the cluster's health and report issues as fast as possible. This is where supervision comes in. [...]

July 5th, 2017 | Categories: Big Data