Monthly Archives: July 2017

Exposing Kafka on two different networks

This article was implemented using CDH 5.7.1 with Kafka 2.0.1.5 installed using parcels. One of the clusters we are working on has the following network configuration: A "data" network exposing our edge, kafka and master nodes to the outside world An "internal" network dedicated to the cluster for our worker nodes We use Kafka for data [...]

By | 2017-10-24T12:13:22+00:00 July 22nd, 2017|Categories: Blog|Tags: , |0 Comments

Change Ambari’s topbar color

We recently had a client that has multiple environments (Production, Integration, Testing, ...) running on HDP and managed using one Ambari instance per cluster. One of the questions that came up was the folloging: We need a way to distinguish our environment when on Ambari and the cluster name is visually not enough, how can [...]

By | 2017-07-24T21:37:13+00:00 July 9th, 2017|Categories: Hack|Tags: , |0 Comments

MiNiFi: Data at Scales & the Values of Starting Small

This post is part of the Series of the Dataworks Summit 2017 (ex-Hadoop Summit) Speaker is Aldrin Piri from Hortonworks This conference presented rapidly Apache NiFi and explained where MiNiFi came from: basically it's a NiFi minimal agent to deploy on small devices to bring data to a cluster's NiFi pipeline (ex: IoT). Here are [...]

By | 2017-07-24T21:37:13+00:00 July 8th, 2017|Categories: Blog, Events|Tags: , , , , |0 Comments

HDP cluster supervision

About With the current growth of BigData technologies, more and more companies are building their own clusters in hope to get some value of their data. One main concern while building these infrastructures is the capacity to continuously monitor the cluster's health and report issues as fast as possible. This is where supervision comes in. [...]

By | 2017-11-21T20:08:44+00:00 July 5th, 2017|Categories: Big Data|0 Comments