REST

Auto-scaling Druid with Kubernetes

Apache Druid is an open-source analytics data store which could leverage the auto-scaling abilities of Kubernetes due to its distributed nature and its reliance on memory. I was inspired by the talk “Apache Druid Auto Scale-out/in for Streaming Data Ingestion on Kubernetes” by Jinchul Kim during DataWorks Summit 2019 Europe in Barcelona. […]

Gatsby.js, React and GraphQL for documentation websites

Over the last few months, I have started redesigning some of our open source project websites. These include the websites of the Node.js CSV project, the Node.js HBase client and the Nikita project, our homemade system deployment tool. I have used multiple static website generators in the past but I wanted to try [...]

April 1st, 2019 | Categories: Front End | 0 Comments

Apache Knox made easy!

Apache Knox is the secure entry point of a Hadoop cluster, but can it also be the entry point for my REST applications? […]

Main advantages of GraphQL as an alternative to REST

GraphQL is based on a simple idea: moving the assembly of a request from the server to the client. The client sees one overall strongly typed schema instead of multiple REST endpoints and builds the query it wants. My first REST-based web application, or SPA (Single Page Application) as we call them lately, [...]

November 27th, 2018 | Categories: Big Data, Data Science | 0 Comments

HDP cluster supervision

With the current growth of Big Data technologies, more and more companies are building their own clusters in the hope of extracting value from their data. One main concern while building these infrastructures is the capacity to continuously monitor the cluster's health and report issues as fast as possible. This is where supervision comes in. There [...]

July 5th, 2017 | Categories: Big Data, DevOps, Infrastructure | 2 Comments

Node HBase, a Node.js client for Apache HBase

HBase is a "column family" database from the Hadoop ecosystem built on the model of Google BigTable. HBase can accommodate very large volumes of data (terabytes or petabytes) while maintaining high availability and fast response times. Adaltas has released a Node.js client for HBase whose code is published on GitHub and which uses the REST [...]

November 1st, 2010 | Categories: Big Data | 0 Comments