Kubernetes

Auto-scaling Druid with Kubernetes

Apache Druid is an open-source analytics data store which could leverage the auto-scaling abilities of Kubernetes due to its distributed nature and its reliance on memory. I was inspired by the talk “Apache Druid Auto Scale-out/in for Streaming Data Ingestion on Kubernetes” by Jinchul Kim during DataWorks Summit 2019 Europe in Barcelona. […]

Google Cloud Summit Paris Notes

Google organized its yearly Summit edition 2019 in Paris on the 18th of June. This year's event was the biggest yet in Paris, which reflect Google's commitment to position itself in the French market. In term of Cloud market shares, Google Cloud Platform (GCP) is still far behind its competitor Amazon AWS and Microsoft Azure. [...]

By |2019-06-26T19:23:32+00:00June 26th, 2019|Categories: Events|Tags: , , , , , |0 Comments

Monitoring a production Hadoop cluster with Kubernetes

Monitoring a production grade Hadoop cluster is a real challenge and needs to be constantly evolving. The software we use today is based on Nagios. Very efficient when it comes to the simplest surveillance, it is not able to meet the need for a more complex verification. In this article, we will propose an architecture [...]

CodaLab – Data Science competitions

CodaLab Competition is a platform for code execution in the field of Data Science. It is a web interface on which a user can submit code or results and compare themselves to others. Let’s see how it works and how to install CodaLab On-Premise. […]

By |2018-12-17T16:45:38+00:00December 17th, 2018|Categories: Big Data, Data Science|Tags: , , , , |0 Comments

Microsoft introduces Cloud Native Application Bundles

At DockerCon EU 2018 in Barcelona, Matt Butcher, Principal Engineer at Microsoft and inventor of Helm, introduced CNAB, Cloud Native Application Bundles, a packaging format for distributed applications, along with Duffle, a CLI tool to run these bundles. […]

By |2018-12-05T10:21:00+00:00December 4th, 2018|Categories: Container, DevOps|Tags: , , , |0 Comments

Lando: Deep Learning used to summarize conversations

Lando is an application to summarize conversations using Speech To Text to translate the written record of a meeting into text and Deep Learning technics to summarize contents. It allows users to quickly understand the context of the conversation. During the cource of our internship at Adaltas, we worked on a new project called Lando to [...]