David Worms

About David Worms

Passionate about programming, data and entrepreneurship, I help shape Adaltas into a team of talented engineers who share their skills and experiences.

Data Lake ingestion best practices

Creating a Data Lake requires rigor and experience. Here are some good practices around data ingestion both for batch and stream architectures that we recommend and implement with our customers. […]

June 18th, 2018 | Categories: Data Engineering, DevOps

DataWorks Summit 2018: A few days speaking Hadoop

The Adaltas crew went to the DataWorks Summit 2018 held in Berlin on the 18th and 19th of April 2018. On this occasion, we compiled a series of articles about the talks that made the strongest impression on us. […]

June 5th, 2018 | Categories: DataWorks Summit 2018

Essential questions about Time Series

Today, the bulk of Big Data is temporal. We see it in the media and among our customers: smart meters, banking transactions, smart factories, connected vehicles … IoT and Big Data go hand in hand. […]

March 19th, 2018 | Categories: Big Data, Data Engineering

Notes after Katacoda Training on Kubernetes Container Orchestration

A few weeks ago, I dedicated two days to following the tutorials available on Katacoda, the interactive learning platform for Kubernetes or any other container orchestration platform. I’m sharing my notes, which I regularly use as a cheat sheet. […]

December 14th, 2017 | Categories: Container

Open Source Summit 2017 – a week in Prague

The Adaltas crew went to the Open Source Summit 2017 as well as the Mesos Summit 2017, held in Prague about 3 weeks ago. On this occasion, we compiled a series of articles about the talks that made the strongest impression on us. Over the 3-day period of the Open Source Summit, there is no doubt [...]

November 23rd, 2017 | Categories: Events

Micro Services

Back in the day, applications were monolithic and we could use an IP address to access a service. With virtual machines (VMs), multiple hosts started to appear on the same machine, each running multiple apps. Things remained similar between VMs and physical machines, as services were still accessible through an IP. With microservices, things changed [...]

November 14th, 2017 | Categories: Open Source Summit Europe 2017

Hive, Calcite and Druid

BI/OLAP requires interactive visualization of complex data streams: real-time bidding events, user activity streams, voice call logs, network traffic flows, firewall events and application KPIs. Traditional solutions: RDBMS (MySQL..) don't scale and need caching, but ad hoc queries remain slow; key/value stores (HBase...) are quick but take forever to compute (pre-materialization of data). Context: created in 2011, open-sourced [...]

July 14th, 2016 | Categories: Big Data

A simple connect middleware to transpile CoffeeScript files

This new module, called connect-coffee-script, is a Connect middleware that serves JavaScript files written in CoffeeScript. It can be used with Connect or any Connect-compatible framework such as Express and Zappa. For those not familiar with CoffeeScript, it is a language that compiles into JavaScript. […]

July 4th, 2016 | Categories: Hack

Catch ‘uncaughtException’ error in your mocha test

This isn't the first time I have faced this situation. Today, I finally found the time and energy to look for a solution. Let's say that, in your mocha test, you need to test an expected "uncaughtException" event, the Node.js technique to catch the uncatchable. Easy, just register an "uncaughtException" listener on the process event emitter. Well, [...]

July 27th, 2013 | Categories: Hack