About Gauthier Leonard

Gauthier Leonard is a Data Engineer in Big Data recently graduated. During his internship at Adaltas, he became familiar with the Hadoop ecosystem and the deployment of secure clusters by developing a cluster provisioning automation tool. Gauthier consolidated his skills during his first assignment as the Big Data referent in a Data Lake project. He helped the customer to design and install an HDP 3 cluster, and set up a first data pipeline using NiFi, Hive 3 (Hive ACID and Hive LLAP) and Oozie.

Apache Beam: a unified programming model for data processing pipelines

In this article, we will review the concepts, the history and the future of Apache Beam, that may well become the new standard for data processing pipelines definition. […]