Home

Big Data

Data Engineering

Data Collection, Data Preparation, Data Lake, Data Governance

Data Science

Writing algorithms, Spark, Machine Learning, exploration, statistics, Python, R

Data Streaming

Message Bus, Key Performance Indicator (KPI), Threshold Detection, Time Window Queries, Intelligent Behaviors

Data Analytics

Visualization, notebooks

Latest articles

CodaLab – Data Science competitions

December 17th, 2018 | Categories: Big Data, Data Science

CodaLab Competition is a platform for code execution in the field of Data Science. It is a web interface on which a user can submit code or results and compare themselves to others. Let’s see [...]

Microsoft introduces Cloud Native Application Bundles

December 4th, 2018 | Categories: Container, DevOps

At DockerCon EU 2018 in Barcelona, Matt Butcher, Principal Engineer at Microsoft and inventor of Helm, introduced CNAB, Cloud Native Application Bundles, a packaging format for distributed applications, along with Duffle, a CLI tool to [...]

Main advantages of GraphQL as an alternative to REST

November 27th, 2018 | Categories: Big Data, Data Science

GraphQL is based on a simple idea: moving the assembly of a request from the server to the client. The client sees the overall strongly-typed schema instead of multiple REST endpoints and builds the [...]

Node.js CSV version 4 – re-writing and performance

November 19th, 2018 | Categories: Node.js

Today, we release a new major version of the Node.js CSV parser project. Version 4 is a complete rewrite of the project, focusing on performance. It also comes with new features as well as some [...]

Hadoop cluster takeover with Apache Ambari

November 15th, 2018 | Categories: Adalas Summit 2018, Big Data

We recently migrated a large production Hadoop cluster from a “manual” automated install to Apache Ambari; we called this the Ambari Takeover. This is a risky process and we will detail why this operation was [...]

Managing User Identities on Big Data Clusters

November 8th, 2018 | Categories: Big Data, Cyber Security

Securing a Big Data cluster involves integrating or deploying specific services to store users. Some users are cluster-specific, while others are available across all clusters. It is not always easy to understand how these different [...]

Apache Flink: past, present and future

November 5th, 2018 | Categories: Big Data, Data Engineering

Apache Flink is a little gem that deserves a lot more attention. Let’s dive into Flink’s past, its current state and the future it is heading toward by following the keynotes and presentations at Flink Forward [...]