Tech Radar

Yahoo’s Vespa Engine

Vespa is Yahoo’s fully autonomous and self-sufficient big data processing and serving engine. It aims at serving results of queries on huge amounts of data in real time. An example of this would be serving search results or recommendations to a user. Yahoo – or Oath – recently made Vespa open source on GitHub. At [...]

By |2018-06-05T22:36:53+00:00October 16th, 2017|Categories: Tech Radar|Tags: , |0 Comments

Get in control of your workflows with Apache Airflow

Below is a compilation of my notes taken during the presentation of Airflow by Christian Trebing from BlueYonder. Introduction Use case : how to handle data coming in regularly from customers ? Option 1 : use CRON only time triggers hard error handling inconvenient when overlapping Option 2 : Writing a workflow processing tool start is easy [...]

By |2019-06-19T07:08:27+00:00July 17th, 2016|Categories: Events, Tech Radar|Tags: , , , |0 Comments

Apache Apex : next gen Big Data analytics

Below is a compilation of my notes taken during the presentation of Apache Apex by Thomas Weise from DataTorrent, the company behind Apex. Introduction Apache Apex is an in-memory distributed parallel stream processing engine, like Flink or Storm. However, it is built with native Hadoop integration in mind : Yarn is used for resource managing [...]

By |2019-06-21T19:57:57+00:00July 17th, 2016|Categories: Data Science, Events, Tech Radar|0 Comments

Node.js is now integrated to the Microsoft Azure platform

Node is now a first class citizen in the Microsoft Azure cloud environment alongside .Net, Java and PHP. This integration is the logical consequence of Microsoft's involvement in the development of Node a year ago. Originally only available on Unix platforms (Linux, OSX, ...), Node version 5 was mainly dedicated to porting the engine to [...]

By |2019-06-23T21:52:32+00:00December 11th, 2011|Categories: Tech Radar|Tags: , , , |0 Comments