Big Data

Hadoop installation on OSX in pseudo-distributed mode

[crayon-5d7ff1a8d4ff4144161247/] The operating system chosen is OSX but the procedure is not so different for any Unix environment because most of the software is downloaded from the Internet, uncompressed and set manually. Only a few packages are installed by Macport but these are easily found on equivalent tools like Apt and Yum. Since the downloaded [...]

By |2019-06-23T21:39:15+00:00December 1st, 2010|Categories: Hack|Tags: , , , |0 Comments

Node HBase, a NodeJs client for Apache HBase

HBase is a "column familly" database from the Hadoop ecosystem built on the model of Google BigTable. HBase can accommodate very large volumes of data (tera or peta) while maintaining high availability and fast response times. Adaltas has posted a Node.js client for HBase whose code is published on GitHub and which uses the REST [...]

By |2019-06-23T21:36:24+00:00November 1st, 2010|Categories: Big Data|Tags: , , , |0 Comments

MapReduce introduction

Information systems have more and more data to store and process. Companies like Google, Facebook, Twitter and many others store astronomical amounts of information from their customers and must be able to serve them with the best recommendations while ensuring the sustainability of their systems. Description MapReduce is a way of modeling a program to [...]

By |2019-06-21T23:22:29+00:00June 26th, 2010|Categories: Big Data|Tags: , , , |0 Comments