Trunk Data Platform (TDP)
Trunk Data Platform (TDP) is a fully open source big data distribution based on the Apache ecosystem. The initiative is incubated by The Open Source I Trust (TOSIT), a French association whose mission is to promote open source between large accounts and institutions.
The TDP distribution is based on the open source versions of big data components of the Apache ecosystem. As part of the TDP project, these components are compiled, tested and deployed automatically.
The TDP distribution defines and qualifies a set of versioned components that interact with each other. In addition, it provides the community with tools for deploying platforms. The resulting stack is versioned and evolves along the following axes:
- The evolution of the components that compose it by integrating new versions and applying/backporting fixes;
- Adding new features to the source code of the TDP project.
Any new development has a ripple effect in the compilation of all the components, the validation of tests and the provision of a new version of the distribution in accordance with the recommendations of Semantic Versioning (SemVer).
For ensure the continuation of services, the first versions made available are aligned with those of the HDP 2.6.5 and HDP 3.1.5 distributions. The list of supported components includes: Hadoop (HDFS, YARN, MapReduce), Hive & Tez, Spark, Ranger, HBase, Phoenix, Knox, Oozie, NiFi, Kafka, and ZooKeeper.
Ever since Cloudera and Hortonworks merged, the choice of commercial Hadoop distributions for on-prem workloads essentially boils down to CDP Private Cloud. CDP can be seen as the “best of both worlds…
Apr 14, 2022
When using an operating system, upgrading packages or installing new ones are common tasks that introduce the risk of affecting the stability of the system. NixOS is a Linux distribution that ensures…
Feb 8, 2022
Nix is a functional package manager for Linux and other Unix systems, making the management of packages more reliable and easy to reproduce. With a traditional package manager, when updating a package…
Feb 1, 2022
Job Description Big Data and distributed computing is at Adaltas’ core. We support our partners in the deployment, maintenance and optimization of some of France’s largest clusters. Adaltas is also an…
By Daniel HARTY
Oct 25, 2021
The Hadoop ecosystem gave birth to many popular projects including HBase, Spark and Hive. While technologies like Kubernetes and S3 compatible object storages are growing in popularity, HDFS and YARN…
Dec 18, 2020
The Hortonworks HDP distribution will soon be deprecated in favor of Cloudera’s CDP. One of our clients wanted a new Apache Hive feature backported into HDP 2.6.0. We thought it was a good opportunity…
Oct 6, 2020
Commercial Apache Hadoop distributions have come and gone. The two leaders, Cloudera and Hortonworks, have merged: HDP is no more and CDH is now CDP. MapR has been acquired by HP and IBM BigInsights…
Aug 4, 2020