Support Ukrain
Adaltas logoAdaltasAdaltas logoAdaltas

Trunk Data Platform (TDP)

Trunk Data Platform (TDP) is a fully open source big data distribution based on the Apache ecosystem. The initiative is incubated by The Open Source I Trust (TOSIT), a French association whose mission is to promote open source between large accounts and institutions.

The TDP distribution is based on the open source versions of big data components of the Apache ecosystem. As part of the TDP project, these components are compiled, tested and deployed automatically.

The TDP distribution defines and qualifies a set of versioned components that interact with each other. In addition, it provides the community with tools for deploying platforms. The resulting stack is versioned and evolves along the following axes:

  • The evolution of the components that compose it by integrating new versions and applying/backporting fixes;
  • Adding new features to the source code of the TDP project.

Any new development has a ripple effect in the compilation of all the components, the validation of tests and the provision of a new version of the distribution in accordance with the recommendations of Semantic Versioning (SemVer).

For ensure the continuation of services, the first versions made available are aligned with those of the HDP 2.6.5 and HDP 3.1.5 distributions. The list of supported components includes: Hadoop (HDFS, YARN, MapReduce), Hive & Tez, Spark, Ranger, HBase, Phoenix, Knox, Oozie, NiFi, Kafka, and ZooKeeper.

Related articles

Spark on Hadoop integration with Jupyter

Spark on Hadoop integration with Jupyter

Categories: Adaltas Summit 2021, Infrastructure, Tech Radar | Tags: YARN, HDP, Infrastructure, Jupyter, Spark, CDP, Notebook, TDP

For several years, Jupyter notebook has established itself as the notebook solution in the Python universe. Historically, Jupyter is the tool of choice for data scientists who mainly develop in Python…



Sep 1, 2022

Canada - Morocco - France

International locations

10 rue de la Kasbah
2393 Rabbat

We are a team of Open Source enthusiasts doing consulting in Big Data, Cloud, DevOps, Data Engineering, Data Science…

We provide our customers with accurate insights on how to leverage technologies to convert their use cases to projects in production, how to reduce their costs and increase the time to market.

If you enjoy reading our publications and have an interest in what we do, contact us and we will be thrilled to cooperate with you.