Support Ukrain
Adaltas logoAdaltasAdaltas logoAdaltas

Spark MLlib

Apache Spark MLlib is a machine learning library which runs on top of Spark core. It supports distributed computing and it can scale vertically and horizontally. It offers APIs for Java, Scala, Python, R and SQL.

It provides tools such as:

  • ML Algorithms: common learning algorithms such as classification, regression, clustering, and collaborative filtering
  • Featurization: feature extraction and selection, transformation, dimensionality reduction
  • Pipelines: tools for constructing, evaluating, and tuning ML pipelines
  • Persistence: saving and loading of algorithms, models and pipelines
  • Utilities: linear algebra, statistics, data handling, etc.
Related tags
Machine Learning

Related articles

MLflow tutorial: an open source Machine Learning (ML) platform

MLflow tutorial: an open source Machine Learning (ML) platform

Categories: Data Engineering, Data Science, Learning | Tags: AWS, Azure, Databricks, Deep Learning, Deployment, Machine Learning, MLflow, MLOps, Python, Scikit-learn

Introduction and principles of MLflow With increasingly cheaper computing power and storage and at the same time increasing data collection in all walks of life, many companies integrated Data Science…

Canada - Morocco - France

International locations

10 rue de la Kasbah
2393 Rabbat

We are a team of Open Source enthusiasts doing consulting in Big Data, Cloud, DevOps, Data Engineering, Data Science…

We provide our customers with accurate insights on how to leverage technologies to convert their use cases to projects in production, how to reduce their costs and increase the time to market.

If you enjoy reading our publications and have an interest in what we do, contact us and we will be thrilled to cooperate with you.