Skip to main content

Pentaho+ documentation has moved!

The new product documentation portal is here. Check it out now at


Hitachi Vantara Lumada and Pentaho Documentation


This article covers the installation of Lumada Data Catalog onto a Kubernetes cluster, using Helm and Hitachi Vantara owned Docker images. By the end of these steps, the following Data Catalog components will be set up on your designated Kubernetes cluster:

Component Description
LDC Application ServerRuns the Data Catalog application
Agent (Local agent)Runs jobs executed by the application server
MongoDBDatabase backing the application
KeycloakOpen Source Identity Provider (IDP)
MinIOObject storage (used for debug purposes only)
Spark History ServerHelps store spark job logs on an MinIO or S3 bucket (by default this will be in the MinIO component included in set up).

Based on what deployment pattern your organization will use, standalone remote agents can also be set up. Serving the same purpose as a local agent, a remote agent is set up on an edge node of a separate Hadoop/Spark cluster.

The installation process includes: