Install
This article covers the installation of Lumada Data Catalog onto a Kubernetes cluster, using Helm and Hitachi Vantara owned Docker images. By the end of these steps, the following Data Catalog components will be set up on your designated Kubernetes cluster:
Component | Description |
LDC Application Server | Runs the Data Catalog application |
Agent (Local agent) | Runs jobs executed by the application server |
MongoDB | Database backing the application |
Keycloak | Open Source Identity Provider (IDP) |
MinIO | Object storage (used for debug purposes only) |
Spark History Server | Helps store spark job logs on an MinIO or S3 bucket (by default this will be in the MinIO component included in set up). |
Based on what deployment pattern your organization will use, standalone remote agents can also be set up. Serving the same purpose as a local agent, a remote agent is set up on an edge node of a separate Hadoop/Spark cluster.
The installation process includes: