Install Pentaho Data Catalog
This article covers the installation of Data Catalog using the release package. You can install Pentaho Data Storage Optimizer in a new or existing Data Catalog environment.
- Installing Data Catalog with or without Data Storage Optimizer
- Installing Data Storage Optimizer into a Data Catalog deployment
Installing Data Catalog
Before you begin, you must have root
privileges or have the necessary permissions to run Docker as part of the installation process.
PDC_DATA_ENCRYPTION_KEY
environment variable in the conf/.env file. If the variable exists, the conf/.env file is retained. However, if the variable does not exist, Data Catalog generates a new .env file containing a PDC_DATA_ENCRYPTION_KEY
environment variable. If needed, you can add any custom environment variable settings back in to the new .env file from your saved file.Perform the following steps to install Data Catalog:
Procedure
Open a terminal window on your dedicated Data Catalog deployment server.
Save the Data Catalog release package in the Data Catalog server.
Extract the files from the release package to the /opt directory using the following command:
The command creates a pentaho directory and extracts the contents of the deployment into a pdc-docker-deployment subdirectory.tar -xvf [name of release package].tar.gz -C /opt
Start all the Dockers using the following command:
sh pdc.sh up
(Optional) If you are installing Pentaho Data Storage Optimizer, copy and paste the following commands to set environment variables and generate the required tokens, add them to the environment files, and restart all the Docker containers:
echo RULES_PDC_AUTH_TOKEN=\"$(./pdc.sh get-jwt-token RULES_ENGINE)\" >> ./conf/.env echo PDSO_PDC_AUTH_TOKEN=\"$(./pdc.sh get-jwt-token PDSO)\" >> ./conf/.env echo PDSO_VFS_EXTERNAL_HOST_IP=\"$(hostname -I | awk '{print $1}')\" >> ./conf/.env echo PDC_FE_PDSO_URL=/pdso/ >> ./conf/.env echo COMPOSE_PROFILES=mongodb,collab,pdso >> ./conf/.env
CautionModifying these settings can have Pentaho product implications, and incorrect changes may negatively impact the functionality of the other product. It is a best practice to collaborate with your Pentaho Data Catalog partner to ensure that any modifications align with your intended objectives.Restart the Docker containers to update them with the new environment changes:
The installation script uses the packaged Docker images for the Data Catalog release and the Data Storage Optimizer release, if installed, to create and run Docker containers on your dedicated server. The installation finishes when each Docker container has successfully started.sh pdc.sh up
Access Data Catalog and Data Storage Optimizer, if installed, through your browser (the Chrome browser is recommended) using the server name or IP address and confirm that the applications are successfully installed and running.
NoteFor new installations, you are redirected to the Create Admin Account page.
Results
Next steps
Installing Data Storage Optimizer into a Data Catalog deployment
The process below installs Data Storage Optimizer to an existing Data Catalog deployment.
Perform the following steps to install Data Storage Optimizer into Data Catalog:
Procedure
Open a terminal window on your dedicated Data Catalog deployment server.
On the Data Catalog server, navigate to the pentaho/pdc-docker-deployment directory using the following command:
cd pentaho/pdc-docker-deployment
Start all the Dockers using the following command:
sh pdc.sh up
Copy and paste the following commands to generate the required tokens and add them to the environment files:
echo RULES_PDC_AUTH_TOKEN=\"$(./pdc.sh get-jwt-token RULES_ENGINE)\" >> ./conf/.env echo PDSO_PDC_AUTH_TOKEN=\"$(./pdc.sh get-jwt-token PDSO)\" >> ./conf/.env echo PDSO_VFS_EXTERNAL_HOST_IP=\"$(hostname -I | awk '{print $1}')\" >> ./conf/.env echo PDC_FE_PDSO_URL=/pdso/ >> ./conf/.env echo COMPOSE_PROFILES=mongodb,collab,pdso >> ./conf/.env
CautionModifying these settings can have Pentaho product implications, and incorrect changes may negatively impact the functionality of the other products. It is a best practice to collaborate with your Pentaho Data Catalog partner to ensure that any modifications align with your intended objectives.Re-run the installation script:
./pdc.sh up
The installation script uses the packaged Docker images for the Data Storage Optimizer release to create and run Docker containers on your dedicated server. The installation finishes when each Docker container has been successfully started.
Data Storage Optimizer is successfully installed.
Results