Advanced configuration
After installing Data Catalog, there may be other components you need to set up, depending on your environment. Use the following topics as needed, to finish setting up your environment.
Configure system environment variables
Although not common, there may be instances where you need to change the default settings for Data Catalog system environment variables. These configuration modifications allow you to override default system behavior to align with your specific needs.
Procedure
-
In a terminal window, navigate to the pdc-docker-deployment directory and open the hidden environment variable configuration file (.env). This file is located in the /opt directory by default.
-
Verify the system environment variables set in the /opt/pentaho/pdc-docker-deployment/vendor/.env.default file:
- For example, the number of worker instances that Data Catalog uses to run processes is set to 5:
PDC_WS_DEFAULT_OPS_JOBPOOLMINSIZE=5 PDC_WS_DEFAULT_OPS_JOBPOOLMAXSIZE=5
NoteMake sure thatPDC_WS_DEFAULT_OPS_JOBPOOLMINSIZE
andPDC_WS_DEFAULT_OPS_JOBPOOLMAXSIZE
have the same value for consistent worker instance management.
- For example, the number of worker instances that Data Catalog uses to run processes is set to 5:
-
To override an environment variable set in the vendor/.env.default file, you can create a new .env file in the opt/pentaho/pdc-docker-deployment/conf/ folder:
vi opt/pentaho/pdc-docker-deployment/conf/.env
-
After adding all required system variables, save your changes and restart the Data Catalog system services.
./pdc.sh stop ./pdc.sh up