Skip to main content

Pentaho+ documentation has moved!

The new product documentation portal is here. Check it out now at docs.hitachivantara.com

 

Hitachi Vantara Lumada and Pentaho Documentation

Using Oozie

Parent article

By default, the Oozie user runs the Oozie jobs. If you use PDI to start an Oozie job, you must set up the PDI user to execute the program in proxy. This setup consists of the following tasks:

  • Set up Oozie on a cluster
  • Set up Oozie on the server

Set up Oozie on a cluster

Perform the following steps to add a PDI user to the oozie-site.xml file:

Procedure

  1. Open the oozie-site.xml file on the cluster.

  2. Add the following lines of the code to the oozie-site.xml file on cluster, substituting <your_pdi_user_name> with the PDI user name, such as jdoe.

    <property>
    <name>oozie.service.ProxyUserService.proxyuser.<your_pdi_user_name>.groups</name>
    <value>*</value>
    </property>
    <property>
    <name>oozie.service.ProxyUserService.proxyuser.<your_pdi_user_name>.hosts</name>
    <value>*</value>
    </property>
  3. Save and close the file.

Set up Oozie on the server

Perform the following steps to add a PDI proxy user on the server:

Procedure

  1. Navigate to the <username>/.pentaho/metastore/pentaho/NamedCluster/Configs/<user-defined connection name> directory and open the config.properties file.

    NoteThis filepath and the config.properties file are created when you create a named connection. See Adding a cluster connection
  2. Add the proxy user name to the pentaho.oozie.proxy.user parameter.

  3. Save and close the file.

Next steps

Refer to Connecting to a Hadoop cluster with the PDI client for more connection information.