Hitachi Vantara Lumada and Pentaho Documentation

Launching Data Catalog VMI in Azure

You can launch a new Data Catalog Virtual Machine Image (VMI) from the Microsoft Azure Marketplace. To use the Data Catalog VMI, you must create a customized network security group (NSG) and select or create an SSH key pair during the launch configuration.

Launch the Data Catalog VMI instance

Perform the following steps to launch an instance of the Data Catalog VMI from the Azure Marketplace:

Procedure

  1. Navigate to the Azure Marketplace, then search for Pentaho Data Catalog.

    Note: You can also launch a Data Catalog instance from the Azure portal.
  2. In the search results, click the Pentaho Data Catalog card.

  3. Review the product details and terms, then click Get It Now.

  4. Review your personal details in the window that opens, then click Continue.

    A Pentaho Data Catalog window opens.
  5. Review the product details and terms, then click Create to initiate the launch.

    Note: If you don't already have a subscription, you must choose a subscription option.
    The Create a virtual machine page opens.
  6. In the Administrator account section, configure access to the instance using one of the following options for Authentication type:

    • Select SSH public key to use the SSH public key that corresponds to your SSH private key. Enter pentaho for the SSH username.
    • Select Password to authenticate using a username and password.
  7. Click Next: Disks to proceed to the OS disk section, and specify the virtual machine parameters to use for the Data Catalog instance.

  8. Click Next: Networking to proceed to the Network interface section, then specify the virtual network, the subnet, and a network security group (NSG) that allows access to ports 22 and 443.

  9. (Optional) Click through the remaining sections on the page to configure additional settings.

  10. Click Review + Create to review the launch configuration, then click Create instance to launch the instance.

  11. Record the instance’s IP address or URL.

    This is needed for the Set up an administrator account for the Azure instance procedure.
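
The network security group requirement in step 8 (inbound access on ports 22 and 443) can be sketched as data. The following is a minimal illustration, not the product's own tooling: it builds dictionaries shaped like Azure NSG security rules. The function names, rule names, priorities, and the example source range are assumptions chosen for illustration; substitute the IP range you actually want to allow.

```python
import ipaddress


def inbound_rule(name, port, source_cidr, priority):
    """Return a dict shaped like an Azure NSG rule allowing inbound TCP on one port."""
    # Normalize and validate the source range; raises ValueError if malformed.
    network = ipaddress.ip_network(source_cidr, strict=False)
    if not 1 <= port <= 65535:
        raise ValueError("port must be between 1 and 65535")
    # Azure accepts custom rule priorities from 100 to 4096 (lower wins).
    if not 100 <= priority <= 4096:
        raise ValueError("priority must be between 100 and 4096")
    return {
        "name": name,
        "properties": {
            "protocol": "Tcp",
            "sourcePortRange": "*",
            "destinationPortRange": str(port),
            "sourceAddressPrefix": str(network),
            "destinationAddressPrefix": "*",
            "access": "Allow",
            "priority": priority,
            "direction": "Inbound",
        },
    }


def data_catalog_rules(source_cidr):
    """Rules for the two ports the launch procedure calls for: SSH (22) and HTTPS (443)."""
    # Rule names and priorities here are hypothetical placeholders.
    return [
        inbound_rule("allow-ssh", 22, source_cidr, 1000),
        inbound_rule("allow-https", 443, source_cidr, 1010),
    ]
```

Calling `data_catalog_rules("203.0.113.0/24")` yields two rule dictionaries that you could compare against the rules shown in the Networking section before you create the instance.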

Next steps

Once the instance is running, you can connect to it using HTTPS in the browser on port 443. You might need to create a new rule or edit an existing rule to allow traffic on port 443 from your desired IP addresses or IP ranges.
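
Before opening the browser, you may want to confirm that port 443 is actually reachable from your machine. The following is a minimal sketch using only the Python standard library; the function name `port_is_open` and the example address are assumptions for illustration. A `False` result could mean the instance is still starting or that no rule allows traffic from your IP address.

```python
import socket


def port_is_open(host, port=443, timeout=5.0):
    """Return True if a TCP connection to host:port succeeds within the timeout."""
    try:
        # create_connection resolves the host and attempts a TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name-resolution failures.
        return False


if __name__ == "__main__":
    # 203.0.113.10 is a placeholder; use the IP address recorded in step 11.
    print(port_is_open("203.0.113.10"))
```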

Set up an administrator account for the Azure instance

You must set up an administrator account to manage your Data Catalog instance in the Azure Marketplace.

Before you begin

You must have an IP address or URL for accessing the Data Catalog instance, and an environment that meets the following conditions:
  • An active Data Catalog instance launched from the Azure Marketplace.
  • Traffic allowed on port 443 from your desired IP addresses or IP ranges.

Perform the following steps to set up the account:

Procedure

  1. In a browser, navigate to the Data Catalog IP address or URL resulting from the Launch the Data Catalog VMI instance procedure.

    You must use HTTPS to access the instance. You might see a NET::ERR_CERT_AUTHORITY_INVALID error message, due to Data Catalog's self-signed certificate.
  2. Ignore the error and proceed.

    You can add your own certificates to Data Catalog later.
    You are redirected to the Data Catalog admin account registration page.
  3. On the Create Admin Account page, provide details for the Data Catalog admin user and click Create Account.

    You are logged in to the admin account and see the Data Catalog home page.
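
Because step 2 asks you to proceed past the self-signed certificate warning, you might want to record the certificate's fingerprint so you can recognize the same certificate on later visits. The following is a minimal sketch using the Python standard library, not part of Data Catalog itself; the function names and the example address are assumptions for illustration.

```python
import hashlib
import ssl


def fingerprint_from_pem(pem):
    """Return the SHA-256 fingerprint (hex) of a PEM-encoded certificate."""
    der = ssl.PEM_cert_to_DER_cert(pem)
    return hashlib.sha256(der).hexdigest()


def certificate_fingerprint(host, port=443):
    """Fetch the server certificate without validating it and fingerprint it."""
    # With no ca_certs argument, get_server_certificate does not verify the
    # certificate chain, so it also works for self-signed certificates.
    pem = ssl.get_server_certificate((host, port))
    return fingerprint_from_pem(pem)


if __name__ == "__main__":
    # 203.0.113.10 is a placeholder; use your instance's IP address or hostname.
    print(certificate_fingerprint("203.0.113.10"))
```

You can compare the printed value with the SHA-256 fingerprint your browser shows in its certificate details.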

Next steps

You can begin using Data Catalog or create accounts for other users in your organization.