Launching a Data Catalog AMI in AWS
You can launch a new Data Catalog Amazon Machine Image (AMI) from the AWS Marketplace. To use the Data Catalog AMI, you must create a customized security group and select or create an SSH key pair during the launch configuration.
Launch the Data Catalog AMI instance
Procedure
From the AWS Console Home page, click EC2.
The EC2 Dashboard opens with a Launch instance card.NoteYou can also launch a Data Catalog AMI from the Amazon Marketplace.On the Launch instance card, click Launch instance.
The Launch an instance page opens.Add a name for the instance.
In the Application and OS images (Amazon Machine Image) card, enter Pentaho Data Catalog in the search field.
On the Pentaho Data Catalog result, click Select.
Review the product overview, details, and pricing, and click Continue to accept the terms.
On the Instance type card, choose an instance type from the list.
For a Production environment, it is a best practice to use a 2xlarge or larger instance type.On the Key pair (login) card, select an existing key pair to connect securely, or create a new key pair.
If you create a new private key, the file downloads automatically to your local computer.NoteMake sure to store the private key file in a secure location, because you need it to connect to the instance using SSH.On the Network settings card, make the following selections:
- Under Firewall (security groups), select an existing security group or create a new one.NoteAny existing security group you select must support SSH and HTTPS traffic.
- Select the Allow SSH traffic from checkbox and choose My IP from the list.NoteUse the username
pentaho
and port 22 for SSH access. - Select the Allow HTTPS traffic from the internet checkbox.
- Under Firewall (security groups), select an existing security group or create a new one.
On the Configure storage card, specify at least 512 GiB for a Production instance.
Click Launch instance.
The instance launches, and you are subscribed to the Marketplace AMI. When the process is complete, a success message includes a link to the instance, with the unique instance ID.Record the instance’s IP address or URL.
It is needed for the Set up an administrator account for the AWS instance procedure.Click the instance link.
The Instances page opens.Select the checkbox next to the Data Catalog instance and click Launch instances.
Next steps
Set up an administrator account for the AWS instance
You must set up an administrator account to manage your Data Catalog instance in the AWS Marketplace.
Before you begin
- an active Data Catalog instance in the AWS Marketplace.
- traffic allowed on port 443 from your desired IP addresses or IP ranges.
Perform the following steps to set up the account:
Procedure
In a browser, navigate to the Data Catalog IP address or URL resulting from the Launch the Data Catalog AMI instance procedure.
You must use HTTPS to access the instance. You might see a NET::ERR_CERT_AUTHORITY_INVALID error message, due to Data Catalog's self-signed certificate.Ignore the error and proceed.
You can add your own certificates to Data Catalog later.You are redirected to the Data Catalog admin account registration page.On the Create Admin Account page, provide details for the Data Catalog admin user and click Create Account.
You are logged in to the admin account and see the Data Catalog home page.
Next steps