Skip to main content
- What's new in Data Catalog

- Pentaho Data Catalog 10.0.1 provides support for Pentaho Data Storage Optimizer. Data Storage Optimizer is an intelligent data storage tiering solution that reduces operating costs and gives you seamless access to Hadoop data with S3 compatible object storage like Hitachi Content Platform.
- Product overview

- The Pentaho Data Catalog rapidly ingests, profiles, and meticulously curates structured and unstructured data through a combination of automation and machine learning. This process involves data fingerprinting and the application of metadata rules to provide contextualization aligned with the business's terminology as documented in the business glossary.
- Install

- This article covers the installation of Pentaho Data Catalog. For every release, a release package containing all necessary Data Catalog software and dependencies for installation is made available.NoteTo access the appropriate release package, Hitachi Vantara provides specific credentials along with a URL download link. These credentials grant you access to download the required package for your server.
- Get started with Data Catalog

- Before you use Pentaho Data Catalog, you must plan the data sources you want to add and the glossaries and terms it will use.
- Use Data Catalog

- Use these articles to understand how to perform essential tasks in Pentaho Data Catalog. This section is intended for non-administrative users of the catalog, including data analysts and data stewards.
- Manage

- Administrators can use these articles to learn the tasks in managing the Pentaho Data Catalog, from setting up users and roles to monitoring jobs. These articles are intended for site administrators who are involved in post-installation and maintenance tasks, such as editing configuration properties for scripts or creating virtual folders. Some of these tasks may be performed by the owner of a data node or resource who knows the data well, such as managing jobs. In general, these tasks require an administrator (or data steward in some cases) who knows where the data is stored, how to connect to it, details about the computing environment, and how to use the command line to issue commands for Linux.
- Develop and deploy

-
UtilitiesUse Advanced configuration to support your system infrastructure and effectively maintain Pentaho Data Catalog in your environment.