Hitachi Vantara Lumada and Pentaho Documentation

What's New in Lumada Data Catalog

Lumada Data Catalog 7.1.x combines functionality, architecture, and user experience from Data Catalog and Io-Tahoe, with improvements to the user interface. Use this release to organize and evaluate your data and make it available to your data consumers.


With the new architecture, you can now:

  • Install Data Catalog more quickly than in previous versions.
  • Work in multi-cloud environments.
  • Tag data with business terms and use rules that are driven by machine learning to make the data actionable.
  • Use Keycloak as the primary component for managing your users. Role management and RBAC are still maintained within Data Catalog.
  • Communicate directly with the Data Catalog application server:

    • Deploy your local agent on Kubernetes using Spark.

    • Deploy a standalone agent on Hadoop clusters.

  • Use MongoDB as the main metadata store and object storage for fingerprints.
See Product overview for more information.

New user interface

Data Catalog 7.1.x provides a new graphical user interface for investigating your data. The new UI provides the following features:

  • Navigation

    You can now access your user profile and notifications from the top menu bar, or use the menu bar on the left to navigate to different features in the product. See Quick Start for more information.

  • Data Canvas

    The Data Canvas view for Glossary and Virtual Folders now lets you perform the following tasks in a single view:

    • Browsing resources in a tree structure

      All browsing of data assets is performed on metadata in the repository. You are not required to enter your own credentials to access any JDBC data source.

    • Resource details

      You can view a summary and the details of the resource selected in the tree structure.

    • KPI information

      KPI information is displayed through metrics and statistics widgets.

    See Data Canvas for more information.

  • Data Quality

    As a Data Catalog 7.1.x user, you can now perform the following tasks to help you determine data quality:

    • Data sensitivity

      Determine if data is sensitive and/or confidential using lineage and data quality. See Using a rule for sensitive resource tagging for more information.

    • View data metrics

      View data quality metrics at the field level. See Navigation pane for more information.

    • View data quality dimensions

      View data quality dimensions on business rules. See Managing rules for more information.

    • Browse assets

      Browse data assets, such as virtual folders, using metadata. See Exploring your data for more information.

    • Optimize migration

      Use data rationalization to determine potential copies and supersets or subsets that optimize migration. See Data rationalization for more information.

    • Tag data

      You can tag data with terms that are relevant to business users. See Getting started with business terms and term propagation for more information.

    • Build trust in the data

      Use data lineage, data quality, and sensitivity labels to build trust in the data. See Lineage discovery for more information on data lineage.

  • Galaxy View

    You can now open the Galaxy View to see a visual representation of your data. You can quickly view the structure of your data, its relationships, and its details. See Using Galaxy View for more information.


Data Catalog resource terminology has changed in 7.1.x:

  • Tags are now called Business terms
  • Tag Domains are now called Glossaries

Support for new data sources

Data Catalog 7.1.x includes support for the following data sources:

  • Denodo 8.0
  • Vertica 10.x, 11.x