Skip to main content

Pentaho+ documentation has moved!

The new product documentation portal is here. Check it out now at docs.hitachivantara.com

 

Hitachi Vantara Lumada and Pentaho Documentation

Release highlights

Lumada Data Catalog version 6.1 delivers a variety of features and enhancements, including Data Rationalization and search improvements. This release also continues to enhance property configuration and the rules interface experience.

For highlights of earlier releases, see the following articles:

Documentation website

Starting in this release, Data Catalog documentation has a new home powered by MindTouch, The new publicly available site provides robust, faceted searching within the documentation, easy-to-navigate breadcrumbs, and a user-friendly look and feel.

To make the most of this dynamic documentation center, we encourage you to register on the site. To register, click Sign In at the top of this page. Also, remember to bookmark this site for ease of access.

Data Rationalization

Data Catalog now features the Data Rationalization dashboard for identifying and analyzing overlaps and copies in your data lake. Duplicate data clutters search results, confuses analysts looking for authoritative data, increases the surface area to secure against breaches, and increases the cost of data storage and processing.

With Data Rationalization, you can identify and analyze copies and overlaps of the data. Data Rationalization uses patented data fingerprinting technology to determine the presence and degree of overlapping data, which is a first step in identifying copies to remove or to promote a well-curated, authoritative copy. See Data Rationalization for further instructions.

Search enhancements

Use the following new features to enhance your searches:

  • Advanced search

    You can now apply tag-based filters that include or exclude selected tags and child tags to perform conjunctive and disjunctive searches.

  • Custom search

    Use the new custom search editor to write free-form searches that include Boolean operators using the Data Catalog search language.

  • Glossary search

    Large, structured domains are great for organizing knowledge, but they can add complexity when you want to update the definition or settings of a tag. You can now search for a tag within the glossary, or nested in a tree below another tag.

Export your findings to a CSV file

You can now export catalog metadata to a CSV file to analyze offline or share, such as the contents of a virtual folder, details of a single resource, a search result, or Data Rationalization findings. See Export your findings to a CSV file for instructions.

Configuration property improvements

To simplify security, you can now edit all the configuration properties through a graphical interface. Previously, some configuration properties could only be edited through the CLI. See Managing configurations for more information.

Rules interface improvements

You can now use the rules interface to enable and disable rules for improved development, insert element prompts, and apply default templates. See Managing rules for more information.