Manage metadata rules
If Pentaho Data Catalog has Pentaho Data Storage Optimizer integrated and you have the Data Quality Administrator role, you can see an additional tile Metadata Rules in Management. Leveraging the Data Storage Optimizer metadata rule engine capabilities, you can easily manage the data by creating metadata rules. These metadata rules enable you to tag, assign business terms, and add properties to the database resources. By doing so, you can streamline the process of searching for data and make it more efficient.
Create a metadata rule definition
You can create a metadata rule definition by setting the rule criteria and the action. The rule criteria define the conditions that are translated and evaluated into a query for execution against every qualifying resource. A rule action is what the rule does, and it can involve various tasks.
Perform the following steps to create a metadata rule definition:
Procedure
In the left navigation menu, click Management.
The Manage Your Environment page opens.In the Metadata Rules tile, click Add New and then select Add Rule Definition.
The Add Rule Definition page opens.In the Name box, enter a name.
In the Description box, enter a description for the rule definition.
To set the Rule Criteria, select the object type from the list, which includes File, Column, Table, and Folder.
Click Create a Condition or Create a Condition Group to define the rule's conditions.
In the Attribute list, select an attribute.
NoteThe list of available attributes changes based on the object type you selected.Object Attribute Description File Name The name of the file. File Type The format or category of the file that indicates its nature. Last Scanned Date The most recent date when the file was scanned. Creation Date The date when the file was created. Last Modified Date The most recent date when the file content was changed. Last Access Date The latest date the file was opened or accessed. Column Name The name of the column. Data Type The type of data that the column can store, such as text, number, or date. Table The table to which the column belongs. Last Profile Date The most recent date when the column was profiled or analyzed. Table Name The name of the table. Schema Name The name of the schema to which the table belongs. Database Name The name of the database where the table is stored. Last Profile Date The most recent date when the table was profiled or analyzed. Folder Name The name of the folder. Last Scanned Date The most recent date when the folder was scanned. Creation Date The date when the folder was created. Last Modified Date The most recent date when the content of the folder was changed. Last Access Date The latest date the folder was opened or accessed. In the Operator box, select an operator and specify the corresponding value. The query is generated based on the selections.
NoteThe available operators depend on the selected attribute, and the value type varies based on the selected operator.For example, to manage and access documents efficiently, to keep track of the most recent and relevant information. You can assign a business term for the data that is recently modified or accessed. To achieve this, select File as an Object and set the Attribute as the Last Modified Date and Last Access Date.
The query is generated based on the selections.
In the Rule Actions box, set an action type from the following list.
Action Type Description Apply Business Terms Select the term name that you want to add and select the object. This action applies to the Current Object or Parent Object. Add Property Select the property that you want to add and select the object.
This action applies to the Current Object or Parent Object.
Remove Property Select the property that you want to add, and then select the object.
This action applies to the Current Object or Parent Object.
Apply Tags Enter the tag that you want to apply and then click Enter. Remove Tags Enter the tag that you want to remove, and then click Enter. Remove Business Terms Select the term name that you want to remove, and then select the object.
This action applies to the Current Object or the Parent Object.
For example, based on the conditions, set the action Apply Business Term as Latest Updates. If the two conditions mentioned above are met, then the Latest Updates business term will be assigned to all the related metadata that fulfills these criteria.
Click Save.
Results
Update a metadata rule definition
Procedure
In the left navigation menu, click Management.
The Manage Your Environment page opens.In the Metadata Rules tile, click Definitions.
Locate the rule definition you want to edit in the rule definition table, click the More actions (three dots) icon, and then click Edit.
Edit the required fields and then click Save.
Results
Delete a metadata rule definition
Perform the following step to delete the metadata rule definition:
Procedure
In the left navigation menu, click Management.
The Manage Your Environment page opens.In the Metadata Rules tile, click Definitions.
The Manage Rule Definitions page opens.Locate the metadata rule definition you want to delete in the metadata rule definition list, click the More actions (three dots) icon, and then click Delete.
Results
Create a metadata rule
Perform the following steps to create a new rule:
Procedure
1. In the left navigation menu, click Management.
The Manage Your Environment page opens.In the Metadata Rules tile, click Add New, and then select Add Rule.
The Add Rule page opens.Enter a name in Rule Name and select a data source in Source Data Asset.
In Select rule definition, select the rule definition if you have already created it or click Create Rule Definition to create a rule definition.
For more information, see Create a metadata rule definition.Select the Run rule definition now checkbox to run the rule. Alternatively, you can also schedule to run the rule daily or on a specific day.
Click Add Schedule and select Daily or On a date to set the schedule as required.
Results
Update a metadata rule
Procedure
In the left navigation menu, click Management.
The Manage Your Environment page opens.In the Metadata Rules tile, click Rules.
The Metadata Rules page opens.Locate the metadata rule you want to edit in the metadata rules list, click the More actions (three dots) icon, and then select Edit.
The Rule page opens.Edit the required fields and click Apply.
Results
Delete a rule
If a rule is no longer needed, you can delete it. Perform the following steps to delete a rule:
Procedure
In the left navigation menu, click Management.
The Manage Your Environment page opens.In the Metadata Rules tile, click Rules.
The Metadata Rules page opens.Locate the metadata rule you want to delete in the metadata rules list, click the More actions (three dots) icon, and then select Delete.
Results