Lumada Data Catalog 7.0
Welcome to Lumada Data Catalog 7.0. Lumada Data Catalog automates discovery, classification, and management of your enterprise data. Data Catalog services big data, data warehouses, cloud services, and databases across your enterprise. Its patented fingerprinting AI and machine learning capabilities surface data that matters, laying the foundation for efficient and successful analytics, data governance, and compliance. For previous versions, see https://help.hitachivantara.com/Docu...a_Data_Catalog.
- What's New in Lumada Data Catalog
- Lumada Data Catalog 7.0.1 combines functionality, architecture, and user experience from Data Catalog and Io-Tahoe, with improvements to the user interface. Use this release to help you organize and evaluate data, and make it available to your data consumers.
- Product overview
- The Lumada Data Catalog software builds a metadata catalog from data assets residing in tools such as Apache HDFS™ and Hive™, S3, MySQL™, Oracle®, Amazon Redshift®, and Teradata®. It profiles the data assets to produce field-level data quality statistics and to identify representative data so users can efficiently analyze the content and quality of the data.
- Install for 7.1
- This article covers the installation of Lumada Data Catalog onto a Kubernetes cluster, using Helm and Hitachi Vantara-owned Docker images. By the end of these steps, the Data Catalog components will be set up on your designated Kubernetes cluster.
- Get started
- Now that the Lumada Data Catalog has been installed, you are ready to start planning, building, and using your data catalog.
- Use Data Catalog
- Use these articles to understand how to perform essential tasks in Lumada Data Catalog, such as searching the catalog, viewing lineage information, and tagging resources. This section is intended for non-administrative users of the catalog, including data analysts and data stewards.
- Manage
- Administrators can use these articles to learn the tasks involved in managing Lumada Data Catalog, from setting up users and roles to monitoring jobs. These articles are intended for site administrators who handle post-installation and maintenance tasks, such as editing configuration properties for scripts or creating virtual folders. Some of these tasks, such as managing jobs, may be performed by the owner of a data node or resource who knows the data well. In general, these tasks require an administrator (or, in some cases, a data steward) who knows where the data is stored, how to connect to it, the details of the computing environment, and how to issue Linux commands from the command line.
- Apache Ranger and Hive column-level security
- Data Catalog reserved names
- Manage agents
- Manage collections
- Manage custom properties
- Manage data sources
- Manage users
- Manage virtual folders
- Managing business glossaries
- Managing configurations
- Managing job templates
- Managing roles
- Managing rules
- Monitoring job activity
- Monitoring system activity and logs
- Role-based access control (RBAC)
- Search dimensions and custom facets
- Develop and deploy
- Support your system infrastructure, integrate with other systems, and access metadata in Lumada Data Catalog using the REST API. These sections are best used by catalog administrators, developers, and data scientists who are familiar with programming concepts and have extensive metadata experience.
- Utilities
- Use the Utilities articles to support your system infrastructure and effectively maintain the Lumada Data Catalog in your environment.
- Integrations
- Use the Integrations articles to learn how Lumada Data Catalog's plug-in framework can integrate with other applications, including data cleansing and visual tools. Additionally, Data Catalog provides adapters for applications focused on exporting terms and importing lineages.
- REST API
- Lumada Data Catalog provides a REST API to access metadata held in the catalog. The same API allows applications to insert metadata such as property values, business terms, term associations, and lineage relationships. The API provides access to the same operations available from the LDC browser application. See the Lumada Data Catalog REST API documentation for details.