Lumada Data Catalog 6.1
Welcome to Lumada Data Catalog 6.1. Lumada Data Catalog automates discovery, classification, and management of your enterprise data. Data Catalog services big data, data warehouses, cloud services, and databases across your enterprise. Its patented fingerprinting AI and machine learning capabilities surface data that matters, laying the foundation for efficient and successful analytics, data governance, and compliance.
- Release highlights
- Lumada Data Catalog version 6.1 delivers a variety of features and enhancements, including Data Rationalization and search improvements. This release also continues to enhance property configuration and the rules interface experience.
- Product overview
- The Data Catalog software builds a metadata catalog from data assets residing in Apache HDFS and Hive, S3, MySQL, Oracle, Amazon Redshift, Teradata, etc. It profiles the data assets to produce field-level data quality statistics and to identify representative data so users can efficiently analyze the content and quality of the data.
- Install
- This article walks you through the installation of Lumada Data Catalog, including pre-installation and post-installation tasks.
- Apache Solr configuration
- Component validations
- Configure the Data Catalog service user
- Installing Lumada Data Catalog
- Installing Lumada Data Catalog on Amazon EMR
- Installing Lumada Data Catalog on Azure HDInsight
- Installing Lumada Data Catalog on CDH, HDP, or CDP
- Installing Lumada Data Catalog on GCP Dataproc
- Installing Lumada Data Catalog on MapR
- Installing standalone Solr
- LDAP Configuration
- OIDC with OAuth support
- Post-Installation steps
- Post-install system configurations
- Pre-Installation steps
- Single sign-on with SAML
- Solr on CDH and CDP
- Solr on HDP
- SSL configuration
- System requirements
- Uninstalling Lumada Data Catalog
- Upgrade
- You can upgrade Lumada Data Catalog from version 5.0, 2019.1, and 2019.3 to the latest version of 6.x. If you are upgrading from 2019.3, see Upgrading to Lumada Data Catalog 6.x for instructions. If you are upgrading from 5.0 or 2019.1, follow this process: first upgrade to 2019.3 (see Upgrading to 2019.3 for instructions), then upgrade to 6.1 (see Upgrading to Lumada Data Catalog 6.x for instructions).
- Get started
- Now that the Lumada Data Catalog has been installed, you are ready to start planning, building, and using your data catalog.
- Use Data Catalog
- Use these articles to understand how to perform essential tasks in Lumada Data Catalog, such as searching the catalog, viewing lineage information, and tagging resources. This section is intended for non-administrative users of the catalog, including data analysts and data stewards.
- Browsing Data Catalog assets
- Business entities
- Data Catalog overview
- Data preview
- Data rationalization
- Discussions and notifications
- Field browsing
- Lineage and origins
- Managing datasets
- Managing data objects
- Managing jobs
- Quick Start
- Resource properties
- Searching Data Catalog
- Single resource view details
- Tagging resources and fields
- Touring the DataOps Dashboard
- Manage
- Administrators can use these articles to learn how to manage Lumada Data Catalog, from setting up users and roles to monitoring jobs. These articles are intended for site administrators who are involved in post-installation and maintenance tasks, such as editing configuration properties for scripts or creating virtual folders. Some of these tasks, such as managing jobs, may be performed by the owner of a data node or resource who knows the data well. In general, these tasks require an administrator (or data steward in some cases) who knows where the data is stored, how to connect to it, and the details of the computing environment, and who can issue commands from the Linux command line.
- Apache Ranger and Hive column-level security
- Collections
- Data Catalog reserved names
- Manage agents
- Manage Datasets
- Manage data sources
- Manage users
- Manage virtual folders
- Managing configurations
- Managing custom properties
- Managing data objects
- Managing job templates
- Managing roles
- Managing rules
- Managing tag domains
- Managing the metadata server
- Monitoring job activity
- Monitoring system activity and logs
- Role-based access control (RBAC)
- Search dimensions and custom facets
- Search ranking with query boosting
- Develop and deploy
- Support your system infrastructure, integrate with other systems, and access metadata in Lumada Data Catalog using the REST API. These sections are best used by catalog administrators, developers, and data scientists who are familiar with programming concepts and have extensive metadata experience.
- Utilities
- Use the Utilities articles to support your system infrastructure and effectively maintain the Lumada Data Catalog in your environment.
- Integrations
- Use the Integrations articles to learn how Lumada Data Catalog's plug-in framework can integrate with other applications, including data cleansing and visual tools. Additionally, Data Catalog provides adapters for applications focused on exporting tags and importing lineages.
- REST API
- Lumada Data Catalog provides a REST API to access metadata held in the catalog. The same API allows applications to insert metadata such as property values, tags, tag associations, and lineage relationships. The API provides access to the same operations available from the LDC browser application. View the Lumada Data Catalog REST API documentation for details.