- Overview
- You can use Pentaho Data Storage Optimizer to inventory stored data, identify its content and usage, and tier files into long-term or archival storage. You can take rule-driven actions on data lifecycles to meet compliance requirements, control cost, and mitigate risk, using a set of convenient tools and self-service processes for sustainable improvements in data management. This applies regardless of the vendor and across local, cloud, and core environments. If needed, you can restore tiered files at any time.
- Get started
- Data Storage Optimizer integrates with the Pentaho Data Catalog application. After installing Data Storage Optimizer, you are ready to start identifying, classifying, and tiering or purging your files.
- Use Data Storage Optimizer
- Use these articles to understand and perform essential tasks in Data Storage Optimizer, including importing data sources, and classifying, tiering, and rehydrating files.
- Management
- You can use the features on the Manage Your Environment page to do the following:
  - View status and import supported data sources into Data Storage Optimizer from available sources in Data Catalog across NFS, SMB/CIFS, HDFS, S3, Cloud object stores and file shares.
  - Monitor data operations.
  - Apply rules-based governance on storage location, life cycle, retention, and tiering or purging.
- Install
- This chapter explains how to install and configure Pentaho Data Storage Optimizer. Pentaho Data Storage Optimizer is installed in parallel with Pentaho Data Catalog.