Components Reference
Pentaho aims to accommodate diverse computing environments. This list provides details about the environment components and versions we support. Where applicable, versions are listed as certified or supported:
Certified
The version has been tested and validated for compatibility with Pentaho.
Supported
Support is available for listed non-certified versions.
If you have questions about your particular computing environment, contact Pentaho Support.
Hitachi Vantara products
The following Hitachi Vantara products are certified for Pentaho 9.5:
- Data Catalog 7.5.2
- Hitachi Content Platform 9.4
Server
The Pentaho Server is hardware-independent and runs on server-class computers that comply with these specifications for minimum hardware and required operating systems:
Hardware—64 bit | Operating System—64 bit | |
Certified | Supported | |
|
|
|
Container deployment
Supported technology for deploying Pentaho in containers.
Technology | Certified | Supported |
Docker | 20.10.10 | 20.x |
Workstation
These Pentaho design tools are hardware-independent and run on client-class computers that comply with these specifications for minimum hardware and required operation systems.
- Pentaho Aggregation Designer
- Pentaho Data Integration
- Pentaho Metadata Editor
- Pentaho Report Designer
- Pentaho Schema Workbench
Hardware—64 bit | Operating System—64 bit | |
Certified | Supported | |
|
|
|
Embedded software
When embedding Pentaho software into other applications, the computing environment should comply with these specifications for minimum hardware and required operation systems.
- Embedded Pentaho Reporting
- Embedded Pentaho Analysis
- Embedded Pentaho Data Integration
Hardware—64 bit | Operating System—64 bit | |
Certified | Supported | |
|
|
|
Application servers
Servers to which you deploy Pentaho software must run one of these application servers:
- JBoss EAP 7.3
- Tomcat 9.0.70 (Certified) 10.1 (Supported)
Solution database repositories
Pentaho software stores processing artifacts in these database repositories:
Certified | Supported |
PostgreSQL 14 | PostgreSQL 12.x & 13.x * |
MySQL 8.026 | MySQL 5.7 |
Oracle 21c | Oracle 19c & 21c (including patched versions) |
MS SQL Server 2019 | Microsoft SQL Server 2017 & 2019 (including patched versions) |
* The default installed solution database.
Apache Hadoop vendors
Pentaho software has certified or supported data sources from these Hadoop Vendors.
Vendor | Driver Version |
Cloudera Data Platform (CDP) Private Cloud | 7.17 (Supported) |
Cloudera Data Platform (CDP) Public Cloud | 7.2 |
Google Dataproc | 2.1 |
EMR | 5.36 |
Microsoft Azure HDInsight | 4.0 |
Data Sources: General
Pentaho software supports the following data sources. Check this list if you are evaluating Pentaho or checking for general compatibility with a specific vendor.
Data Source | Certified | Supported |
Salesforce | 54 | 54.x |
Amazon Redshift | 1.2.34.1058 | 2.1 |
Snowflake | 3.13.29 | 3.13..10 |
Hitachi Content Platform | 9.4 | 9.4 |
Pentaho Tools
This table summarizes which data sources are compatible with the main Pentaho tools.
Pentaho Software | Data Source |
Pentaho Reporting |
|
Pentaho Server, Action Sequences |
|
Pentaho Data Integration |
|
* Use a JDBC 3.x or 4.x compliant driver that is compatible with SQL-92 standards when communicating with relational data sources. For a list of drivers to use with relational JDBC databases, see the JDBC drivers reference. |
Big Data Sources: General
Pentaho software supports the following Big Data sources. Check this list if you are evaluating Pentaho or checking for general compatibility with a specific vendor.
Data Source | Supported Version |
Amazon EMR (via Hive) | 5.21, 5.24, and 5.36 (Certified) |
Cloudera Data Platform (via Hive or Impala) | 7.1.x |
Cloudera Data Platform (Public cloud) | 7.2 (Certified) |
Datastax | 6.7 |
Google BigQuery | 1.2.25 |
Google Dataproc | 2.1 |
Greenplum | 4.2, 4.3 |
Microsoft Azure HDInsight | 4.0 |
MongoDB | 4.4, 4.4.6 |
Netezza | 7.1, 7.2 |
SAP HANA | SPS |
Teradata | 16.20 |
Vertica | 11 |
Big Data Sources: Details
This table shows the Big Data sources that are compatible with specific Pentaho tools.
Data Source | Versions | Analyzer | PIR/PDD | Pentaho Reporting | DSW | PDIServer/Client | PRD | PSW | PME |
Amazon EMRa | 5.21, 5.24, 5.36 | Yes | Yes | No | No | Yes | Yes | No | Yes |
Cloudera Data Platform (CDP) Private Cloud | 7.1.7 (for job execution) | No | No | No | No | Yes | Yes | No | Yes |
via Impalab (as data source) | Yes | Yes | Yes | Yes | Yes | Yes | No | Yes | |
via Hive3c (as data source) | No | Yes | Yes | Yes | Yes | Yes | No | Yes | |
Datastax | 4.6, 4.8 | No | No | No | No | Yes | No | No | No |
Google BigQuery | 1.2.25d | Yes | Yes | Yes | Yes | Yes | Yes | Yes | Yes |
Google Dataproce (for job execution) | 1.4, 2.2f | No | No | No | No | Yes | Yes | No | No |
via Hive2 and Google BigQuery (as data source) | Yes | Yes | Yes | Yes | Yes | Yes | No | Yes | |
Greenplum | 4.2, 4.3 | Yes | Yes | Yes | Yes | Yes | Yes | Yes | Yes |
Microsoft Azure HDInsight | 4.0 | Yes | Yes | No | No | Yes | No | No | Yes |
MongoDB | 4.4 | No | No | Yes | No | Yes | Yes | No | No |
Netezza | 7.1, 7.2 | Yes | Yes | Yes | Yes | Yes | Yes | Yes | Yes |
SAP HANA | SPS | No | No | No | No | Yes | No | No | No |
Teradata | 16.20 | Yes | Yes | Yes | Yes | Yes | Yes | Yes | Yes |
Vertica | 10 & 11 | Yes | Yes | Yes | Yes | Yes | Yes | Yes | Yes |
Notes: A generic Apache Hadoop driver is included in the Pentaho distribution
for version 9.5: Other supported drivers can be downloaded from the Support Portal. a Use the EMR 5.21 driver for your EMR 5.24 or EMR 5.36 cluster. The EMR 5.21 driver is certified to work for EMR 5.24. b You must have the current version of the Pentaho release to use the CDP 7.1.4 driver. The CDP 7.1.4 driver requires the Impala JDBC Connector 2.6.4 Cloudera driver. c Hive3 as a data source for CDP also supports Hive LLAP, and Hive3 on Tez. d The Simba driver required for Google BigQuery is the JDBC 4.2-compatible version. See https://cloud.google.com/bigquery/partners/simba-drivers/ . e HBase is not supported with Google Dataproc. f Use the Google Dataproc 1.8 driver for your Google Dataproc 2.2 cluster. The Google Dataproc 1.8 driver is certified to work for Google Dataproc 2.2. |
SQL dialect-specific
Pentaho software generates dialect-specific SQL when communicating with these data sources. Certified indicates the SQL dialect has been tested for compatibility with Pentaho.
Pentaho Software | Data Source |
Pentaho Analyzer |
Certified
Supported
|
Pentaho Metadata |
Certified
Supported
|
Pentaho Data Integration |
Certified
Supported
|
* If your data source is not in this list and is compatible with SQL-92, Pentaho software uses a generic SQL dialect. |
Third-party libraries
Pentaho software is compatible with the following third-party web framework, file system, engine, and utility libraries:
- AngularJS 1.8.0 (Supported)
- HTTPClient 4.5.9
- Apache VFS 2.7.0
- Apache Axis2 1.7.9
- Apache Log4j 2.17.1
- AWS SDK library 20.10.10 (Certified)
Security
Pentaho software integrates with these third-party security authentication systems:
- Active Directory
- CAS 5.x and CAS 6.5 (Certified)
- Integrated Microsoft Windows Authentication
- LDAP
- RDBMS
Java virtual machine
Pentaho software requirements for Java Runtime Environment (JRE).
Pentaho Software | Certified | Supported |
All Pentaho software |
|
|
Web browsers
Pentaho supports major versions of web browsers that are publicly available six weeks prior to the finalization of a Pentaho release.
Certified Browsers | Supported Browsers |
|
|
*Linux requires libwebkitgtk-1.0. See Use the Pentaho Installation Wizard to Install the PDI Client, Utilities, and Plugins for more information. |