Components Reference
Pentaho aims to accommodate diverse computing environments. This list provides details about the environment components and versions we support. Where applicable, versions are listed as certified or supported:
- Certified: The version has been tested and validated for compatibility with Pentaho.
- Supported: Support is available for listed non-certified versions.
If you have questions about your particular computing environment, contact Pentaho Support.
Hitachi Vantara products
The following Hitachi Vantara products are certified for Pentaho 9.5:
- Pentaho Data Catalog 7.7
- Hitachi Content Platform 9.6
Server
The Pentaho Server is hardware-independent and runs on server-class computers that comply with these specifications for minimum hardware and required operating systems:
Hardware—64 bit | Operating System—64 bit | |
Certified | Supported | |
|
|
|
Container deployment
Pentaho supports the following technology for deploying Pentaho in containers:
Technology | Certified | Supported |
Docker | 20.10.10 | 20.x |
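As an illustration of how the Certified and Supported columns might be read, the following sketch classifies a Docker version string against the row above. The function name `classify_docker_version` is hypothetical, and reading `20.x` as "any release whose major version is 20" is an assumption about the notation rather than a statement from Pentaho.

```python
CERTIFIED_VERSION = "20.10.10"  # Certified column in the table above
SUPPORTED_MAJOR = 20            # assumption: "20.x" means any 20.* release


def classify_docker_version(version: str) -> str:
    """Return 'certified', 'supported', or 'unlisted' for a Docker version string."""
    version = version.strip()
    if version == CERTIFIED_VERSION:
        return "certified"
    major = version.split(".", 1)[0]
    if major.isdigit() and int(major) == SUPPORTED_MAJOR:
        return "supported"
    return "unlisted"
```

A version that comes back as `unlisted` is simply absent from the table; whether it works is a question for Pentaho Support.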
Workstation
These Pentaho design tools are hardware-independent and run on client-class computers that comply with these specifications for minimum hardware and required operating systems:
- Pentaho Aggregation Designer
- Pentaho Data Integration
- Pentaho Metadata Editor
- Pentaho Report Designer
- Pentaho Schema Workbench
Hardware—64 bit | Operating System—64 bit | |
Certified | Supported | |
|
|
|
Embedded software
When embedding Pentaho software into other applications, the computing environment should comply with these specifications for minimum hardware and required operating systems:
- Embedded Pentaho Reporting
- Embedded Pentaho Analysis
- Embedded Pentaho Data Integration
Hardware—64 bit | Operating System—64 bit | |
Certified | Supported | |
|
|
|
Application servers
Servers to which you deploy Pentaho software must run one of these application servers:
- JBoss EAP 7.3
- Tomcat 9.0.74 (Certified)
Solution database repositories
Pentaho software stores processing artifacts in these database repositories:
Certified | Supported |
PostgreSQL 14 | PostgreSQL 12.x & 13.x * |
MySQL 8.0.26 | MySQL 5.7 |
Oracle 21c | Oracle 19c & 21c (including patched versions) |
MS SQL Server 2019 | Microsoft SQL Server 2017 & 2019 (including patched versions) |
* The default installed solution database.
Apache Hadoop vendors
Pentaho software has certified or supported data sources from these Hadoop vendors:
Vendor | Driver Version |
Cloudera Data Platform (CDP) Private Cloud | 7.1.x |
Cloudera Data Platform (CDP) Public Cloud | 7.2 |
Google Dataproc | 2.1 |
EMR | 5.36 |
Microsoft Azure HDInsight | 4.0 |
Data Sources: General
Pentaho software supports the following data sources. Check this list if you are evaluating Pentaho or checking for general compatibility with a specific vendor.
Data Source | Certified | Supported |
Salesforce | 54 | 54.x |
Amazon Redshift | 1.2.34.1058 | 2.1 |
Snowflake | 3.13.30 | 3.13.10 |
Pentaho Tools
This table summarizes which data sources are compatible with the main Pentaho tools.
Pentaho Software | Data Source |
Pentaho Reporting |
|
Pentaho Server, Action Sequences |
|
Pentaho Data Integration |
|
* Use a JDBC 3.x or 4.x compliant driver that is compatible with SQL-92 standards when communicating with relational data sources. For a list of drivers to use with relational JDBC databases, see the JDBC drivers reference.
Big Data Sources: General
Pentaho software supports the following Big Data sources. Check this list if you are evaluating Pentaho or checking for general compatibility with a specific vendor.
Data Source | Supported Version |
Amazon EMR (via Hive) | 5.21, 5.24, and 5.36 (Certified) |
Cloudera Data Platform (via Hive or Impala) | 7.1.x |
Cloudera Data Platform (Public cloud) | 7.2 (Certified) |
Datastax | 6.7 |
Google BigQuery | 1.2.25 |
Google Dataproc | 2.1 |
Greenplum | 4.2, 4.3 |
Microsoft Azure HDInsight | 4.0 |
MongoDB | 4.4, 4.4.6 |
Netezza | 7.1, 7.2 |
SAP HANA | SPS |
Teradata | 16.20 |
Vertica | 11 |
Big Data Sources: Details
This table shows the Big Data sources that are compatible with specific Pentaho tools.
Data Source | Versions | Analyzer | PIR/PDD | Pentaho Reporting | DSW | PDI Server/Client | PRD | PSW | PME |
Amazon EMR [a] | 5.21, 5.24, 5.36 | Yes | Yes | No | No | Yes | Yes | No | Yes |
Cloudera Data Platform (CDP) Private Cloud | 7.1.7 (for job execution) | No | No | No | No | Yes | Yes | No | Yes |
Cloudera Data Platform (CDP) Private Cloud | via Impala [b] (as data source) | Yes | Yes | Yes | Yes | Yes | Yes | No | Yes |
Cloudera Data Platform (CDP) Private Cloud | via Hive3 [c] (as data source) | No | Yes | Yes | Yes | Yes | Yes | No | Yes |
Datastax | 4.6, 4.8 | No | No | No | No | Yes | No | No | No |
Google BigQuery | 1.2.25 [d] | Yes | Yes | Yes | Yes | Yes | Yes | Yes | Yes |
Google Dataproc [e] (for job execution) | 1.4, 2.2 [f] | No | No | No | No | Yes | Yes | No | No |
via Hive2 and Google BigQuery (as data source) | | Yes | Yes | Yes | Yes | Yes | Yes | No | Yes |
Greenplum | 4.2, 4.3 | Yes | Yes | Yes | Yes | Yes | Yes | Yes | Yes |
Microsoft Azure HDInsight | 4.0 | Yes | Yes | No | No | Yes | No | No | Yes |
MongoDB | 4.4 | No | No | Yes | No | Yes | Yes | No | No |
Netezza | 7.1, 7.2 | Yes | Yes | Yes | Yes | Yes | Yes | Yes | Yes |
SAP HANA | SPS | No | No | No | No | Yes | No | No | No |
Teradata | 16.20 | Yes | Yes | Yes | Yes | Yes | Yes | Yes | Yes |
Vertica | 10 & 11 | Yes | Yes | Yes | Yes | Yes | Yes | Yes | Yes |
Notes: A generic Apache Hadoop driver is included in the Pentaho distribution for version 9.5. Other supported drivers can be downloaded from the Support Portal.
[a] Use the EMR 5.21 driver for your EMR 5.24 or EMR 5.36 cluster. The EMR 5.21 driver is certified to work for EMR 5.24.
[b] You must have the current version of the Pentaho release to use the CDP 7.1.4 driver. The CDP 7.1.4 driver requires the Impala JDBC Connector 2.6.4 Cloudera driver.
[c] Hive3 as a data source for CDP also supports Hive LLAP and Hive3 on Tez.
[d] The Simba driver required for Google BigQuery is the JDBC 4.2-compatible version. See https://cloud.google.com/bigquery/partners/simba-drivers/ .
[e] HBase is not supported with Google Dataproc.
[f] Use the Google Dataproc 1.8 driver for your Google Dataproc 2.2 cluster. The Google Dataproc 1.8 driver is certified to work for Google Dataproc 2.2.
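The driver substitutions described in notes a and f can be captured in a small lookup, sketched below. The names `DRIVER_FOR_CLUSTER` and `driver_version` are hypothetical. Falling back to the cluster's own version when no note applies is an assumption made for illustration, not documented Pentaho behavior.

```python
# (vendor, cluster version) -> driver version, taken from notes a and f above.
DRIVER_FOR_CLUSTER = {
    ("EMR", "5.24"): "5.21",
    ("EMR", "5.36"): "5.21",
    ("Google Dataproc", "2.2"): "1.8",
}


def driver_version(vendor: str, cluster_version: str) -> str:
    """Return the driver version to use for a cluster.

    Assumption: when no note lists a substitution, the driver version
    matches the cluster version.
    """
    return DRIVER_FOR_CLUSTER.get((vendor, cluster_version), cluster_version)
```

For example, `driver_version("EMR", "5.36")` returns `"5.21"`, matching note a.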
SQL dialect-specific
Pentaho software generates dialect-specific SQL when communicating with these data sources. Certified indicates the SQL dialect has been tested for compatibility with Pentaho.
Pentaho Software | Data Source |
Pentaho Analyzer |
Certified
Supported
|
Pentaho Metadata |
Certified
Supported
|
Pentaho Data Integration |
Certified
Supported
|
* If your data source is not in this list and is compatible with SQL-92, Pentaho software uses a generic SQL dialect.
Third-party libraries
Pentaho software is compatible with the following third-party web framework, file system, engine, and utility libraries:
- AngularJS 1.8.0 (Supported)
- Apache Axis 2 1.7.9
- Apache Kafka Client 3.4
- Apache Log4j 2.17.1
- Apache VFS 2.7.0
- HTTPClient 4.5.9
- JackRabbit 2.16.x
Security
Pentaho software integrates with these third-party security authentication systems:
- Active Directory
- CAS 5.x and CAS 6.5 (Certified)
- Integrated Microsoft Windows Authentication
- LDAP
- RDBMS
Java virtual machine
Pentaho software has the following Java Runtime Environment (JRE) requirements:
Pentaho Software | Certified | Supported |
All Pentaho software |
|
|
- Some Hadoop clusters using Java 8 may not be fully compatible when running Pentaho with Java 11.
- The PDI client requires Java 11.x to run on Windows 11.
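To check an installed runtime against the Java 11.x requirement for the PDI client on Windows 11, the feature (major) number can be extracted from a Java version string. The sketch below is a minimal illustration; the function names are hypothetical. It relies on the legacy `1.x` scheme used up to Java 8 (for example `1.8.0_372`) and the newer scheme in which the first component is the feature number (for example `11.0.19`).

```python
def java_feature_version(version: str) -> int:
    """Extract the feature (major) number from a Java version string."""
    parts = version.strip().split(".")
    if parts[0] == "1" and len(parts) > 1:
        # Legacy scheme, e.g. "1.8.0_372" means Java 8.
        return int(parts[1])
    # Newer scheme, e.g. "11.0.19", "17", "21-ea", "17+35".
    return int(parts[0].split("-")[0].split("+")[0])


def pdi_client_ok_on_windows_11(version: str) -> bool:
    """True when the runtime is a Java 11.x release, per the note above."""
    return java_feature_version(version) == 11
```

The version string itself can be read from the first line of `java -version` output or from the `java.version` system property.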
Web browsers
Pentaho supports major versions of web browsers that are publicly available six weeks prior to the finalization of a Pentaho release.
Certified Browsers | Supported Browsers |
|
|
* Linux requires libwebkitgtk-1.0. See Use the Pentaho Installation Wizard to Install the PDI Client, Utilities, and Plugins for more information.
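The six-week rule stated at the start of this section amounts to simple date arithmetic, sketched below. The function name and the dates in the usage note are hypothetical. Reading "six weeks prior" as "released at least six weeks before finalization" is an assumption about the wording.

```python
from datetime import date, timedelta

SUPPORT_WINDOW = timedelta(weeks=6)  # from the six-week rule in this section


def browser_version_in_scope(browser_release: date, pentaho_finalized: date) -> bool:
    """True when the browser version was public at least six weeks
    before the Pentaho release was finalized (assumed interpretation)."""
    return browser_release <= pentaho_finalized - SUPPORT_WINDOW
```

With a hypothetical finalization date of `date(2023, 6, 1)`, the cutoff falls on `date(2023, 4, 20)`. A browser version released on that day is in scope, and one released a day later is not.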