Components Reference
Pentaho aims to accommodate diverse computing environments. This list provides details about the environment components and versions we support.
If you have questions about your particular computing environment, please contact Pentaho Support.
Server
The Pentaho Server is hardware-independent and runs on server-class computers that comply with these specifications for minimum hardware and required operating systems:
Hardware—64 bit | Operating System—64 bit |
|
|
Workstation
These Pentaho design tools are hardware-independent and run on client-class computers that comply with these specifications for minimum hardware and required operation systems.
- Pentaho Aggregation Designer
- Pentaho Data Integration
- Pentaho Metadata Editor
- Pentaho Report Designer
- Pentaho Schema Workbench
Hardware—64 bit | Operating System—64 bit |
|
|
Embedded software
When embedding Pentaho software into other applications, the computing environment should comply with these specifications for minimum hardware and required operation systems.
- Embedded Pentaho Reporting
- Embedded Pentaho Analysis
- Embedded Pentaho Data Integration
Hardware—64 bit | Operating System—64 bit |
|
|
Application servers
Servers to which you deploy Pentaho software must run one of these application servers:
- JBoss EAP 7.0 with Oracle Java 8.x
- Tomcat 8.5 (default) with Oracle Java 8.x (as shipped)
Solution database repositories
Pentaho software stores processing artifacts in these database repositories:
- PostgreSQL 9.6.xx* & 10.0.xx
- MySQL 5.7.xx & 8.0.xx (SQL 92)
- Oracle 11.2, 12.1, & 12.2 (SQL 92) (including patched versions)
- MS SQL Server 2016 & 2017 (including patched versions)
* The default installed solution database.
Hadoop Vendors
Pentaho software supports data sources from these Hadoop Vendors. The Pentaho Adaptive Execution Layer is compatible with Spark 2.3 and 2.4.
Vendor | Driver Version | AEL Version |
Cloudera | 6.1, 6.2, 6.3 | 6.1, 6.2, 6.3 |
Hortonworks | 3.0*, 3.1 | 3.0, 3.1 |
EMR | 5.21*, 5.24 | 5.21, 5.24 |
MapR | No support | No support |
* Indicates that the Pentaho driver contains API updates from the vendor. |
Data Sources: General
Pentaho software supports the following data sources. Check this list if you are evaluating Pentaho or checking for general compatibility with a specific vendor.
Data Source | Supported Version |
Salesforce | 47 |
Amazon Redshift | 1.2.34.1058 |
Snowflake | 3.9.2 |
Hitachi Content Platform | 8.0.0.9 |
Pentaho Tools
This table summarizes which data sources are compatible with the main Pentaho tools.
Pentaho Software | Data Source |
Pentaho Reporting |
|
Pentaho Server, Action Sequences |
|
Pentaho Data Integration |
|
* Use a JDBC 3.x or 4.x compliant driver that is compatible with SQL-92 standards when communicating with relational data sources. For a list of drivers to use with relational JDBC databases, see the JDBC drivers reference. |
Big Data Sources: General
Pentaho software supports the following Big Data sources. Check this list if you are evaluating Pentaho or checking for general compatibility with a specific vendor.
Data Source | Supported Version |
Amazon EMR (via Hive) | 5.21, 5.24 |
Cloudera (via Hive or Impala) | 6.1, 6.2, 6.3 |
Datastax | 4.6, 4.8 |
Google BigQuery | 1.2.2.1004 |
Google Dataproc | 1.4 |
Greenplum | 4.2, 4.3 |
Hortonworks (via Hive or Spark SQL) | 3.0, 3.1 |
MongoDB | 4.0.2 |
Netezza | 7.1, 7.2 |
SAP HANA | SPS |
Teradata | 14.10, 15.0 |
Vertica | 9.3.0.0 |
Big Data Sources: Details
This table shows the Big Data sources that are compatible with specific Pentaho tools.
Data Source | Versions | Analyzer | PIR/PDD | Pentaho Reporting | DSW | PDIServer/Client | PRD | PSW | PME |
Amazon EMR | 5.21, 5.24f | Yes | Yes | No | No | Yes | Yes | No | Yes |
Cloudera | 6.1, 6.2, 6.3a (for job execution) | No | No | No | No | Yes | Yes | No | Yes |
via Impalab (as data source) | Yes | Yes | Yes | Yes | Yes | Yes | No | Yes | |
via Hive2c(as data source) | No | Yes | Yes | Yes | Yes | Yes | No | Yes | |
Datastax | 4.6, 4.8 | No | No | No | No | Yes | No | No | No |
Google BigQuery | 1.2.2.1004e | Yes | Yes | Yes | Yes | Yes | Yes | Yes | Yes |
Google Dataprocg (for job execution) | 1.4 | No | No | No | No | Yes | Yes | No | No |
via Hive2 and Google BigQuery (as data source) | Yes | Yes | Yes | Yes | Yes | Yes | No | Yes | |
Greenplum | 4.2, 4.3 | Yes | Yes | Yes | Yes | Yes | Yes | Yes | Yes |
Hortonworks |
3.0, 3.1 (for job execution) | No | No | No | No | Yes | Yes | No | Yes |
via Hive2c(as data source) | No | Yes | Yes | Yes | Yes | Yes | No | Yes | |
via Spark SQLd (as data source) | No | No | No | No | Yes | No | No | No | |
MongoDB | 4.0.2 | No | No | Yes | No | Yes | Yes | No | No |
Netezza | 7.1, 7.2 | Yes | Yes | Yes | Yes | Yes | Yes | Yes | Yes |
SAP HANA | SPS | No | No | No | No | Yes | No | No | No |
Teradata | 14.10, 15.0 | Yes | Yes | Yes | Yes | Yes | Yes | Yes | Yes |
Vertica | 9.3.0.0 | Yes | Yes | Yes | Yes | Yes | Yes | Yes | Yes |
Notes: The following drivers are included in the Pentaho distribution for version 9.0: Cloudera Distribution for Hadoop 6.1, Amazon EMR 5.21, and Hortonworks 3.0. Other supported drivers can be downloaded from the Pentaho Customer Support Portal. a You must have the current version of the Pentaho release to use the CDH 6.1 driver. The CDH 6.1 driver requires the Impala JDBC Connector 2.6.4 Cloudera driver. CDH 6.1 requires Pentaho Service Pack 8.2.0.4 or later. The CDH 6.1 driver works with CDH 6.2 and CDH 6.3. b As with any data source, the performance of Pentaho Analyzer on Impala will be dependent upon the data shape, Impala’s configuration, and the types of queries. See the Customer Portal best practice article concerning Pentaho Analyzer on Impala for more information. c Hive2 as a data source for CDH also supports Hive on Spark. Hive2 as a data source for HDP also supports Hive on Tez. d The Simba Spark SQL driver needs to be downloaded, installed, and configured to be used as a data source for Hortonworks. See our instructions on downloading, installing, and configuring for more information. e The Simba driver required for Google BigQuery is the JDBC 4.2-compatible version. See https://cloud.google.com/bigquery/partners/simba-drivers/ . f Use the EMR 5.21 driver for your EMR 5.24 cluster. The EMR 5.21 driver is certified to work for EMR 5.24. g HBase is not supported with Google Dataproc. |
SQL dialect-specific
Pentaho software generates dialect-specific SQL when communicating with these data sources. Certified indicates the SQL dialect has been tested for compatibility with Pentaho.
Pentaho Software | Data Source |
Pentaho Analyzer |
Certified
Non-certified
|
Pentaho Metadata |
Certified
Non-certified
|
Pentaho Data Integration |
Certified
Non-certified
|
* If your data source is not in this list and is compatible with SQL-92, Pentaho software uses a generic SQL dialect. |
Security
Pentaho software integrates with these third-party security authentication systems:
- Active Directory
- CAS
- Integrated Microsoft Windows Authentication
- LDAP
- RDBMS
Java virtual machine
Pentaho software requirements for Java Runtime Environment (JRE). Certified indicates the JRE is tested and validated for each Pentaho release. Support is available for non-certified JREs.
Pentaho Software | Certified | Supported |
All Pentaho software |
|
|
Web browsers
Pentaho supports major versions of web browsers that are publicly available six weeks prior to the finalization of a Pentaho release, except as noted below.
Pentaho Software | Web Browser |
Pentaho User Console (PUC) (Pentaho recommends 2GB RAM for the web client.) |
|
Pentaho Report Designer |
|
Pentaho Data Integration (PDI) client* |
|
*Linux requires libwebkitgtk-1.0. See Use the Pentaho Installation Wizard to Install the PDI Client, Utilities, and Plugins for more information. |