Components Reference

 

Pentaho aims to accommodate diverse computing environments. This list provides details about the environment components and versions we support. Where applicable, versions are listed as certified or supported:

  • Certified

    The version has been tested and validated for compatibility with Pentaho.

  • Supported

    Support is available for listed non-certified versions.

If you have questions about your particular computing environment, contact Pentaho Support.

Hitachi Vantara products

 

The following Hitachi Vantara products are certified for Pentaho 9.5:

  • Pentaho Data Catalog 7.7
  • Hitachi Content Platform 9.6

Server

 

The Pentaho Server is hardware-independent and runs on server-class computers that comply with these specifications for minimum hardware and required operating systems:

Hardware (64-bit)

  • Processor: Intel EM64T or AMD64 Dual-Core
  • RAM: 8 GB with 4 GB dedicated to Pentaho servers
  • Disk Space: 20 GB free after installation

Operating system (64-bit)

Certified

  • Microsoft Windows 2022 Server
  • CentOS 8
  • Red Hat Enterprise 8
  • Ubuntu Server 20.04 LTS

Supported

  • Microsoft Windows 2019 Server & 2022 Server
  • CentOS 7 & 8
  • Red Hat Enterprise 7.x & 8.x
  • Ubuntu Server 18.x LTS & 20.x LTS

Container deployment

 

Pentaho supports the following technology for deploying in containers:

Technology | Certified | Supported
Docker | 20.10.10 | 20.x
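
The certified/supported distinction used throughout this page can be checked mechanically. The following is a minimal sketch, not a Pentaho tool: the function name `support_level` and the idea of writing "20.x" as the glob pattern `20.*` are assumptions made for illustration, and the only data used is the Docker row above.

```python
from fnmatch import fnmatch

# Values copied from the container deployment table above.
CERTIFIED = {"Docker": ["20.10.10"]}
# "20.x" is expressed here as a glob pattern (an assumption of this sketch).
SUPPORTED = {"Docker": ["20.*"]}


def support_level(technology: str, version: str) -> str:
    """Return 'certified', 'supported', or 'unlisted' for a version string."""
    if version in CERTIFIED.get(technology, []):
        return "certified"
    patterns = SUPPORTED.get(technology, [])
    if any(fnmatch(version, pattern) for pattern in patterns):
        return "supported"
    return "unlisted"


if __name__ == "__main__":
    for candidate in ("20.10.10", "20.10.21", "24.0.5"):
        print(candidate, support_level("Docker", candidate))
```

An "unlisted" result only means the version does not appear on this page; for such environments, contact Pentaho Support.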

Workstation

 

These Pentaho design tools are hardware-independent and run on client-class computers that comply with these specifications for minimum hardware and required operating systems.

  • Pentaho Aggregation Designer
  • Pentaho Data Integration
  • Pentaho Metadata Editor
  • Pentaho Report Designer
  • Pentaho Schema Workbench

 

Hardware (64-bit)

  • Processors: Apple Macintosh Dual-Core, or Intel EM64T or AMD64 Dual-Core
  • RAM: 2 GB for most of the design tools; PDI requires 2 GB dedicated
  • Disk Space: 2 GB free after installation
  • Minimum Screen Size: 1280 x 960

Operating system (64-bit)

Certified

  • Ubuntu Desktop 20.04
  • Microsoft Windows 10
  • macOS 11 (Big Sur)

Supported

  • Ubuntu Desktop 18.x & 20.x
  • Microsoft Windows 10
  • macOS 10.15 (Catalina) & 11 (Big Sur)

Embedded software

 

When embedding Pentaho software into other applications, the computing environment should comply with these specifications for minimum hardware and required operating systems.

  • Embedded Pentaho Reporting
  • Embedded Pentaho Analysis
  • Embedded Pentaho Data Integration

Hardware (64-bit)

  • Processors: Intel EM64T or AMD64 Dual-Core
  • RAM: 8 GB with 4 GB dedicated to Pentaho servers
  • Disk Space: 20 GB free after installation

Operating system (64-bit)

Certified

  • Microsoft Windows 2022 Server
  • CentOS 8
  • Red Hat Enterprise 8
  • Ubuntu Server 20.04 LTS

Supported

  • Microsoft Windows 2019 Server & 2022 Server
  • CentOS 7 & 8
  • Red Hat Enterprise 7.x & 8.x
  • Ubuntu Server 18.x LTS & 20.x LTS

Application servers

 

Servers to which you deploy Pentaho software must run one of these application servers:

  • JBoss EAP 7.3
  • Tomcat 9.0.xx (shipped with 9.0.50) with Oracle Java 11.x (shipped with 11.0.13)

Solution database repositories

 

Pentaho software stores processing artifacts in these database repositories:

Certified | Supported
PostgreSQL 14 | PostgreSQL 12.x & 13.x *
MySQL 8.0.26 | MySQL 5.7
Oracle 21c | Oracle 19c & 21c (including patched versions)
Microsoft SQL Server 2019 | Microsoft SQL Server 2017 & 2019 (including patched versions)

* The default installed solution database.

Apache Hadoop vendors

 

Pentaho software supports data sources from these Hadoop vendors. The Pentaho Adaptive Execution Layer (AEL) is compatible with Spark 2.3 and 2.4.

Vendor | Driver Version | AEL Version
Cloudera (CDH) | 6.1, 6.2, 6.3 | 6.1, 6.2, 6.3
Cloudera Data Platform (CDP) Private Cloud | 7.1.X | 7.1.X
Google Dataproc | 1.4 | 1.4
Hortonworks (HDP) | 3.0*, 3.1 | 3.0, 3.1
EMR | 5.21*, 5.24 | 5.21, 5.24
Microsoft Azure HDInsight | 4.0 | 4.0

* Indicates that the Pentaho driver contains API updates from the vendor.

Data Sources: General

 

Pentaho software supports the following data sources. Check this list if you are evaluating Pentaho or checking for general compatibility with a specific vendor.

Data Source | Certified | Supported
Salesforce | 54 | 54.x
Amazon Redshift | 1.2.34.1058 | 1.2.34.x
Snowflake | 3.13.7 | 3.13.x
Hitachi Content Platform | 8.0.0.9 | 8.0.0.x

Pentaho Tools

 

This table summarizes which data sources are compatible with the main Pentaho tools.

Pentaho Software Data Source
Pentaho Reporting
  • JDBC 3/4*
  • ODBC
  • OLAP4J
  • XML
  • Pentaho Analysis
  • Pentaho Data Integration
  • Pentaho Metadata
  • Scriptable
  • Snowflake
Pentaho Server, Action Sequences
  • Relational (JDBC)
  • Hibernate
  • Javascript
  • Metadata (MQL)
  • Mondrian (MDX)
  • XML (XQuery)
  • Security User/Role List Provider
  • Snowflake
  • Data Integration Steps (PDI)
  • Other Action Sequences
  • Web Services
  • XMLA
Pentaho Data Integration
  • JDBC 3/4*
  • OLAP4J
  • Salesforce
  • Snowflake
  • XML
  • CSV
  • Microsoft Excel
* Use a JDBC 3.x or 4.x compliant driver that is compatible with SQL-92 standards when communicating with relational data sources. For a list of drivers to use with relational JDBC databases, see the JDBC drivers reference.

Big Data Sources: General

 

Pentaho software supports the following Big Data sources. Check this list if you are evaluating Pentaho or checking for general compatibility with a specific vendor.

Data Source | Supported Version
Amazon EMR (via Hive) | 5.21, 5.24, 5.32
Cloudera (via Hive or Impala) | 6.1, 6.2, 6.3
Cloudera Data Platform (via Hive or Impala) | 7.1.x
Datastax | 4.6, 4.8
Google BigQuery | 1.2.2.1004
Google Dataproc | 1.4, 2.2
Greenplum | 4.2, 4.3
Hortonworks (via Hive or Spark SQL) | 3.0, 3.1
Microsoft Azure HDInsight | 4.0
MongoDB | 4.0.2
Netezza | 7.1, 7.2
SAP HANA | SPS
Teradata | 14.10, 15.0
Vertica | 9.3.0.0

Big Data Sources: Details

 

This table shows the Big Data sources that are compatible with specific Pentaho tools.

Data Source | Versions | Analyzer | PIR/PDD | Pentaho Reporting | DSW | PDI Server/Client | PRD | PSW | PME
Amazon EMR | 5.21, 5.24, 5.32 [f] | Yes | Yes | No | No | Yes | Yes | No | Yes
Cloudera | 6.1, 6.2, 6.3 [a] (for job execution) | No | No | No | No | Yes | Yes | No | Yes
Cloudera | via Impala [b] (as data source) | Yes | Yes | Yes | Yes | Yes | Yes | No | Yes
Cloudera | via Hive2 [c] (as data source) | No | Yes | Yes | Yes | Yes | Yes | No | Yes
Cloudera Data Platform | 7.1.x (for job execution) | No | No | No | No | Yes | Yes | No | Yes
Cloudera Data Platform | via Impala [h] (as data source) | Yes | Yes | Yes | Yes | Yes | Yes | No | Yes
Cloudera Data Platform | via Hive3 [i] (as data source) | No | Yes | Yes | Yes | Yes | Yes | No | Yes
Datastax | 4.6, 4.8 | No | No | No | No | Yes | No | No | No
Google BigQuery | 1.2.2.1004 [e] | Yes | Yes | Yes | Yes | Yes | Yes | Yes | Yes
Google Dataproc [g] | 1.4, 2.2 [j] (for job execution) | No | No | No | No | Yes | Yes | No | No
Google Dataproc [g] | via Hive2 and Google BigQuery (as data source) | Yes | Yes | Yes | Yes | Yes | Yes | No | Yes
Greenplum | 4.2, 4.3 | Yes | Yes | Yes | Yes | Yes | Yes | Yes | Yes
Hortonworks | 3.0, 3.1 (for job execution) | No | No | No | No | Yes | Yes | No | Yes
Hortonworks | via Hive2 [c] (as data source) | No | Yes | Yes | Yes | Yes | Yes | No | Yes
Hortonworks | via Spark SQL [d] (as data source) | No | No | No | No | Yes | No | No | No
Microsoft Azure HDInsight | 4.0 | Yes | Yes | No | No | Yes | No | No | Yes
MongoDB | 4.0.2 | No | No | Yes | No | Yes | Yes | No | No
Netezza | 7.1, 7.2 | Yes | Yes | Yes | Yes | Yes | Yes | Yes | Yes
SAP HANA | SPS | No | No | No | No | Yes | No | No | No
Teradata | 14.10, 15.0 | Yes | Yes | Yes | Yes | Yes | Yes | Yes | Yes
Vertica | 9.3.0.0 | Yes | Yes | Yes | Yes | Yes | Yes | Yes | Yes

Notes: A generic Apache Hadoop driver is included in the Pentaho distribution for version 9.3. Other supported drivers can be downloaded from the Hitachi Vantara Lumada and Pentaho Support Portal.

[a] You must have the current version of the Pentaho release to use the CDH 6.1 driver. The CDH 6.1 driver requires the Impala JDBC Connector 2.6.4 Cloudera driver. CDH 6.1 requires Pentaho Service Pack 8.2.0.4 or later. The CDH 6.1 driver works with CDH 6.2 and CDH 6.3.

[b] As with any data source, the performance of Pentaho Analyzer on Impala depends on the data shape, Impala’s configuration, and the types of queries. See the Customer Portal best practice article concerning Pentaho Analyzer on Impala for more information.

[c] Hive2 as a data source for CDH also supports Hive on Spark. Hive2 as a data source for HDP also supports Hive on Tez.

[d] The Simba Spark SQL driver must be downloaded, installed, and configured before it can be used as a data source for Hortonworks. See our Install Pentaho Data Integration and Analytics document for more information.

[e] The Simba driver required for Google BigQuery is the JDBC 4.2-compatible version. See https://cloud.google.com/bigquery/partners/simba-drivers/ .

[f] Use the EMR 5.21 driver for your EMR 5.24 or EMR 5.32 cluster. The EMR 5.21 driver is certified to work for EMR 5.24 and EMR 5.32.

[g] HBase is not supported with Google Dataproc.

[h] You must have the current version of the Pentaho release to use the CDP 7.1.4 driver. The CDP 7.1.4 driver requires the Impala JDBC Connector 2.6.4 Cloudera driver.

[i] Hive3 as a data source for CDP also supports Hive LLAP and Hive3 on Tez.

[j] Use the Google Dataproc 1.8 driver for your Google Dataproc 2.2 cluster. The Google Dataproc 1.8 driver is certified to work for Google Dataproc 2.2.
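
Notes a, f, and j each say that a driver built for one cluster version should be used with a newer cluster version. As a rough illustration, those statements can be written as a lookup table. This is only a sketch: the names `DRIVER_FOR_CLUSTER` and `driver_for` are hypothetical, the short vendor keys are chosen for this example, and the table contains nothing beyond what the three notes state.

```python
from typing import Optional

# (vendor, cluster version) -> driver version, transcribed from notes a, f, and j.
DRIVER_FOR_CLUSTER = {
    ("CDH", "6.1"): "6.1",
    ("CDH", "6.2"): "6.1",               # note a
    ("CDH", "6.3"): "6.1",               # note a
    ("EMR", "5.21"): "5.21",
    ("EMR", "5.24"): "5.21",             # note f
    ("EMR", "5.32"): "5.21",             # note f
    ("Google Dataproc", "2.2"): "1.8",   # note j
}


def driver_for(vendor: str, cluster_version: str) -> Optional[str]:
    """Return the driver version the notes name, or None if they are silent."""
    return DRIVER_FOR_CLUSTER.get((vendor, cluster_version))


if __name__ == "__main__":
    print(driver_for("EMR", "5.32"))
    print(driver_for("CDH", "6.3"))
    print(driver_for("EMR", "6.0"))
```

A `None` result means the notes do not cover that combination, not that the combination is unsupported.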

SQL dialect-specific

 

Pentaho software generates dialect-specific SQL when communicating with these data sources. Certified indicates the SQL dialect has been tested for compatibility with Pentaho.

Pentaho Software Data Source
Pentaho Analyzer

Certified

  • Amazon Redshift
  • Azure SQL
  • Impala
  • MySQL
  • Microsoft SQL Server
  • Oracle
  • PostgreSQL
  • Snowflake

Non-certified

  • Access
  • Derby
  • Firebird
  • Greenplum
  • Hsqldb
  • IBM DB2
  • IBM MQ 9.2
  • Infobright
  • Informix
  • Ingres
  • Interbase
  • MonetDB
  • Neoview
  • Netezza
  • SqlStream
  • Sybase
  • Teradata
  • Vectorwise
  • Vertica
  • Other SQL-92 compliant*
Pentaho Metadata

Certified

  • Azure SQL
  • Hive 2
  • Impala
  • MySQL
  • PostgreSQL

Non-certified

  • Amazon Redshift
  • ASSQL
  • Firebird
  • H2
  • Hypersonic
  • IBM DB2
  • IBM MQ 9.2
  • Ingres
  • Interbase
  • MS Access
  • MS SQL Server (JTDS Driver)
  • MS SQL Server (Microsoft Driver)
  • Netezza
  • Oracle
  • PostgreSQL
  • Snowflake
  • Sybase
  • Vertica
  • Other SQL-92 compliant*
Pentaho Data Integration

Certified

  • Amazon Redshift
  • Azure SQL
  • Hive
  • Hive 2
  • Impala
  • MS SQL Server (JTDS Driver)
  • MS SQL Server (Microsoft Driver)
  • MySQL
  • Oracle
  • PostgreSQL
  • Snowflake
  • Vertica

Non-certified

  • Apache Derby
  • AS/400
  • InfiniDB
  • Exasol 4
  • Firebird SQL
  • Greenplum
  • H2
  • Hypersonic
  • IBM DB2
  • IBM MQ 9.2
  • Infobright
  • Informix
  • Ingres
  • Ingres VectorWise
  • MaxDB (SAP DB)
  • MonetDB
  • Neoview
  • Netezza
  • Oracle RDB
  • SAP HANA
  • SQLite
  • Teradata
  • UniVerse database
  • Other SQL-92 compliant*
* If your data source is not in this list and is compatible with SQL-92, Pentaho software uses a generic SQL dialect.

Third-party libraries

 

Pentaho software is compatible with the following third-party web framework, file system, engine, and utility libraries:

  • AngularJS 1.7.8
  • HTTPClient 4.5.9
  • Apache VFS 2.3
  • Apache Axis2 1.7.9
  • Apache Log4j 2.17.1

Security

 

Pentaho software integrates with these third-party security authentication systems:

  • Active Directory
  • CAS 5.x
  • Integrated Microsoft Windows Authentication
  • LDAP
  • RDBMS

Java virtual machine

 

Pentaho software has the following Java Runtime Environment (JRE) requirements:

All Pentaho software

Certified

  • Oracle Java 11.0.13
  • Oracle OpenJDK 11.0.13

Supported

  • Oracle Java 8.x & 11.x
  • Oracle OpenJDK 8.x & 11.x
  • AdoptOpenJDK
  • Zulu from Azul Systems

 

Note: Some Hadoop clusters using Java 8 may not be fully compatible when running Pentaho with Java 11. See Compatibility issues running Pentaho on Java 11 with your Hadoop cluster for details.
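
To compare an installed Java version against the versions listed above, the major version has to be read from the version string, which uses the legacy form `1.8.0_311` for Java 8 and the form `11.0.13` for Java 11. The sketch below shows one way to do that. The helper names are hypothetical, and it deliberately ignores the vendor (Oracle, OpenJDK, AdoptOpenJDK, Zulu), which this page also lists.

```python
import re

# Values taken from the Java virtual machine table above.
CERTIFIED_VERSION = "11.0.13"
SUPPORTED_MAJORS = {8, 11}


def java_major(version: str) -> int:
    """Extract the major version from strings such as '1.8.0_311' or '11.0.13'."""
    parts = re.split(r"[._+-]", version)
    # Java 8 and earlier report themselves as 1.<major>.
    if parts[0] == "1" and len(parts) > 1:
        return int(parts[1])
    return int(parts[0])


def java_support_level(version: str) -> str:
    """Classify a version string; the vendor is not checked in this sketch."""
    if version == CERTIFIED_VERSION:
        return "certified"
    if java_major(version) in SUPPORTED_MAJORS:
        return "supported"
    return "unlisted"


if __name__ == "__main__":
    for candidate in ("11.0.13", "1.8.0_311", "11.0.20", "17.0.2"):
        print(candidate, java_support_level(candidate))
```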

Web browsers

 

Pentaho supports major versions of web browsers that are publicly available six weeks prior to the finalization of a Pentaho release.

Pentaho User Console (PUC)

(Pentaho recommends 2 GB RAM for the web client.)

Certified browsers

  • Apple Safari 15.3 (On macOS only)
  • Google Chrome 100.0.4896.127
  • Microsoft Edge 101.0.1210.32
  • Microsoft Internet Explorer 11 (Does not render PUC correctly using Compatibility Mode)
  • Mozilla Firefox 98.0.2

Supported browsers

  • Apple Safari 15 (On macOS only) and later
  • Google Chrome 100 and later
  • Microsoft Edge 101 and later
  • Microsoft Internet Explorer 11 and later (Does not render PUC correctly using Compatibility Mode)
  • Mozilla Firefox 98 and later

Pentaho Report Designer

Certified browsers

  • Apple Safari 15.3 (On macOS only)
  • Google Chrome 100.0.4896.127
  • Microsoft Edge 101.0.1210.32
  • Microsoft Internet Explorer 11 (Does not render PUC correctly using Compatibility Mode)
  • Mozilla Firefox 98.0.2

Supported browsers

  • Apple Safari 15 (On macOS only) and later
  • Google Chrome 100 and later
  • Microsoft Edge 101 and later
  • Microsoft Internet Explorer 11 and later (Does not render PUC correctly using Compatibility Mode)
  • Mozilla Firefox 98 and later

Pentaho Data Integration (PDI) client*

Certified browsers

  • Apple Safari 15.3 (On macOS only)
  • Google Chrome 100.0.4896.127
  • Microsoft Edge 101.0.1210.32
  • Microsoft Internet Explorer 11 (Does not render PUC correctly using Compatibility Mode)
  • Mozilla Firefox 98.0.2

Supported browsers

  • Apple Safari 15.3 (On macOS only) and later
  • Google Chrome 100 and later
  • Microsoft Edge 101 and later
  • Microsoft Internet Explorer 11 and later (Does not render PUC correctly using Compatibility Mode)
  • Mozilla Firefox 98 and later

*Linux requires libwebkitgtk-1.0. See Use the Pentaho Installation Wizard to Install the PDI Client, Utilities, and Plugins for more information.
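
The six-week rule stated at the top of this section is simple date arithmetic. The sketch below works it through for a made-up finalization date. The date and the function names are hypothetical, since this page does not give actual Pentaho release finalization dates.

```python
from datetime import date, timedelta

# "Six weeks prior to the finalization of a Pentaho release."
SUPPORT_WINDOW = timedelta(weeks=6)


def browser_cutoff(finalization: date) -> date:
    """Latest browser release date that falls under the six-week rule."""
    return finalization - SUPPORT_WINDOW


def is_covered(browser_release: date, finalization: date) -> bool:
    """True if the browser version was publicly available by the cutoff."""
    return browser_release <= browser_cutoff(finalization)


if __name__ == "__main__":
    # Hypothetical finalization date, used only to show the calculation.
    finalization = date(2022, 6, 1)
    print(browser_cutoff(finalization))
    print(is_covered(date(2022, 3, 29), finalization))
    print(is_covered(date(2022, 5, 10), finalization))
```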