Skip to main content

Pentaho+ documentation is moving!

The new product documentation portal is here. Check it out now at docs.hitachivantara.com

 

Hitachi Vantara Lumada and Pentaho Documentation

Components Reference

Parent article

Pentaho aims to accommodate diverse computing environments. This list provides details about the environment components and versions we support.

If you have questions about your particular computing environment, please contact Pentaho Support.

Server

The Pentaho Server is hardware-independent and runs on server-class computers that comply with these specifications for minimum hardware and required operating systems:

Hardware—64 bitOperating System—64 bit
  • Processor

    Intel EM64T or AMD64 Dual-Core

  • RAM

    8 GB with 4 GB dedicated to Pentaho servers

  • Disk Space

    20 GB free after installation

  • Microsoft Windows 2016 Server
  • CentOS 7 & 8
  • Red Hat Enterprise 7 & 8
  • Ubuntu Server 16.04 LTS & 18.04 LTS
  • SUSE Linux SLES 12

Workstation

These Pentaho design tools are hardware-independent and run on client-class computers that comply with these specifications for minimum hardware and required operation systems.

  • Pentaho Aggregation Designer
  • Pentaho Data Integration
  • Pentaho Metadata Editor
  • Pentaho Report Designer
  • Pentaho Schema Workbench

Hardware—64 bitOperating System—64 bit
  • Processors

    • Apple Macintosh Dual-Core
    • Intel EM64T or AMD64 Dual-Core
  • RAM

    2 GB RAM for most of the design tools, PDIPDI requires 2 GB dedicated

  • Disk Space

    2 GB free after installation

  • Minimum Screen Size

    1280 x 960

  • Microsoft Windows 8 & 10
  • Ubuntu Desktop 16.04 LTS & 18.04 LTS
  • macOS 10.13, 10.14, & 10.15

Embedded software

When embedding Pentaho software into other applications, the computing environment should comply with these specifications for minimum hardware and required operation systems.

  • Embedded Pentaho Reporting
  • Embedded Pentaho Analysis
  • Embedded Pentaho Data Integration
Hardware—64 bitOperating System—64 bit
  • Processors

    Intel EM64T or AMD64 Dual-Core

  • RAM

    8 GB with 4 GB dedicated to Pentaho servers

  • Disk Space

    20 GB free after installation

  • Microsoft Windows 2016 Server
  • CentOS 7 & 8
  • Red Hat Enterprise 7 & 8
  • Ubuntu Server 16.04 LTS & 18.04 LTS
  • SUSE Linux SLES 12

Application servers

Servers to which you deploy Pentaho software must run one of these application servers:

  • JBoss EAP 7.0 with Oracle Java 8.x
  • Tomcat 8.5 (default) with Oracle Java 8.x (as shipped)

Solution database repositories

Pentaho software stores processing artifacts in these database repositories:

CertifiedSupported
PostgreSQL 14PostgreSQL 12.x & 13.x *
MySQL 8.026MySQL 5.7
Oracle 21cOracle 19c & 21c (including patched versions)
MS SQL Server 2019Microsoft SQL Server 2017 & 2019 (including patched versions)

* The default installed solution database.

Hadoop Vendors

Pentaho software supports data sources from these Hadoop Vendors. The Pentaho Adaptive Execution Layer is compatible with Spark 2.3 and 2.4.

VendorDriver VersionAEL Version
Cloudera6.1, 6.2, 6.36.1, 6.2, 6.3
Cloudera Data Platform7.1.4--
Google Dataproc1.41.4
Hortonworks3.0*, 3.13.0, 3.1
EMR5.21*, 5.245.21, 5.24
MapRNo supportNo support
* Indicates that the Pentaho driver contains API updates from the vendor.

Data Sources: General

Pentaho software supports the following data sources. Check this list if you are evaluating Pentaho or checking for general compatibility with a specific vendor.

Data SourceSupported Version
Salesforce47
Amazon Redshift1.2.34.1058
Snowflake3.9.2
Hitachi Content Platform8.0.0.9

Pentaho Tools

This table summarizes which data sources are compatible with the main Pentaho tools.

Pentaho SoftwareData Source
Pentaho Reporting
  • JDBC 3/4*
  • ODBC
  • OLAP4J
  • XML
  • Pentaho Analysis
  • Pentaho Data Integration
  • Pentaho Metadata
  • Scriptable
  • Snowflake
Pentaho Server, Action Sequences
  • Relational (JDBC)
  • Hibernate
  • Javascript
  • Metadata (MQL)
  • Mondrian (MDX)
  • XML (XQuery)
  • Security User/Role List Provider
  • Snowflake
  • Data Integration Steps (PDI)
  • Other Action Sequences
  • Web Services
  • XMLA
Pentaho Data Integration
  • JDBC 3/4*
  • OLAP4J
  • Salesforce
  • Snowflake
  • XML
  • CSV
  • Microsoft Excel
* Use a JDBC 3.x or 4.x compliant driver that is compatible with SQL-92 standards when communicating with relational data sources. For a list of drivers to use with relational JDBC databases, see the JDBC drivers reference.

Big Data Sources: General

Pentaho software supports the following Big Data sources. Check this list if you are evaluating Pentaho or checking for general compatibility with a specific vendor.

Data SourceSupported Version
Amazon EMR (via Hive)5.21, 5.24
Cloudera (via Hive or Impala)6.1, 6.2, 6.3
Cloudera Data Platform (via Hive or Impala)7.1.4
Datastax4.6, 4.8
Google BigQuery1.2.2.1004
Google Dataproc1.4
Greenplum4.2, 4.3
Hortonworks (via Hive or Spark SQL)3.0, 3.1
MongoDB4.0.2
Netezza7.1, 7.2
SAP HANASPS
Teradata14.10, 15.0
Vertica9.3.0.0

Big Data Sources: Details

This table shows the Big Data sources that are compatible with specific Pentaho tools.

Data SourceVersionsAnalyzerPIR/PDDPentaho ReportingDSWPDIServer/ClientPRDPSWPME
Amazon EMR 5.21, 5.24fYesYesNoNoYesYesNoYes
Cloudera6.1, 6.2, 6.3a (for job execution)NoNoNoNoYesYesNoYes
via Impalab (as data source)YesYesYesYesYesYesNoYes
via Hive2c(as data source)NoYesYesYesYesYesNoYes
Cloudera Data Platform 7.1.4 (for job execution)NoNoNoNoYesYesNoYes
via Impalah (as data source)YesYesYesYesYesYesNoYes
via Hive3i (as data source)NoYesYesYesYesYesNoYes
Datastax4.6, 4.8NoNoNoNoYesNoNoNo
Google BigQuery1.2.2.1004eYesYesYesYesYesYesYesYes
Google Dataprocg (for job execution) 1.4NoNoNoNoYesYesNoNo
via Hive2 and Google BigQuery (as data source)YesYesYesYesYesYesNoYes
Greenplum4.2, 4.3YesYesYesYesYesYesYesYes
Hortonworks

3.0, 3.1

(for job execution)

NoNoNoNoYesYesNoYes
via Hive2c(as data source)NoYesYesYesYesYesNoYes

via Spark SQLd

(as data source)

NoNoNoNoYesNoNoNo
MongoDB4.0.2NoNoYesNoYesYesNoNo
Netezza7.1, 7.2YesYesYesYesYesYesYesYes
SAP HANASPSNoNoNoNoYesNoNoNo
Teradata14.10, 15.0YesYesYesYesYesYesYesYes
Vertica9.3.0.0YesYesYesYesYesYesYesYes
Notes: The following drivers are included in the Pentaho distribution for version 9.1: Cloudera Distribution for Hadoop 6.1, Amazon EMR 5.21, and Hortonworks 3.0. Other supported drivers can be downloaded from the Pentaho Customer Support Portal.

a You must have the current version of the Pentaho release to use the CDH 6.1 driver. The CDH 6.1 driver requires the Impala JDBC Connector 2.6.4 Cloudera driver. CDH 6.1 requires Pentaho Service Pack 8.2.0.4 or later. The CDH 6.1 driver works with CDH 6.2 and CDH 6.3.

b As with any data source, the performance of Pentaho Analyzer on Impala will be dependent upon the data shape, Impala’s configuration, and the types of queries. See the Customer Portal best practice article concerning Pentaho Analyzer on Impala for more information.

c Hive2 as a data source for CDH also supports Hive on Spark. Hive2 as a data source for HDP also supports Hive on Tez.

d The Simba Spark SQL driver needs to be downloaded, installed, and configured to be used as a data source for Hortonworks. See our instructions on downloading, installing, and configuring for more information.

e The Simba driver required for Google BigQuery is the JDBC 4.2-compatible version. See https://cloud.google.com/bigquery/partners/simba-drivers/ .

f Use the EMR 5.21 driver for your EMR 5.24 cluster. The EMR 5.21 driver is certified to work for EMR 5.24.

g HBase is not supported with Google Dataproc.

h You must have the current version of the Pentaho release to use the CDP 7.1.4 driver. The CDP 7.1.4 driver requires the Impala JDBC Connector 2.6.4 Cloudera driver.

i Hive3 as a data source for CDP also supports Hive LLAP, and Hive3 on Tez.

SQL dialect-specific

Pentaho software generates dialect-specific SQL when communicating with these data sources. Certified indicates the SQL dialect has been tested for compatibility with Pentaho.

Pentaho SoftwareData Source
Pentaho Analyzer

Certified

  • Amazon Redshift
  • Impala
  • MySql
  • Microsoft SQL Server
  • Oracle
  • PostgreSQL
  • Snowflake

Non-certified

  • Access
  • DB2
  • Derby
  • Firebird
  • Greenplum
  • Hsqldb
  • Infobright
  • Informix
  • Ingres
  • Interbase
  • MonetDB
  • Neoview
  • Netezza
  • SqlStream
  • Sybase
  • Teradata
  • Vectorwise
  • Vertica
  • Other SQL-89 compliant*
Pentaho Metadata

Certified

  • Hive 2
  • Impala
  • MySQL
  • PostgreSQL

Non-certified

  • Amazon Redshift
  • ASSQL
  • DB2
  • Firebird
  • H2
  • Hypersonic
  • Ingres
  • Interbase
  • MS Access
  • MS SQL Server (JTDS Driver)
  • MS SQL Server (Microsoft Driver)
  • Netezza
  • Oracle
  • PostgreSQL
  • Snowflake
  • Sybase
  • Vertica
  • Other SQL-92 compliant*
Pentaho Data Integration

Certified

  • Amazon Redshift
  • Hive
  • Hive 2
  • Impala
  • MS SQL Server (JTDS Driver)
  • MS SQL Server (Microsoft Driver)
  • MySQL
  • Oracle
  • PostgreSQL
  • Snowflake
  • Vertica

Non-certified

  • Apache Derby
  • AS/400
  • InfiniDB
  • Exasol 4
  • Firebird SQL
  • Greenplum
  • H2
  • Hypersonic
  • IBM DB2
  • Infobright
  • Informix
  • Ingres
  • Ingres VectorWise
  • MaxDB (SAP DB)
  • MonetDB
  • Neoview
  • Netezza
  • Oracle RDB
  • SAP HANA
  • SQLite
  • Teradata
  • UniVerse database
  • Other SQL-92 compliant*
* If your data source is not in this list and is compatible with SQL-92, Pentaho software uses a generic SQL dialect.

Security

Pentaho software integrates with these third-party security authentication systems:

  • Active Directory
  • CAS
  • Integrated Microsoft Windows Authentication
  • LDAP
  • RDBMS

Java virtual machine

Pentaho software requirements for Java Runtime Environment (JRE). Certified indicates the JRE is tested and validated for each Pentaho release. Support is available for non-certified JREs.

Pentaho SoftwareCertifiedSupported
All Pentaho software
  • Oracle Java 8 Update 251
  • Oracle OpenJDK 8 Update 252
  • AdoptOpenJDK
  • Zulu from Azul Systems

Web browsers

Pentaho supports major versions of web browsers that are publicly available six weeks prior to the finalization of a Pentaho release, except as noted below.

Pentaho SoftwareWeb Browser

Pentaho User Console (PUC)

(Pentaho recommends 2GB RAM for the web client.)

  • Apple Safari 12 & 13.0.5 (On macOS only)
  • Google Chrome 81.0.4044.113
  • Microsoft Edge 81.0.416.58
  • Microsoft Internet Explorer 11 (Does not render PUC correctly using Compatibility Mode)
  • Mozilla Firefox 75
Pentaho Report Designer
  • Apple Safari 12 & 13.0.5 (On macOS only)
  • Google Chrome 81.0.4044.113
  • Microsoft Edge 81.0.416.58
  • Microsoft Internet Explorer 11
  • Mozilla Firefox 75
Pentaho Data Integration (PDI) client*
  • Apple Safari 12 & 13.0.5 (On macOS only)
  • Google Chrome 81.0.4044.113
  • Microsoft Internet Explorer 11 (Required on Windows for data inspection and other features.)
  • Mozilla Firefox 75
*Linux requires libwebkitgtk-1.0. See Use the Pentaho Installation Wizard to Install the PDI Client, Utilities, and Plugins for more information.