Components Reference
Pentaho aims to accommodate diverse computing environments. This list provides details about the environment components and versions we support. If you have questions about your particular computing environment, please contact Pentaho Support.
Server
The Pentaho server is hardware-independent and runs on server-class computers that comply with these specifications for minimum hardware and required operating systems:
Hardware—64 bit | Operating System—64 bit |
---|---|
Processor: Intel EM64T or AMD64 Dual-Core | |
RAM: 8 GB with 4 GB dedicated to Pentaho servers | |
Disk Space: 20 GB free after installation | |
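The hardware minimums above can be checked before installation. The following is a minimal sketch in Python, not a Pentaho tool: the function names and constants are hypothetical, the RAM figure is passed in by the caller because the standard library has no portable way to read it, and `os.cpu_count()` reports logical CPUs rather than physical cores.

```python
import os
import platform
import shutil

# Minimums taken from the server table; the constant names are illustrative.
MIN_CORES = 2           # "Dual-Core"
MIN_RAM_GB = 8          # 8 GB total, 4 GB of it dedicated to Pentaho servers
MIN_FREE_DISK_GB = 20   # free space remaining after installation


def check_server_host(is_64bit, cores, ram_gb, free_disk_gb):
    """Return a list of problems; an empty list means the host meets the minimums."""
    problems = []
    if not is_64bit:
        problems.append("64-bit hardware and operating system required")
    if cores < MIN_CORES:
        problems.append(f"need at least {MIN_CORES} cores, found {cores}")
    if ram_gb < MIN_RAM_GB:
        problems.append(f"need at least {MIN_RAM_GB} GB RAM, found {ram_gb}")
    if free_disk_gb < MIN_FREE_DISK_GB:
        problems.append(
            f"need at least {MIN_FREE_DISK_GB} GB free disk, found {free_disk_gb:.1f}"
        )
    return problems


def probe_local_host(install_path="."):
    """Collect the values the standard library can report portably."""
    # platform.machine() returns "x86_64" on Linux and "AMD64" on Windows.
    is_64bit = platform.machine().lower() in {"x86_64", "amd64"}
    cores = os.cpu_count() or 0  # logical CPUs, an upper bound on physical cores
    free_disk_gb = shutil.disk_usage(install_path).free / 1024**3
    return is_64bit, cores, free_disk_gb
```

A caller would combine the two, for example `check_server_host(is_64bit, cores, ram_gb=16, free_disk_gb=free)`, supplying the RAM value from the host's own inventory.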
Workstation
These Pentaho design tools are hardware-independent and run on client-class computers that comply with these specifications for minimum hardware and required operating systems.
- Pentaho Aggregation Designer
- Pentaho Data Integration
- Pentaho Metadata Editor
- Pentaho Report Designer
- Pentaho Schema Workbench
Hardware—64 bit | Operating System—64 bit |
---|---|
Processors: | |
Disk Space: 2 GB free after installation | |
Minimum Screen Size: 1280 x 960 | |
Embedded Software
When embedding Pentaho software into other applications, the computing environment should comply with these specifications for minimum hardware and required operating systems.
- Embedded Pentaho Reporting
- Embedded Pentaho Analysis
- Embedded Pentaho Data Integration
Hardware—64 bit | Operating System—64 bit |
---|---|
Processors: Intel EM64T or AMD64 Dual-Core | |
RAM: 8 GB with 4 GB dedicated to Pentaho servers | |
Disk Space: 20 GB free after installation | |
Application Servers
Servers to which you deploy Pentaho software must run one of these application servers:
- JBoss EAP 7.0 with Oracle Java 8.x
- Tomcat 8.0 & 8.5 (default) with Oracle Java 8.x (as shipped)
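Both application servers above are listed with Java 8.x, so it can help to confirm the Java version on the host before deploying. The sketch below is an illustration only: the function names are hypothetical, it checks the major version and not the vendor, and it relies on `java -version` writing its banner to stderr, with Java 8 and earlier reporting strings such as `1.8.0_171` and later releases reporting strings such as `11.0.2`.

```python
import re
import subprocess


def parse_java_major(version_output):
    """Extract the major Java version from `java -version` output, or None."""
    match = re.search(r'version "(\d+)(?:\.(\d+))?', version_output)
    if match is None:
        return None
    first = int(match.group(1))
    second = match.group(2)
    # Legacy scheme: "1.8.0_171" means Java 8.
    if first == 1 and second is not None:
        return int(second)
    # Modern scheme: "11.0.2" or "17" means Java 11 or 17.
    return first


def installed_java_major(java_cmd="java"):
    """Run `java -version` and return the major version, or None if unavailable."""
    try:
        result = subprocess.run(
            [java_cmd, "-version"], capture_output=True, text=True
        )
    except FileNotFoundError:
        return None  # no java executable on PATH
    # The version banner is written to stderr, not stdout.
    return parse_java_major(result.stderr)
```

A deployment script could then compare `installed_java_major()` with `8` and stop early if the values differ.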
Solution Database Repositories
Pentaho software stores processing artifacts in these database repositories:
- MySQL 5.6 & 5.7 (SQL 92)
- Oracle 11.2 & 12.1 (SQL 92)
- PostgreSQL 9.5 & 9.6*
- MS SQL Server 2014, 2016
*Default installed solution database
Data Sources
Pentaho software connects to these relational and non-relational data sources.
Pentaho Software | Data Source |
---|---|
Pentaho Reporting | |
Pentaho Server, Action Sequences | |
Pentaho Data Integration | |
** Use a JDBC 3.x or 4.x compliant driver that is compatible with SQL-92 standards when communicating with relational data sources. For your convenience, we provide a list of drivers used to get data from relational JDBC databases.
Big Data Sources
Pentaho software connects to these big data sources.
Data Source | Versions |
---|---|
Amazon EMR | 5.9, 5.11* |
Cloudera | 5.12, 5.13, 5.14** (for job execution); via Impala*** (as data source); via Hive2**** (as data source) |
Datastax | 4.6, 4.8 |
Google BigQuery | 1.1.5.1005******* |
Greenplum | 4.2, 4.3 |
Hortonworks | 2.5, 2.6 (for job execution); via Hive2**** (as data source); via Spark SQL***** (as data source) |
MapR | 5.2, 6.0****** (for job execution); via Hive2 (as data source) |
Microsoft Azure HDInsight | 3.5 |
MongoDB | 3.4, 3.6 |
Netezza | 7.1, 7.2 |
SAP HANA | SPS |
Teradata | 14.10, 15.0 |
Vertica | 7.1, 7.2 |
[Per-product support icons for Analyzer, PIR/PDD, PR, DSW, PDI Server/Spoon, PRD, PSW, and PME are not reproduced.]
The following shims are included in the Pentaho Big Data plugin for Pentaho Version 8.1: Cloudera Distribution for Hadoop 5.13, Amazon EMR 5.11, Hortonworks 2.6, and MapR 6.0. Other supported shims can be downloaded from the Pentaho Customer Support Portal.
* EMR 5.9 shims do not support Impala. Due to an existing Amazon EMR issue, the EMR 5.9 shims do not support S3 Block Storage type as output for PMR jobs. Support for S3A is provided with EMR 5.9.
** The CDH 5.12, CDH 5.13, and CDH 5.14 shims have been certified with the Impala JDBC Connector 2.5.43 Cloudera Simba driver. You must have the current version of the Pentaho release to use either the CDH 5.12 or CDH 5.13 shim.
*** As with any data source, the performance of Pentaho Analyzer on Impala depends on the data shape, Impala’s configuration, and the types of queries. See the Customer Portal best practice article concerning Pentaho Analyzer on Impala for more information.
**** Hive2 as a data source for CDH also supports Hive on Spark. Hive2 as a data source for HDP also supports Hive on Tez.
***** The Simba Spark SQL driver needs to be downloaded, installed, and configured to be used as a data source for Hortonworks. See our instructions on downloading, installing, and configuring for more information.
****** MapR client 6.0 does not support HBase; it supports only MapR-DB, which requires an additional setting. See the MapR Mapping to HBase Table Namespaces documentation for more information on this setting. Due to MapR limitations, HBase comparators are not supported.
******* The Simba driver required for Google BigQuery is the JDBC 4.2-compatible version. See https://cloud.google.com/bigquery/partners/simba-drivers/ .
SQL Dialect-Specific
Pentaho software generates dialect-specific SQL when communicating with these data sources.
Pentaho Software | Data Source |
---|---|
Pentaho Analysis | |
Pentaho Metadata | |
Pentaho Data Integration | |
**** If your data source is not in this list and is compatible with SQL-92, Pentaho software uses a generic SQL dialect.
Security
Pentaho software integrates with these third-party security authentication systems:
- Active Directory
- CAS
- Integrated Microsoft Windows Authentication
- LDAP
- RDBMS
Java Virtual Machine
Pentaho software has these Java Runtime Environment (JRE) requirements.
Pentaho Software | Java Runtime Environment |
---|---|
All Pentaho software | |
Web Browsers
Pentaho supports major versions of Web browsers that are publicly available six weeks prior to the finalization of a Pentaho release, except as noted below.
Pentaho Software | Web Browser |
---|---|
Pentaho User Console (PUC); Pentaho recommends 2 GB RAM for the web client | |
Pentaho Report Designer | |
Pentaho Data Integration (PDI) client***** | |
***** Linux requires libwebkitgtk-1.0.