Skip to main content

Pentaho+ documentation has moved!

The new product documentation portal is here. Check it out now at


Hitachi Vantara Lumada and Pentaho Documentation

HBase Input

Parent article

This step reads data from an HBase table according to user-defined column metadata. HBase is a distributed, column-oriented database that provides random read and write access to the Hadoop File System. HBase stores all data as raw bytes without any associated metadata. A mapping provides metadata that allows the step to decode the binary values properly.

Select an engine

You can run the HBase Input step on the Pentaho engine or on the Spark engine. Depending on your selected engine, the transformation runs differently. Select one of the following options to view how to set up the HBase Input step for your selected engine.

For instructions on selecting an engine for your transformation, see Run configurations.