Extracting Data from a Hadoop Cluster
These tutorials explain how to extract data from a Hadoop cluster using HDFS, Hive, and HBase.
- Extracting Data from HDFS to Load an RDBMS: How to use a PDI transformation to extract data from HDFS and load it into an RDBMS table.
- Extracting Data from Hive to Load an RDBMS: How to use a PDI transformation to extract data from Hive and load it into an RDBMS table.
- Extracting Data from HBase to Load an RDBMS: How to use a PDI transformation to extract data from HBase and load it into an RDBMS table.
- Extracting Data from Snappy Compressed Files: How to configure PDI on the client side so that files compressed with the Snappy codec can be decompressed by the Hadoop file input or Text file input step.