Skip to main content

Pentaho+ documentation has moved!

The new product documentation portal is here. Check it out now at docs.hitachivantara.com

 

Hitachi Vantara Lumada and Pentaho Documentation

Options

The Bulk load into Azure SQL DB entry includes four tabs to define the data input source, the output database and location, options, and advanced options for loading PDI data into an Azure SQL database. Each tab is described below.

Input tab

Bulk load into Azure SQL DB Input tab
OptionDescription
SourceSpecify the VFS URL of the data to import. The URL can point to a file or a folder in Azure Data Lake Storage. The URL must begin with pvfs:// and must specify the bucket and object you want to load. See Pentaho address to a VFS connection .
What file type is your source?Select the file type of the input source. Must be a CSV file.

Delimited text settings

Specify the following settings for a delimited text file:

  • Delimiter

    Specify the character used to separate a data field. The default is semicolon (;).

  • First line is column names

    Select Yes if the first line in the file contains the column names.

Output tab

Bulk load into Azure SQL DB Output tab
OptionDescription
Database connection

Select your database connection from a list of existing Azure SQL DB connections.

If you do not have an existing connection, click New. If you need to modify an existing connection, click Edit.

See Set up Pentaho to connect to an Azure SQL database.

SchemaSelect the schema to use for the bulk load. The Bulk load into Azure SQL DB entry reads the schemas that exist in the database to populate this list.
Table nameSelect the name of the table to bulk load. The Bulk load into Azure SQL DB entry reads the table names from your selected schema to populate this list.
ColumnsPreview of the column names and associated data types within your selected table.
NoteYou can use variables in the schema and table name fields.

Options tab

Bulk load into Azure SQL DB Options tab
OptionDescription
Truncate tableSelect to remove all the data in the table before bulk loading the current data. Clear to append data during a new data load (default).

Advanced options tab

Bulk load into Azure SQL DB Advanced options tab
OptionDescription
Force

Specify whether to force loading of files into a database:

  • True

    Loads data to the table even if the data was already loaded from that file before. This option can potentially duplicate data in a table.

  • False

    Ignores staged data files already loaded into the table (default).