Skip to main content

Pentaho+ documentation has moved!

The new product documentation portal is here. Check it out now at docs.hitachivantara.com

 

Hitachi Vantara Lumada and Pentaho Documentation

Create a CSV data source

Parent article
Before you begin working with CSV data sources, there are a few key terms that you should know.
  • Delimiter

    A character, such as a comma, used to specify a boundary between separate regions in a data stream.

  • Enclosure

    A container that holds a collection of other data objects.

  • Length

    Indicates the maximum number of characters allowed in a field.

  • Precision

    The number of digits after a decimal point.

Procedure

  1. Log in to the User Console.

  2. Click Create New, then choose Data Source from the menu.

  3. Click New Data Source. The Data Source Wizard appears.

  4. Enter a name that identifies your new data source in the Data Source Name field. The following characters are not allowed in data source names:

    %/:[]*|\t\r\n
    Data Source Wizard
  5. Select CSV File from the Source Type drop-down menu.

  6. Click Import to browse for your CSV file. Double-click to select the CSV file you want to upload.

  7. Choose your delimiter and enclosure types.

    If you want to use the first row as data, disable the First row is header check box.Data Source Wizard CSV First row is header
  8. The File Preview window displays the first few lines of your CSV file based on the selections you made for the delimiter, enclosure, and header. Once the columns align correctly in the preview, the delimiter and enclosure have been set correctly.

    If you want to use the first row of your CSV file as headings for columns in the file, leave First row is header check box selected.
  9. Click Next.

    The Staging Settings screen displays a list of columns from your CSV source file. All columns are enabled.Data Source Wizard Staging Settings
  10. Choose the columns that you want to use in your data source, either individually or by clicking Select All.

    You can deselect all columns by clicking Deselect All.
  11. Change the Name and Type values, if applicable.

    1. Choose the options that you want to use from the drop-down menu for dates and numeric values.
    2. You can enter a value manually in the Source Format text box.
    NoteDrop-down lists are not enabled for certain data types such as the String data type. Boolean values are rendered as true or false.
  12. Click Show File Contents to look at a sample of the data in your source file. Click Close to return to the Staging Settings screen.

    Data Source Wizard File Preview
  13. Continue to work with your CSV data settings or click Finish.

    The Data Source Created window appears.
  14. You can choose to Keep default model or click Customize model now to launch the Data Source Model Editor and refine the model. Click OK.

Results

Your new data source is now available for use in Analyzer, Interactive Reports, and Dashboard Designer reports, or the Data Source Model Editor appears.