/pdi-fastjsoninput-plugin

FastJsonInput step plugin for Pentaho Data Integration

Primary LanguageJavaApache License 2.0Apache-2.0

Fast JSON Input - PDI Plugin

NOTE - this repository is deprecated. Jesse Adametz and James Ebentier from Graphiq picked up the code where I left it and improved on it: https://github.com/graphiq-data/pdi-fastjsoninput-plugin. Their fork is more likely to be actively maintained, we suggest that you have a look over there before using this version.

This is an alternate version of the "JSON Input" step that uses Jayway JsonPath instead of a parser based on JavaScript. It is intended to be a drop-in replacement for the "JSON Input" step but should be much faster and memory efficient.

Features over PDI JSON Input

  • [PDI-10344] Replaced JavaScript parsing engine with Jayway JsonPath
  • [PDI-10858] Checkbox to "Remove source field from output stream"
  • Checkbox to enable JsonPath's DEFAULT_PATH_LEAF_TO_NULL option which returns null for missing leafs:
[
    {
        "name": "Jesse Adametz",
        "gender": "male"
    },
    {
        "name": "Etienne Dube"
    }
]
  • Tests! There are currently 4 test cases which test the permutations of "Ignore missing path" and "Default path leaf to null"

Development

Build

To build (requires Apache Maven 3 or later):

mvn package

Install

  1. Simply create install.properties in the root directory with the following line:

    pdi.home=/path/to/local/data-integration
    
  2. Then run

    mvn install

Authors: