/jpmml-evaluator-python

PMML evaluator library for Python

Primary LanguagePythonGNU Affero General Public License v3.0AGPL-3.0

JPMML-Evaluator-Python Build Status

PMML evaluator library for Python.

Features

This package provides Python wrapper classes and functions for the JPMML-Evaluator library.

Prerequisites

  • Java Platform, Standard Edition 8 or newer.
  • Python 3.9 or newer.

Installation

Installing a release version from PyPI:

pip install jpmml_evaluator

Alternatively, installing the latest snapshot version from GitHub:

pip install --upgrade git+https://github.com/jpmml/jpmml-evaluator-python.git

Usage

Command-line application

The jpmml_evaluator module is executable. The main application loads loads the model from the PMML file, and scores all data records from the input CSV file or stream; scoring results are written to the output CSV file or stream:

python -m jpmml_evaluator DecisionTreeIris.pmml --input Iris.csv --output DecisionTreeIris.csv

If the input CSV file (-i or --input) and/or the output CSV file (-o or --output) is not specified, then system input (sys.stdin) and system output (sys.stdout), respectively, are assumed:

python -m jpmml_evaluator DecisionTreeIris.pmml < Iris.csv > DecisionTreeIris.csv

Getting help:

python -m jpmml_evaluator --help

One some platforms, the Pip package installer additionally makes the main application available as a top-level command:

jpmml_evaluator DecisionTreeIris.pmml < Iris.csv > DecisionTreeIris.csv

Library

Java-to-Python API mapping

Guiding principles:

  1. Java package prefix org.jpmml.evaluator becomes Python package prefix jpmml_evaluator.
  2. Java classes and interfaces become Python classes with the same name.
  3. Java methods become Python methods with the same name. In case of method overloading, the names of Python methods may have a disambiguating suffix (eg. loadFile, loadInputStream) appended to them.
  4. Java parameter types become Python parameter types.

For example, the Java method org.jpmml.evaluator.Evaluator#evaluate(Map<FieldName, ?> arguments) has become a Python method jpmml_evaluator.Evaluator.evaluate(arguments: dict).

Java backend

The communication with the JPMML-Evaluator library is managed by a jpmml_evaluator.JavaBackend object.

Currently, it's possible to choose between three backend implementations:

  JPype PyJNIus Py4J
GitHub project page jpype-project/jpype kivy/pyjnius bartdag/py4j
Python package jpype1 pyjnius py4j
Implementation class jpmml_evaluator \ .jpype.JPypeBackend jpmml_evaluator \ .pyjnius.PyJNIusBackend jpmml_evaluator \ .py4j.Py4JBackend
Implementation class alias "jpype" "pyjnius" "py4j"
Type Local JVM via JNI Local JVM via JNI Local or remote JVM via TCP/IP sockets
Restartable No No Yes
Timing* for evaluate(X) 8.1 -- 9.4 10.8 -- 11.9 37.3 -- 40.2
Timing* for evaluateAll(X) 2.55 -- 2.95 2.77 -- 3.18 3.27 -- 3.62

[*] - Relative timings (smaller is better).

Workflow

Building a verified model evaluator from a PMML file:

from jpmml_evaluator import make_evaluator

evaluator = make_evaluator("DecisionTreeIris.pmml") \
	.verify()

Printing model schema:

inputFields = evaluator.getInputFields()
print("Input fields: " + str([inputField.getName() for inputField in inputFields]))

targetFields = evaluator.getTargetFields()
print("Target field(s): " + str([targetField.getName() for targetField in targetFields]))

outputFields = evaluator.getOutputFields()
print("Output fields: " + str([outputField.getName() for outputField in outputFields]))

Evaluating a single data record:

arguments = {
	"Sepal_Length" : 5.1,
	"Sepal_Width" : 3.5,
	"Petal_Length" : 1.4,
	"Petal_Width" : 0.2
}

results = evaluator.evaluate(arguments)
print(results)

Evaluating a collection of data records:

import pandas

arguments_df = pandas.read_csv("Iris.csv", sep = ",")

results_df = evaluator.evaluateAll(arguments_df)
print(results_df)

License

JPMML-Evaluator-Python is licensed under the terms and conditions of the GNU Affero General Public License, Version 3.0. For a quick summary of your rights ("Can") and obligations ("Cannot" and "Must") under AGPLv3, please refer to TLDRLegal.

If you would like to use JPMML-Evaluator-Python in a proprietary software project, then it is possible to enter into a licensing agreement which makes it available under the terms and conditions of the BSD 3-Clause License instead.

Additional information

JPMML-Evaluator-Python is developed and maintained by Openscoring Ltd, Estonia.

Interested in using JPMML software in your software? Please contact info@openscoring.io