This is a Node.js gRPC client for Zeebe, the workflow engine in Camunda Platform 8. It is written in TypeScript and transpiled to JavaScript in the dist
directory.
Comprehensive API documentation is available online.
See CHANGELOG.md to see what has changed with each release.
Docker-compose configurations for Zeebe are available at zeebe-docker-compose.
Get a hosted instance of Zeebe on Camunda Cloud.
- Versioning
- Compatible Node Versions
- Breaking changes in 1.0.0
- gRPC Implementation
- Type difference from other Zeebe clients
- A note on representing timeout durations
Quick Start
Connection Behaviour
- Client-side gRPC retry in ZBClient
- onReady(), onConnectionError(), and connected
- Initial Connection Tolerance
Connecting to a Broker
Job Workers
- Job Workers
- The
ZBWorker
Job Worker - Unhandled Exceptions in Task Handlers
- Completing tasks with success, failure, error, or forwarded
- Working with Process Variables and Custom Headers
- Constraining the Variables Fetched by the Worker
- The "Decoupled Job Completion" pattern
- The
ZBBatchWorker
Job Worker - Long polling
- Poll Interval
Client Commands
- Deploy Process Models and DMN Tables
- Start a Process Instance
- Start a Process Instance of a specific version of a Process definition
- Start a process instance and await the process outcome
- Publish a Message
- Publish a Start Message
- Activate Jobs
Other Concerns
Programming with Safety
- Generating TypeScript constants for BPMN Models
- Generating code from a BPM Model file
- Writing Strongly-typed Job Workers
- Run-time Type Safety
Development of the Library itself
To enable that the client libraries can be easily supported to the Zeebe server we map the version numbers, so that Major, Minor match the server application. Patches are independent and indicate client updates.
NPM Package version 0.26.x supports Zeebe 0.22.x to 0.26.x.
NPM Package version 1.x supports Zeebe 1.x. It uses the C-based gRPC library by default.
NPM Package version 2.x supports Zeebe 1.x, and requires Node >= 16.6.1, >=14.17.5, or >=12.22.5. It removes the C-based gRPC library and uses the pure JS implementation.
Version 1.x of the package: Node versions <=16.x. Version 1.x uses the C-based gRPC library and does not work with Node 17. The C-based gRPC library is deprecated and no longer being maintained.
Version 2.x and later of the package: Node versions 12.22.5+, 14.17.5+, or 16.6.1+. Version 2.x uses the pure JS implementation of the gRPC library, and requires a fix to the nghttp2
library in Node (See #201).
For Zeebe brokers prior to 1.0.0, use the 0.26.z version of zeebe-node
. This README documents the Zeebe 1.0.0 API. The previous API is documented here.
Zeebe 1.0.0 contains a number of breaking changes, including the gRPC protocol and the API surface area. You must use a 1.x.y version of the client library with Zeebe 1.0.0 and later.
The pre-1.0.0 API of the Node client has been deprecated, but not removed. This means that your pre-1.0.0 applications should still work, just by changing the version of zeebe-node
in the package.json
.
From version 2.x, the Zeebe Node client uses the pure JS gRPC client implementation.
For version 1.x, the Zeebe Node client uses the C gRPC client implementation grpc-node by default. The C-based gRPC implementation is deprecated and is not being maintained.
Protobuf fields of type int64
are serialised as type string in the Node library. These fields are serialised as numbers (long) in the Go and Java client. See grpc/#7229 for why the Node library serialises them as string. The Process instance key, and other fields that are of type long in other client libraries, are type string in this library. Fields of type int32
are serialised as type number in the Node library.
All timeouts are ultimately communicated in milliseconds. They can be specified using the primitive type number
, and this is always a number of milliseconds.
All timeouts in the client library can also, optionally, be specified by a time value that encodes the units, using the typed-durations package. You can specify durations for timeouts like this:
const { Duration } = require('zeebe-node')
const timeoutS = Duration.seconds.of(30) // 30s timeout
const timeoutMs = Duration.milliseconds.of(30000) // 30s timeout in milliseconds
Using the value types makes your code more semantically specific.
There are four timeouts to take into account.
The first is the job timeout
. This is the amount of time that the broker allocates exclusive responsibility for a job to a worker instance. By default, this is 60 seconds. This is the default value set by this client library. See "Job Workers".
The second is the requestTimeout
. Whenever the client library sends a gRPC command to the broker, it has an explicit or implied requestTimeout
. This is the amount of time that the gRPC gateway will wait for a response from the broker cluster before returning a 4 DEADLINE
gRPC error response.
If no requestTimeout
is specified, then the configured timeout of the broker gateway is used. Out of the box, this is 15 seconds by default.
The most significant use of the requestTimeout
is when using the createProcessInstanceWithResult
command. If your process will take longer than 15 seconds to complete, you should specify a requestTimeout
. See "Start a Process Instance and await the Process Outcome".
The third is the longpoll
duration. This is the amount of time that the job worker holds a long poll request to activate jobs open.
The final one is the maximum back-off delay in client-side gRPC command retries. See "Client-side gRPC retry in ZBClient".
npm i zeebe-node
For Zeebe broker versions prior to 1.0.0:
npm i zeebe-node@0
Refer to here for the documentation for the pre-1.0.0 version of the library.
const ZB = require('zeebe-node')
void (async () => {
const zbc = new ZB.ZBClient('localhost:26500')
const topology = await zbc.topology()
console.log(JSON.stringify(topology, null, 2))
})()
const ZB = require('zeebe-node')
const fs = require('fs')
void (async () => {
const zbc = new ZB.ZBClient() // localhost:26500 || ZEEBE_GATEWAY_ADDRESS
const res = await zbc.deployProcess('./domain-mutation.bpmn')
console.log(res)
// Deploy multiple with an array of filepaths
await zbc.deployProcess(['./wf1.bpmn', './wf2.bpmn'])
const buffer = fs.readFileSync('./wf3.bpmn')
// Deploy from an in-memory buffer
await zbc.deployProcess({ definition: buffer, name: 'wf3.bpmn' })
})()
If a gRPC command method fails in the ZBClient - such as ZBClient.deployProcess
or ZBClient.topology()
, the underlying gRPC library will throw an exception.
If no workers have been started, this can be fatal to the process if it is not handled by the application logic. This is especially an issue when a worker container starts before the Zeebe gRPC gateway is available to service requests, and can be inconsistent as this is a race condition.
To mitigate against this, the Node client implements some client-side gRPC operation retry logic by default. This can be configured, including disabled, via configuration in the client constructor.
- Operations retry, but only for gRPC error codes 8 and 14 - indicating resource exhaustion (8) or transient network failure (14). Resource exhaustion occurs when the broker starts backpressure due to latency because of load. Network failure can be caused by passing in an unresolvable gateway address (
14: DNS Resolution failed
), or by the gateway not being ready yet (14: UNAVAILABLE: failed to connect to all addresses
). - Operations that fail for other reasons, such as deploying an invalid bpmn file or cancelling a process that does not exist, do not retry.
- Retry is enabled by default, and can be disabled by passing { retry: false } to the client constructor.
- Values for
retry
,maxRetries
andmaxRetryTimeout
can be configured via the environment variablesZEEBE_CLIENT_RETRY
,ZEEBE_CLIENT_MAX_RETRIES
andZEEBE_CLIENT_MAX_RETRY_TIMEOUT
respectively. maxRetries
andmaxRetryTimeout
are also configurable through the constructor options, or through environment variables. By default, if not supplied, the values are:
const { ZBClient, Duration } = require('zeebe-node')
const zbc = new ZBClient(gatewayAddress, {
retry: true,
maxRetries: -1, // infinite retries
maxRetryTimeout: Duration.seconds.of(5)
})
The environment variables are:
ZEEBE_CLIENT_MAX_RETRIES
ZEEBE_CLIENT_RETRY
ZEEBE_CLIENT_MAX_RETRY_TIMEOUT
Retry is provided by promise-retry, and the back-off strategy is simple ^2.
Additionally, the gRPC Client will continually reconnect when in a failed state, such as when the gateway goes away due to pod rescheduling on Kubernetes.
The ZBClient eagerly connects to the broker by issuing a topology command in the constructor. This allows you an onReady event to be emitted. You can disable this (for example, for testing without a broker), by either passing eagerConnection: false
to the client constructor options, or setting the environment variable ZEEBE_NODE_EAGER_CONNECTION
to false
.
The client has a connected
property that can be examined to determine if it has a gRPC connection to the gateway.
The client and the worker can take an optional onReady()
and onConnectionError()
handler in their constructors, like this:
const { ZBClient, Duration } = require('zeebe-node')
const zbc = new ZBClient({
onReady: () => console.log(`Connected!`),
onConnectionError: () => console.log(`Disconnected!`)
})
const zbWorker = zbc.createWorker({
taskType: 'demo-service',
taskHandler: handler,
onReady: () => console.log(`Worker connected!`),
onConnectionError: () => console.log(`Worker disconnected!`)
})
These handlers are called whenever the gRPC channel is established or lost. As the grpc channel will often "jitter" when it is lost (rapidly emitting READY and ERROR events at the transport layer), there is a connectionTolerance
property that determines how long the connection must be in a connected or failed state before the handler is called. By default this is 3000ms.
You can specify another value either in the constructor or via an environment variable.
To specify it via an environment variable, set ZEEBE_CONNECTION_TOLERANCE
to a number of milliseconds.
To set it via the constructor, specify a value for connectionTolerance
like this:
const { ZBClient, Duration } = require('zeebe-node')
const zbc = new ZBClient({
onReady: () => console.log(`Connected!`),
onConnectionError: () => console.log(`Disconnected!`),
connectionTolerance: 5000 // milliseconds
})
const zbWorker = zbc.createWorker({
taskType: 'demo-service',
taskHandler: handler,
onReady: () => console.log(`Worker connected!`),
onConnectionError: () => console.log(`Worker disconnected!`),
connectionTolerance: Duration.seconds.of(3.5)
})
As well as the callback handlers, the client and workers extend the EventEmitter
class, and you can attach listeners to them for the 'ready' and 'connectionError' events:
const { ZBClient, Duration } = require('zeebe-node')
const zbc = new ZBClient()
const zbWorker = zbc.createWorker({
taskType: 'demo-service',
taskHandler: handler,
connectionTolerance: Duration.seconds.of(3.5)
})
zbWorker.on('ready', () => console.log(`Worker connected!`))
zbWorker.on('connectionError', () => console.log(`Worker disconnected!`))
Some broker connections can initially emit error messages - for example: when connecting to Camunda Cloud, during TLS negotiation and OAuth authentication, the eager commands used to detect connection status will fail, and the library will report connection errors.
Since this is expected behaviour - a characteristic of that particular connection - the library has a configurable "initial connection tolerance". This is a number of milliseconds representing the expected window in which these errors will occur on initial connection.
If the library detects that you are connecting to Camunda Cloud, it sets this window to five seconds (5000 milliseconds). In some environments and under some conditions this may not be sufficient (like connecting to Camunda Cloud from your apartment Wi-fi in South Brisbane, Australia on a rainy day while the microwave link to the next suburb's ADSL exchange is degraded).
You can set an explicit value for this using the environment variable ZEEBE_INITIAL_CONNECTION_TOLERANCE
, set to a number of milliseconds.
The effect of this setting is to suppress connection errors during this window, and only report them if the connection did not succeed by the end of the window.
The Node client does not use TLS by default.
Enable a secure connection by setting useTLS: true
:
const { ZBClient } = require('zeebe-node')
const zbc = new ZBClient(tlsSecuredGatewayAddress, {
useTLS: true,
})
Via environment variable:
ZEEBE_SECURE_CONNECTION=true
You can use a self-signed SSL certificate with the Zeebe client. You need to provide the root certificates, the private key and the SSL cert chain as Buffers. You can pass them into the ZBClient constructor:
const rootCertsPath = '/path/to/rootCerts'
const privateKeyPath = '/path/to/privateKey'
const certChainPath = '/path/to/certChain'
const zbc = new ZBClient({
useTLS: true,
customSSL: {
rootCerts: rootCertsPath,
privateKey: privateKeyPath,
certChain: certChainPath
}
})
Or you can put the file paths into the environment in the following variables:
ZEEBE_CLIENT_SSL_ROOT_CERTS_PATH
ZEEBE_CLIENT_SSL_PRIVATE_KEY_PATH
ZEEBE_CLIENT_SSL_CERT_CHAIN_PATH
ZEEBE_SECURE_CONNECTION=true
In this case, they will be passed to the constructor automatically.
In case you need to connect to a secured endpoint with OAuth, you can pass in OAuth credentials. This will enable TLS (unless you explicitly disable it with useTLS: false
), and handle the OAuth flow to get / renew a JWT:
const { ZBClient } = require('zeebe-node')
const zbc = new ZBClient("my-secure-broker.io:443", {
oAuth: {
url: "https://your-auth-endpoint/oauth/token",
audience: "my-secure-broker.io",
clientId: "myClientId",
clientSecret: "randomClientSecret",
cacheOnDisk: true
}
}
The cacheOnDisk
option will cache the token on disk in $HOME/.camunda
, which can be useful in development if you are restarting the service frequently, or are running in a serverless environment, like AWS Lambda.
If the cache directory is not writable, the ZBClient constructor will throw an exception. This is considered fatal, as it can lead to denial of service or hefty bills if you think caching is on when it is not.
If you put a proxy in front of the broker with basic auth, you can pass in a username and password:
const { ZBClient } = require('zeebe-node')
const zbc = new ZBClient("my-broker-with-basic-auth.io:443", {
basicAuth: {
username: "user1",
password: "secret",
},
useTLS: true
}
Basic Auth will also work without TLS.
Camunda Cloud is a hosted SaaS instance of Zeebe. The easiest way to connect is to use the Zero-conf constructor with the Client Credentials from the Camunda Cloud console as environment variables.
You can also connect to Camunda Cloud by using the camundaCloud
configuration option, using the clusterId
, clientSecret
, and clientId
from the Camunda Cloud Console, like this:
const { ZBClient } = require('zeebe-node')
const zbc = new ZBClient({
camundaCloud: {
clientId,
clientSecret,
clusterId,
clusterRegion, // optional, defaults to bru-2
},
})
That's it! Under the hood, the client lib will construct the OAuth configuration for Camunda Cloud and set the gateway address and port for you.
We recommend the Zero-conf constructor with the configuration passed in via environment variables. This allows you to run your application against different environments via configuration.
The ZBClient has a 0-parameter constructor that takes the config from the environment. This is useful for injecting secrets into your app via the environment, and switching between development and production environments with no change to code.
To use the zero-conf constructor, you create the client like this:
const { ZBClient } = require('zeebe-node')
const zbc = new ZBClient()
With no relevant environment variables set, it will default to localhost on the default port with no TLS.
The following environment variable configurations are possible with the Zero-conf constructor:
Camunda Cloud:
ZEEBE_ADDRESS
ZEEBE_CLIENT_SECRET
ZEEBE_CLIENT_ID
Self-hosted or local broker (no TLS or OAuth):
ZEEBE_ADDRESS
Self-hosted with self-signed SSL certificate:
ZEEBE_CLIENT_SSL_ROOT_CERTS_PATH
ZEEBE_CLIENT_SSL_PRIVATE_KEY_PATH
ZEEBE_CLIENT_SSL_CERT_CHAIN_PATH
ZEEBE_SECURE_CONNECTION=true
Self-hosted or local broker with OAuth + TLS:
ZEEBE_CLIENT_ID
ZEEBE_CLIENT_SECRET
ZEEBE_TOKEN_AUDIENCE
ZEEBE_AUTHORIZATION_SERVER_URL
ZEEBE_ADDRESS
Basic Auth:
ZEEBE_BASIC_AUTH_PASSWORD
ZEEBE_BASIC_AUTH_USERNAME
There are two different types of job worker provided by the Zeebe Node client:
- The
ZBWorker
- this worker operates on individual jobs. - The
ZBBatchWorker
- this worker batches jobs on the client, to allow you to batch operations that pool resources. (This worker was introduced in 0.23.0 of the client).
Much of the information in the following ZBWorker
section applies also to the ZBBatchWorker
. The ZBBatchWorker
section covers the features that differ from the ZBWorker
.
The ZBWorker
takes a job handler function that is invoked for each job. It is invoked as soon as the worker retrieves a job from the broker. The worker can retrieve any number of jobs in a response from the broker, and the handler is invoked for each one, independently.
The simplest signature for a worker takes a string task type, and a job handler function.
The job handler receives the job object, which has methods that it can use to complete or fail the job, and a reference to the worker itself, which you can use to log using the worker's configured logger (See Logging).
Note: The second argument is deprecated, and remains for backward-compatibility - it is a complete function. In the 1.0 version of the API, the complete function methods are available on the job
object.
const ZB = require('zeebe-node')
const zbc = new ZB.ZBClient()
const zbWorker = zbc.createWorker('demo-service', handler)
function handler(job, _, worker) {
worker.log('Task variables', job.variables)
// Task worker business logic goes here
const updateToBrokerVariables = {
updatedProperty: 'newValue',
}
return job.complete(updateToBrokerVariables)
}
Here is an example job:
{ key: '578',
type: 'demo-service',
jobHeaders:
{ processInstanceKey: '574',
bpmnProcessId: 'test-process',
processDefinitionVersion: 1,
processKey: '3',
elementId: 'ServiceTask_0xdwuw7',
elementInstanceKey: '577' },
customHeaders: '{}',
worker: 'test-worker',
retries: 3,
deadline: '1546915422636',
variables: { testData: 'something' } }
The worker can be configured with options. To do this, you should use the object parameter constructor.
Shown below are the defaults that apply if you don't supply them:
const { ZBClient, Duration } = require('zeebe-node')
const zbc = new ZBClient()
const zbWorker = zbc.createWorker({
taskType: 'demo-service',
taskHandler: handler,
// the number of simultaneous tasks this worker can handle
maxJobsToActivate: 32,
// the amount of time the broker should allow this worker to complete a task
timeout: Duration.seconds.of(30),
// One of 'DEBUG', 'INFO', 'NONE'
loglevel: 'INFO',
// Called when the connection to the broker cannot be established, or fails
onConnectionError: () => zbWorker.log('Disconnected')
// Called when the connection to the broker is (re-)established
onReady: () => zbWorker.log('Connected.')
})
Note: this behaviour is for the ZBWorker only. The ZBBatchWorker does not manage this.
When a task handler throws an unhandled exception, the library will fail the job. Zeebe will then retry the job according to the retry settings of the task. Sometimes you want to halt the entire process so you can investigate. To have the library cancel the process on an unhandled exception, pass in {failProcessOnException: true}
to the createWorker
call:
const { ZBClient } = require('zeebe-node')
const zbc = new ZBClient()
zbc.createWorker('console-log', maybeFaultyHandler, {
failProcessOnException: true,
})
To complete a task, the job object that the task worker handler function receives has complete
, fail
, and error
methods.
Call job.complete()
passing in a optional plain old JavaScript object (POJO) - a key:value map. These are variable:value pairs that will be used to update the process state in the broker. They will be merged with existing values. You can set an existing key to null
or undefined
, but there is no way to delete a key.
Call job.fail()
to fail the task. You mus t pass in a string message describing the failure. The client library decrements the retry count, and the broker handles the retry logic. If the failure is a hard failure and should cause an incident to be raised in Operate, then pass in 0
for the optional second parameter, retries
:
job.fail('This is a critical failure and will raise an incident', 0)
From version 8.0.0 of the package, used with a 8.0.0 Zeebe broker, you can specify to the broker an optional backoff for the reactivation of the job, like this:
job.fail({
errorMessage: 'Triggering a retry with a two second back-off',
retryBackOff: 2000,
retries: 1,
})
Call job.error()
to trigger a BPMN error throw event. You must pass in a string error code for the error code, and you can pass an optional error message as the second parameter. If no BPMN error catch event exists for the error code, an incident will be raised.
Call job.forwarded()
to release worker capacity to handle another job, without completing the job in any way with the Zeebe broker. This method supports the decoupled job completion pattern. In this pattern, the worker forwards the job to another system - a lambda or a RabbitMQ queue. Some other process is ultimately responsible for completing the job.
Process variables are available in your worker job handler callback as job.variables
, and any custom headers are available as job.customHeaders
.
These are read-only JavaScript objects in the Zeebe Node client. However, they are not stored that way in the broker.
Both process variables and custom headers are stored in the broker as a dictionary of named strings. That means that the variables and custom headers are JSON.parsed in the Node client when it fetches the job, and any update passed to the success()
function is JSON.stringified.
If you accidentally pass in a circular JSON structure to complete()
- like, for example the response object from an HTTP call - it will throw, as this cannot be serialised to a string.
To update a key deep in the object structure of a process variable, you can use the deepmerge utility:
const merge = require('deepmerge')
const { ZBClient } = require('zeebe-node')
const zbc = new ZBClient()
zbc.createWorker('some-task', job => {
const { people } = job.variables
// update bob's age, keeping all his other properties the same
job.complete(merge(people, { bob: { age: 23 } }))
})
When setting custom headers in BPMN tasks, while designing your model, you can put stringified JSON as the value for a custom header, and it will show up in the client as a JavaScript object.
Process variables and custom headers are untyped in the Zeebe broker, however the Node client in TypeScript mode provides the option to type them to provide safety. You can type your worker as any
to turn that off:
// No type checking - totally dynamic and unchecked
zbc.createWorker<any>({
taskType: 'yolo-jobs',
taskHandler: (job, _, worker) => {
worker.log(`Look ma - ${job.variables.anything.goes.toUpperCase()}`)
job.complete({what: job.variables.could.possibly.go.wrong})
}
})
See the section Writing Strongly-typed Job Workers for more details.
Sometimes you only need a few specific process variables to service a job. One way you can achieve constraint on the process variables received by a worker is by using input variable mappings on the task in the model.
You can also use the fetchVariable
parameter when creating a worker. Pass an array of strings, containing the names of the variables to fetch, to the fetchVariable
parameter when creating a worker. Here is an example, in JavaScript:
zbc.createWorker({
taskType: 'process-favorite-albums',
taskHandler: (job, _, worker) => {
const { name, albums } = job.variables
worker.log(`${name} has the following albums: ${albums.join(', ')}`)
job.complete()
},
fetchVariable: ['name', 'albums'],
})
If you are using TypeScript, you can supply an interface describing the process variables, and parameterize the worker:
interface Variables {
name: string
albums: string[]
}
zbc.createWorker<Variables>({
taskType: 'process-favorite-albums',
taskHandler: (job, _, worker) => {
const { name, albums = [] } = job.variables
worker.log(`${name} has the following albums: ${albums?.join?.(', ')}`)
job.complete()
},
fetchVariable: ['name', 'albums'],
})
This parameterization does two things:
- It informs the worker about the expected types of the variables. For example, if
albums
is a string, callingjoin
on it will fail at runtime. Providing the type allows the compiler to reason about the valid methods that can be applied to the variables. - It allows the type-checker to pick up spelling errors in the strings in
fetchVariable
, by comparing them with the Variables typing.
Note, that this does not protect you against run-time exceptions where your typings are incorrect, or the payload simply does not match the definition that you provided.
See the section Writing Strongly-typed Job Workers for more details on run-time safety.
You can turn off the type-safety by typing the worker as any
:
zbc.createWorker<any>({
taskType: 'process-favorite-albums',
taskHandler: (job, _, worker) => {
const { name, albums = [] } = job.variables
// TS 3.7 safe access to .join _and_ safe call, to prevent run-time exceptions
worker.log(`${name} has the following albums: ${albums?.join?.(', ')}`)
job.complete()
},
fetchVariable: ['name', 'albums'],
})
The Decoupled Job Completion pattern uses a Zeebe Job Worker to activate jobs from the broker, and some other asynchronous (remote) system to do the work.
You might activate jobs and then send them to a RabbitMQ queue, or to an AWS lambda. In this case, there may be no outcome about the job that this worker can report back to the broker about success or failure. That will be the responsibility of another part of your distributed system.
The first thing you should do is ensure that you activate the job with sufficient time for the complete execution of your system. Your worker will not be completing the job, but it informs the broker how long the expected loop will take to close.
Next, call job.forward()
in your job worker handler. This has no side-effect with the broker - so nothing is communicated to Zeebe. The job is still out there with your worker as far as Zeebe is concerned. What this call does is release worker capacity to request more jobs.
If you are using the Zeebe Node library in the remote system, or if the remote system eventually reports back to you (perhaps over a different RabbitMQ queue), you can use the ZBClient methods completeJob()
, failJob()
, and throwError()
to report the outcome back to the broker.
You need at least the job.key
, to be able to correlate the result back to Zeebe. Presumably you also want the information from the remote system about the outcome, and any updated variables.
Here is an example:
- You have a COBOL system that runs a database.
- Somebody wrote an adapter for this COBOL database. In executes commands over SSH.
- The adapter is accessible via a RabbitMQ "request" queue, which takes a command and a correlation id, so that its response can be correlated to this request.
- The adapter sends back the COBOL database system response on a RabbitMQ "response" queue, with the correlation id.
- It typically takes 15 seconds for the round-trip through RabbitMQ to the COBOL database and back.
You want to put this system into a Zeebe-orchestrated BPMN model as a task.
Rather than injecting a RabbitMQ listener into the job handler, you can "fire and forget" the request using the decoupled job completion pattern.
Here is how you do it:
- Your worker gets the job from Zeebe.
- Your worker makes the command and sends it down the RabbitMQ "request" queue, with the
job.jobKey
as the correlation id. - Your worker calls
job.forward()
Here is what that looks like in code:
import { RabbitMQSender } from './lib/my-awesome-rabbitmq-api'
import { ZBClient, Duration } from 'zeebe-node'
const zbc = new ZBClient()
const cobolWorker = zbc.createWorker({
taskType: 'cobol-insert',
timeout: Duration.seconds.of(20), // allow 5s over the expected 15s
taskHandler: job => {
const { key, variables } = job
const request = {
correlationId: key,
command: `INSERT ${variables.customer} INTO CUSTOMERS`
}
RabbitMQSender.send({
channel: 'COBOL_REQ',
request
})
// Call forward() to release worker capacity
return job.forward()
}
)
Now for the response part:
- Another part of your system listens to the RabbitMQ response queue.
- It gets a response back from the COBOL adapter.
- It examines the response, then sends the appropriate outcome to Zeebe, using the jobKey that has been attached as the correlationId
import { RabbitMQListener } from './lib/my-awesome-rabbitmq-api'
import { ZBClient } from 'zeebe-node'
const zbc = new ZBClient()
const RabbitMQListener.listen({
channel: 'COBOL_RES',
handler: message => {
const { outcome, correlationId } = message
if (outcome.SUCCESS) {
zbc.completeJob({
jobKey: correlationId,
variables: {}
})
}
if (outcome.ERROR) {
zbc.throwError({
jobKey: correlationId,
errorCode: "5",
errorMessage: "The COBOL Database reported an error. Boo!"
})
}
})
}
See also the section "Publish a Message", for a pattern that you can use when it is not possible to attach the job key to the round trip data response.
The ZBBatchWorker
Job Worker batches jobs before calling the job handler. Its fundamental differences from the ZBWorker are:
- Its job handler receives an array of one or more jobs.
- The handler is not invoked immediately, but rather when enough jobs are batched, or a job in the batch is at risk of being timed out by the Zeebe broker.
You can use the batch worker if you have tasks that benefit from processing together, but are not related in the BPMN model.
An example would be a high volume of jobs that require calls to an external system, where you have to pay per call to that system. In that case, you may want to batch up jobs, make one call to the external system, then update all the jobs and send them on their way.
The batch worker works on a first-of batch size or batch timeout basis.
You must configure both jobBatchMinSize
and jobBatchMaxTime
. Whichever condition is met first will trigger the processing of the jobs:
- Enough jobs are available to the worker to satisfy the minimum job batch size;
- The batch has been building for the maximum amount of time - "we're doing this now, before the earliest jobs in the batch time out on the broker".
You should be sure to specify a timeout
for your worker that is jobBatchMaxTime
plus the expected latency of the external call plus your processing time and network latency, to avoid the broker timing your batch worker's lock and making the jobs available to another worker. That would defeat the whole purpose.
Here is an example of using the ZBBatchWorker
:
import { API } from './lib/my-awesome-external-api'
import { ZBClient, BatchedJob, Duration } from 'zeebe-node'
const zbc = new ZBClient()
// Helper function to find a job by its key
const findJobByKey = jobs => key => jobs.filter(job => job.jobKey === id)?.[0] ?? []
const handler = async (jobs: BatchedJob[], worker: ZBBatchWorker) => {
worker.log("Let's do this!")
const {jobKey, variables} = job
// Construct some hypothetical payload with correlation ids and requests
const req = jobs.map(job => ({id: jobKey, data: variables.request}))
// An uncaught exception will not be managed by the library
try {
// Our API wrapper turns that into a request, and returns
// an array of results with ids
const outcomes = await API.post(req)
// Construct a find function for these jobs
const getJob = findJobByKey(jobs)
// Iterate over the results and call the succeed method on the corresponding job,
// passing in the correlated outcome of the API call
outcomes.forEach(res => getJob(res.id)?.complete(res.data))
} catch (e) {
jobs.forEach(job => job.fail(e.message))
}
}
const batchWorker = zbc.createBatchWorker({
taskType: 'get-data-from-external-api',
taskHandler: handler,
jobBatchMinSize: 10, // at least 10 at a time
jobBatchMaxTime: 60, // or every 60 seconds, whichever comes first
timeout: Duration.seconds.of(80) // 80 second timeout means we have 20 seconds to process at least
})
See this blog post for some more details on the implementation.
With Zeebe 0.21 onward, long polling is supported for clients, and is used by default. Rather than polling continuously for work and getting nothing back, a client can poll once and leave the request open until work appears. This reduces network traffic and CPU utilization in the server. Every JobActivation Request is appended to the event log, so continuous polling can significantly impact broker performance, especially when an exporter is loaded (see here).
Long polling sends the ActivateJobs
command to the broker, and waits for up to the long poll interval for jobs to be available, rather than returning immediately with an empty response if no jobs are available at that moment.
The default long poll duration is 30s.
To use a different long polling duration, pass in a long poll timeout in milliseconds to the client. All workers created with that client will use it. Alternatively, set a period per-worker.
Long polling for workers is configured in the ZBClient like this:
const { ZBClient, Duration } = require('zeebe-node')
const zbc = new ZBClient('serverAddress', {
longPoll: Duration.minutes.of(10), // Ten minutes - inherited by workers
})
const longPollingWorker = zbc.createWorker({
taskType: 'task-type',
taskHandler: handler,
longPoll: Duration.minutes.of(2), // override client, poll 2m
})
The poll interval is a timer that fires on the configured interval and sends an ActivateJobs
command if no pending command is currently active. By default, this is set to 300ms. This guarantees that there will be a minimum of 300ms between ActivateJobs
commands, which prevents flooding the broker.
Too many ActivateJobs
requests per period of time can cause broker backpressure to kick in, and the gateway to return a GRPC 8 error code.
You can configure this with the pollInterval
option in the client constructor, in which case all workers inherit it as their default. You can also override this by specifying a value in the createWorker
call:
const zbc = new ZBClient({
pollInterval: Duration.milliseconds.of(500),
})
const worker = zbc.createWorker({
taskType: 'send-email',
taskHandler: sendEmailWorkerHandler,
pollInterval: Duration.milliseconds.of(750),
})
From version 8 of Zeebe, deployProcess
in deprecated in favor of deployResource
which allows you to deploy both process models and DMN tables.
You can deploy a resource as a buffer, or by passing a filename - in which case the client library will load the file into a buffer for you.
By passing a filename, and allowing the client library to load the file into a buffer:
async function deploy() {
const zbc = new ZBClient()
const result = await zbc.deployResource({
processFilename: `./src/__tests__/testdata/Client-DeployWorkflow.bpmn`,
})
}
By passing a buffer, and a name:
async function deploy() {
const zbc = new ZBClient()
const process = fs.readFileSync(
`./src/__tests__/testdata/Client-DeployWorkflow.bpmn`
)
const result = await zbc.deployResource({
process,
name: `Client-DeployWorkflow.bpmn`,
})
}
By passing a filename, and allowing the client library to load the file into a buffer:
async function deploy() {
const zbc = new ZBClient()
const result = await zbc.deployResource({
decisionFilename: `./src/__tests__/testdata/quarantine-duration.dmn`,
})
}
By passing a buffer, and a name:
async function deploy() {
const zbc = new ZBClient()
const decision = fs.readFileSync(
`./src/__tests__/testdata/quarantine-duration.dmn`
)
const result = await zbc.deployResource({
decision,
name: `quarantine-duration.dmn`,
})
}
const ZB = require('zeebe-node')
;(async () => {
const zbc = new ZB.ZBClient('localhost:26500')
const result = await zbc.createProcessInstance('test-process', {
testData: 'something',
})
console.log(result)
})()
Example output:
{ processKey: '3',
bpmnProcessId: 'test-process',
version: 1,
processInstanceKey: '569' }
From version 0.22 of the client onward:
const ZB = require('zeebe-node')
;(async () => {
const zbc = new ZB.ZBClient('localhost:26500')
const result = await zbc.createProcessInstance({
bpmnProcessId: 'test-process',
variables: {
testData: 'something',
},
version: 5,
})
console.log(result)
})()
From version 0.22 of the broker and client, you can await the outcome of a process end-to-end execution:
async function getOutcome() {
const result = await zbc.createProcessInstanceWithResult(processId, {
sourceValue: 5,
})
return result
}
Be aware that by default, this will throw an exception if the process takes longer than 15 seconds to complete.
To override the gateway's default timeout for a process that needs more time to complete:
const { ZBClient, Duration } = require('zeebe-node')
const zbc = new ZBClient()
const result = await zbc.createProcessInstanceWithResult({
bpmnProcessId: processId,
variables: {
sourceValue: 5,
otherValue: 'rome',
},
requestTimeout: Duration.seconds.of(25),
// also works supplying a number of milliseconds
// requestTimeout: 25000
})
You can publish a message to the Zeebe broker that will be correlated with a running process instance:
const { ZBClient, Duration } = require('zeebe-node')
const zbc = new ZBClient()
zbc.publishMessage({
correlationKey: 'value-to-correlate-with-process-variable',
messageId: uuid.v4(),
name: 'message-name',
variables: { valueToAddToProcessVariables: 'here', status: 'PROCESSED' },
timeToLive: Duration.seconds.of(10), // seconds
})
When would you do this? Well, the sky is not even the limit when it comes to thinking creatively about building a system with Zeebe - and here's one concrete example to get you thinking:
Recall the example of the remote COBOL database in the section "The "Decoupled Job Completion" pattern". We're writing code to allow that system to be participate in a BPMN-modelling process orchestrated by Zeebe.
But what happens if the adapter for that system has been written in such a way that there is no opportunity to attach metadata to it? In that case we have no opportunity to attach a job key. Maybe you send the fixed data for the command, and you have to correlate the response based on those fields.
Another example: think of a system that emits events, and has no knowledge of a running process. An example from one system that I orchestrate with Zeebe is Minecraft. A logged-in user in the game performs some action, and code in the game emits an event. I can catch that event in my Node-based application, but I have no knowledge of which running process to target - and the event was not generated from a BPMN task providing a worker with the complete context of a process.
In these two cases, I can publish a message to Zeebe, and let the broker figure out which processes are:
- Sitting at an intermediate message catch event waiting for this message; or
- In a sub-process that has a boundary event that will be triggered by this message; or
- Would be started by a message start event, on receiving this message.
The Zeebe broker correlates a message to a running process instance not on the job key - but on the value of one of the process variables (for intermediate message events) and the message name (for all message events, including start messages).
So the response from your COBOL database system, sans job key, is sent back to Zeebe from the RabbitMQListener not via completeJob()
, but with publishMessage()
, and the value of the payload is used to figure out which process it is for.
In the case of the Minecraft event, a message is published to Zeebe with the Minecraft username, and that is used by Zeebe to determine which processes are running for that user and are interested in that event.
See the article "Zeebe Message Correlation" for a complete example with code.
You can also publish a message targeting a Message Start Event.
In this case, the correlation key is optional, and all Message Start events that match the name
property will receive the message.
You can use the publishStartMessage()
method to publish a message with no correlation key (it will be set to a random uuid in the background):
const { ZBClient, Duration } = require('zeebe-node')
const zbc = new ZB.ZBClient('localhost:26500')
zbc.publishStartMessage({
messageId: uuid.v4(),
name: 'message-name',
variables: { initialProcessVariable: 'here' },
timeToLive: Duration.seconds.of(10), // seconds
})
Both normal messages and start messages can be published idempotently by setting both the messageId
and the correlationKey
. They will only ever be correlated once. See: A message can be published idempotent.
If you have some use case that doesn't fit the existing workers, you can write your own custom workers using the ZBClient.activateJobs()
method. It takes an ActivateJobsRequest
object, and returns a stream for that call.
Attach a listener to the stream's 'data' event, and it will be called with an ActivateJobsResponse
object if there are jobs to work on.
To complete these jobs, use the ZBClient
methods completeJob()
, failJob()
, and throwError()
.
For more details, read the source code of the library, particularly the ZBWorkerBase
class. This is an advanced use case, and the existing code in the library is the best documentation.
To drain workers, call the close()
method of the ZBClient. This causes all workers using that client to stop polling for jobs, and returns a Promise that resolves when all active jobs have either finished or timed out.
console.log('Closing client...')
zbc.close().then(() => console.log('All workers closed'))
Control the log output for the client library by setting the ZBClient log level. Valid log levels are NONE
(supress all logging), ERROR
(log only exceptions), INFO
(general logging), or DEBUG
(verbose logging). You can set this in the client constructor:
const zbc = new ZBClient('localhost', { loglevel: 'DEBUG' })
And also via the environment:
ZEEBE_NODE_LOG_LEVEL='ERROR' node start.js
By default the library uses console.info
and console.error
for logging. You can also pass in a custom logger, such as pino:
const logger = require('pino')()
const zbc = new ZBClient({ stdout: logger })
From version v0.23.0-alpha.1, the library logs human-readable logs by default, using the ZBSimpleLogger
. If you want structured logs as stringified JSON, pass in ZBJSONLogger
to the constructor stdout
option, like this:
const { ZBJsonLogger, ZBClient } = require('zeebe-node')
const zbc = new ZBClient({ stdout: ZBJsonLogger })
You can also control this via environment variables:
export ZEEBE_NODE_LOG_TYPE=SIMPLE # Simple Logger (default)
export ZEEBE_NODE_LOG_TYPE=JSON # JSON Logger
Message names and Task Types are untyped magic strings. You can generate type information to avoid some classes of errors.
Install the package globally:
npm i -g zeebe-node
Now you have the command zeebe-node <filename>
that parses a BPMN file and emits type definitions.
The BpmnParser
class provides a static method generateConstantsForBpmnFiles()
.
This method takes a filepath and returns TypeScript definitions that you can use to avoid typos in your code, and to reason about the completeness of your task worker coverage.
const ZB = require('zeebe-node')
;(async () => {
console.log(await ZB.BpmnParser.generateConstantsForBpmnFiles(processFile))
})()
This will produce output similar to:
// Autogenerated constants for msg-start.bpmn
export enum TaskType = {
CONSOLE_LOG = "console-log"
};
export enum MessageName = {
MSG_EMIT_FRAME = "MSG-EMIT_FRAME",
MSG_START_JOB = "MSG-START_JOB"
};
You can scaffold your worker code from a BPMN file with the zeebe-node
command. To use this command, install the package globally with:
npm i -g zeebe-node
Pass in the path to the BPMN file, and it will output a file to implement it:
zeebe-node my-model.bpmn
You can provide interfaces to get design-time type safety and intellisense on the process variables passed in the a worker job handler, the custom headers that it will receive, and the variables that it will pass back to Zeebe in the complete.success
call:
interface InputVariables {
name: string,
age: number,
preferences: {
beverage: 'Coffee' | 'Tea' | 'Beer' | 'Water',
color: string
}
}
interface OutputVariables {
suggestedGift: string
}
interface CustomHeaders {
occasion: 'Birthday' | 'Christmas' | 'Hannukah' | 'Diwali'
}
const giftSuggester = zbc.createWorker<
InputVariables,
CustomHeaders,
OutputVariables>
('get-gift-suggestion', (job, complete) => {
const suggestedGift = `${job.customHeaders.occasion} ${job.variables.preferences.beverage}`
complete.success({ suggestedGift })
})
If you decouple the declaration of the job handler from the createWorker
call, you will need to explicitly specify its type, like this:
import { ZBWorkerTaskHandler } from 'zeebe-node'
function getGiftSuggestion(job, complete): ZBWorkerTaskHandler<InputVariables, CustomHeaders, OutputVariables> {
const suggestedGift = `${job.customHeaders.occasion} ${job.variables.preferences.beverage}`
complete.success({ suggestedGift })
}
const giftSuggester = zbc.createWorker('get-gift-suggestion', getGiftSuggestion)
The parameterization of the client and workers helps to catch errors in code, and if your interface definitions are good, can go a long way to making sure that your workers and client emit the correct payloads and have a strong expectation about what they will receive, but it does not give you any run-time safety.
Your type definition may be incorrect, or the variables or custom headers may simply not be there at run-time, as there is no type checking in the broker, and other factors are involved, such as tasks with input and output mappings, and data added to the process variables by REST calls and other workers.
You should consider:
- Writing interface definitions for your payloads to get design-time assist for protection against spelling errors as you demarshal and update variables.
- Testing for the existence of variables and properties on payloads, and writing defensive pathways to deal with missing properties. If you mark everything as optional in your interfaces, the type-checker will force you to write that code.
- Surfacing code exceptions operationally to detect and diagnose mismatched expectations.
- If you want to validate inputs and outputs to your system at runtime, you can use io-ts. Once data goes into that, it either exits through an exception handler, or is guaranteed to have the shape of the defined codec at run-time.
As with everything, it is a balancing act / trade-off between correctness, safety, and speed. You do not want to lock everything down while you are still exploring.
I recommend the following scale, to match the maturity of your system:
- Start with
<any>
typing for the workers; then - Develop interfaces to describe the DTOs represented in your process variables;
- Use optional types on those interfaces to check your defensive programming structures;
- Lock down the run-time behaviour with io-ts as the boundary validator.
You may choose to start with the DTOs. Anyway, there are options.
The source is written in TypeScript in src
, and compiled to ES6 in the dist
directory.
To build:
npm run build
To start a watcher to build the source and API docs while you are developing:
npm run dev
Tests are written in Jest, and live in the src/__tests__
directory. To run the unit tests:
npm t
Integration tests are in the src/__tests__/integration
directory.
They require a Zeebe broker to run. You can start a dockerised broker:
cd docker
docker-compose up
And then run them manually:
npm run test:integration
For the failure test, you need to run Operate (docker-compose config) and manually verify that an incident has been raised at http://localhost:8080.
Zeebe is inherently stateful, so integration tests need to be carefully isolated so that workers from one test do not service tasks in another test. Jest runs tests in a random order, so intermittent failures are the outcome of tests that mutate shared state.
The tests use a templating function to replace the process id, task types and message names in the bpmn model to produce distinct, isolated namespaces for each test and each test run.
Name |
---|
Josh Wulf |
Colin Raddatz |
Jarred Filmer |
Timothy Colbert |
Olivier Albertini |
Patrick Dehn |