/signalfx-python

Python client library and instrumentation bindings for SignalFx

Primary LanguagePythonApache License 2.0Apache-2.0

Python client library for SignalFx

This is a programmatic interface in Python for SignalFx's metadata and ingest APIs. It is meant to provide a base for communicating with SignalFx APIs that can be easily leveraged by scripts and applications to interact with SignalFx or report metric and event data to SignalFx. It is also the base for metric reporters that integrate with common Python-based metric collections tools or libraries.

Installation

To install with pip:

$ pip install signalfx

To install from source:

$ git clone https://github.com/signalfx/signalfx-python.git
$ cd signalfx-python/
$ pip install -e .

Usage

This client library provides programmatic access to SignalFx's APIs:

  • the data ingest API;
  • the metadata REST API;
  • the SignalFlow API.

You start by instantiating a signalfx.SignalFx() object, which then gives you access to the API client that you want:

import signalfx

sfx = signalfx.SignalFx(api_endpoint='https://api.{REALM}.signalfx.com',
        ingest_endpoint='https://ingest.{REALM}.signalfx.com',
        stream_endpoint='https://stream.{REALM}.signalfx.com')


# For the ingest API
ingest = sfx.ingest('ORG_TOKEN')

# For the REST API
rest = sfx.rest('API_TOKEN')

# For the SignalFlow API
flow = sfx.signalflow('ACCESS_TOKEN')

If no endpoints are set manually, this library uses the us0 realm by default. If you are not in this realm, you will need to explicitly set the endpoint urls above. To determine if you are in a different realm and need to explicitly set the endpoints, check your profile page in the SignalFx web application. You will also need to specify an access token when requesting one of those clients. For the ingest client, you need to specify your organization access token (which can be obtained from the SignalFx organization you want to report data into). For the REST API, you must use your user access token. For the SignalFlow client, either an organization access token or a user access token may be used. For more information on access tokens, see the API's Authentication documentation.

Reporting data

Basic usage of the library for reporting data goes as follows:

import signalfx

with signalfx.SignalFx().ingest('ORG_TOKEN') as sfx:
    sfx.send(
        gauges=[
          {'metric': 'myfunc.time',
           'value': 532,
           'timestamp': 1442960607000},
          ...
        ],
        counters=[
          {'metric': 'myfunc.calls',
           'value': 42,
           'timestamp': 1442960607000},
          ...
        ],
        cumulative_counters=[
          {'metric': 'myfunc.calls_cumulative',
           'value': 10,
           'timestamp': 1442960607000},
          ...
        ])

If you're sending data from multiple places in your code, you should create your ingest client once and use it throughout your application. Each ingest client instance has an internal queue of datapoints and events that need to be sent to SignalFx, as well as an internal thread draining that queue. When you no longer need the client instance, make sure you call .stop() on it to ensure the queue is fully drained.

import signalfx

sfx = signalfx.SignalFx().ingest('ORG_TOKEN')
try:
    sfx.send(...)
    sfx.send(...)
finally:
    # Make sure that everything gets sent.
    sfx.stop()

The timestamp must be a millisecond precision timestamp; the number of milliseconds elapsed since Epoch. The timestamp field is optional, but strongly recommended. If not specified, it will be set by SignalFx's ingest servers automatically; in this situation, the timestamp of your datapoints will not accurately represent the time of their measurement (network latency, batching, etc. will all impact when those datapoints actually make it to SignalFx).

When sending datapoints with multiple calls to send(), it is recommended to re-use the same SignalFx client object for each send() call.

If you must use multiple client objects for the same token, which is not recommended, it is important to call stop() after making all send() calls. Each SignalFx client object uses a background thread to send datapoints without blocking the caller. Calling stop() will gracefully flush the thread's send queue and close its TCP connections.

Sending multi-dimensional data

Reporting dimensions for the data is also optional, and can be accomplished by specifying a dimensions parameter on each datapoint containing a dictionary of string to string key/value pairs representing the dimensions:

import signalfx

with signalfx.SignalFx().ingest('ORG_TOKEN') as sfx:
    sfx.send(
        gauges=[
          {
            'metric': 'myfunc.time',
            'value': 532,
            'timestamp': 1442960607000,
            'dimensions': {'host': 'server1', 'host_ip': '1.2.3.4'}
          },
          ...
        ], ...)

See examples/generic_usecase.py for a complete code sample showing how to send data to SignalFx.

Sending events

Events can be sent to SignalFx via the send_event() function. The event type must be specified, and dimensions and extra event properties can be supplied as well.

import signalfx

with signalfx.SignalFx().ingest('ORG_TOKEN') as sfx:
    sfx.send_event(
        event_type='deployments',
        dimensions={
            'host': 'myhost',
            'service': 'myservice',
            'instance': 'myinstance'},
        properties={
            'version': '2015.04.29-01'})

Metric metadata and tags

The library includes functions to search, retrieve, and update metric metadata and tags. Deleting tags is also supported.

import signalfx

with signalfx.SignalFx().rest('ORG_TOKEN') as sfx:
    sfx.update_tag('tag_name',
                   description='An example tag',
                   custom_properties={'version': 'some_number'})

AWS integration

Optionally, the client may be configured to append additional dimensions to all metrics and events sent to SignalFx. One use case for this is to append the AWS unique ID of the current host as an extra dimension. For example,

import signalfx
from signalfx.aws import AWS_ID_DIMENSION, get_aws_unique_id

sfx = signalfx.SignalFx().ingest('ORG_TOKEN')

# This dimension will be added to all datapoints sent.
sfx.add_dimensions({AWS_ID_DIMENSION: get_aws_unique_id()})

try:
    sfx.send(
        gauges=[
          {
            'metric': 'myfunc.time',
            'value': 532,
            'timestamp': 1442960607000
            'dimensions': {'host': 'server1', 'host_ip': '1.2.3.4'}
          },
        ])
finally:
    sfx.stop()

Pyformance reporter

pyformance is a Python library that provides CodaHale-style metrics in a very Pythonic way. We offer a reporter that can report the pyformance metric registry data directly to SignalFx.

from signalfx.pyformance import (count_calls, count_calls_with_dims,
                                 gauge, SignalFxReporter)

@count_calls
def callme():
    # whatever
    pass

@count_calls_with_dims(dimension_key="dimension_value")
def callme_with_dims():
    # whatever
    pass

sfx = SignalFxReporter(token='ORG_TOKEN')
sfx.start()

callme()
callme()
callme_with_dims()
callme_with_dims()
gauge('test').set_value(42)

See examples/pyformance_usecase.py for a complete code example using Pyformance.

Executing SignalFlow computations

SignalFlow is SignalFx's real-time analytics computation language. The SignalFlow API allows SignalFx users to execute real-time streaming analytics computations on the SignalFx platform. For more information, head over to our Developers documentation:

The SignalFlow client accepts either an Organization Access Token or a User API Token. Executing a SignalFlow program is very simple with this client library:

import signalfx

program = "data('cpu.utilization').mean().publish()"
with signalfx.SignalFx().signalflow('ACCESS_TOKEN') as flow:
    print('Executing {0} ...'.format(program))
    computation = flow.execute(program)
    for msg in computation.stream():
        if isinstance(msg, signalfx.signalflow.messages.DataMessage):
            print('{0}: {1}'.format(msg.logical_timestamp_ms, msg.data))
        if isinstance(msg, signalfx.signalflow.messages.EventMessage):
            print('{0}: {1}'.format(msg.timestamp_ms, msg.properties))

Metadata about the streamed timeseries is received from .stream(), but it is automatically intercepted by the client library and made available through the Computation object returned by execute():

if isinstance(msg, signalfx.signalflow.messages.DataMessage):
    for tsid, value in msg.data.items():
        metadata = computation.get_metadata(tsid)
        # Display metadata and datapoint value as desired

For more examples of how to execute SignalFlow computation with this library, interpret and use the returned stream messages, you can look at the simple example in examples/signalflow/basic.py or at the SignalFlow CLI and its implementation which uses this library.

Building a Pandas DataFrame from SignalFlow output

With the SignalFlow output being programmatically accessible, it's easy to convert this data into any form that you need for further use or analysis. One such use case is to build a Pandas DataFrame with the computation's output. For a complete example of how to do this, see examples/signalflow/dataframe.py.

Known Issues

Sending only 1 datapoint and not seeing it in the chart

The reason you are not seeing the metrics in the chart is because the script that is calling the Python client module is exiting right after calling the send method. The Python client library is mainly targeted towards sending a continuous stream of metrics and was implemented to be asynchronous.

To work around this problem (most common in short-lived scripts for example), register an atexit function to cleanly stop the datapoint sending thread when your program exits:

import atexit
import signalfx

sfx = signalfx.SignalFx().ingest('ORG_TOKEN')
atexit.register(sfx.stop)

SSLError when working with tags, metrics, dimensions, metrictimeseries, organization

ERROR:root:Posting to SignalFx failed.
SSLError: hostname 'api.signalfx.com' doesn't match either of '*.signalfuse.com', 'signalfuse.com'.

Root Cause: SignalFx's API endpoints (api.signalfx.com, ingest.signalfx.com and stream.signalfx.com) have SSL SNI enabled and the urllib3 module in Python versions prior to 2.7.8 had a bug that causes the above issue. This was fixed in later versions of Python; we recommend using Python 2.7.9 or newer when using this library.

Exceeding int64 integer sizes

The protocol buffer used to transmit data through the ingest API restricts integers and longs to (-(2**63) to (2**63)-1). long values in Python 2.x and int values in 3.x can exceed these values. Any value or property value less than -(2**63) or greater than (2**63)-1 will raise a ValueError exception.

License

Apache Software License v2. Copyright © 2014-2019 SignalFx