go-uspclient

Golang USP Client

Primary language: Go. License: Apache-2.0.

Protocol

USP runs over a secure WebSocket connection to the relevant LimaCharlie datacenter, and its messages are formatted in JSON.

Establish Connection

Upon connecting, the client will issue a connection handshake to the server containing identifying information.

ConnectionHeader Format

  • OID: Organization ID this client belongs to.
  • IID: Installation Key ID authorizing this client.
  • Hostname: Hostname of the client.
  • Platform: name of the Platform of the sensors this client will create.
  • Architecture: name of the Architecture of the sensors this client will create.
  • Mapping: metadata indicating how to map fields coming from this client into JSON in LimaCharlie (see below).
  • Mappings: a list of Mapping entries (as defined above) that will be attempted in order; the first one whose ParsingRE field matches will be used.
  • SensorSeedKey: an arbitrary string used in generating the sensor IDs this client will create (see below).
  • IsCompressed: whether the cloud can expect the content shipped to it is compressed.
  • DataFormat: the format used for the rest of the connection, either msgpack or json.
  • InstanceID: an identifier unique to the instance of the adapter, used as an indication that the state of the adapter was reset.
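
As an illustration, the handshake could be modeled as a Go struct and serialized to JSON before being sent. This is a minimal sketch: the field names follow the list above, but the JSON tag names, the `encodeHeader` and `decodeHeader` helpers, and the placeholder values are assumptions made for this example and may not match the wire format the library actually uses.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// Mapping is reduced to a single field here; see Mapping Metadata for the rest.
type Mapping struct {
	ParsingRE string `json:"parsing_re,omitempty"` // tag name is hypothetical
}

// ConnectionHeader mirrors the handshake fields listed above.
// All JSON tag names are hypothetical.
type ConnectionHeader struct {
	OID           string    `json:"oid"`
	IID           string    `json:"iid"`
	Hostname      string    `json:"hostname,omitempty"`
	Platform      string    `json:"platform,omitempty"`
	Architecture  string    `json:"architecture,omitempty"`
	Mappings      []Mapping `json:"mappings,omitempty"`
	SensorSeedKey string    `json:"sensor_seed_key,omitempty"`
	IsCompressed  bool      `json:"is_compressed"`
	DataFormat    string    `json:"data_format,omitempty"`
	InstanceID    string    `json:"instance_id,omitempty"`
}

// encodeHeader serializes the handshake to JSON.
func encodeHeader(h ConnectionHeader) ([]byte, error) {
	return json.Marshal(h)
}

// decodeHeader parses a JSON handshake back into a ConnectionHeader.
func decodeHeader(b []byte) (ConnectionHeader, error) {
	var h ConnectionHeader
	err := json.Unmarshal(b, &h)
	return h, err
}

func main() {
	// Placeholder values only; real OIDs and IIDs come from LimaCharlie.
	h := ConnectionHeader{
		OID:           "example-oid",
		IID:           "example-iid",
		Hostname:      "adapter-host",
		Platform:      "text",
		SensorSeedKey: "my-seed",
		DataFormat:    "json",
	}
	b, err := encodeHeader(h)
	if err != nil {
		panic(err)
	}
	fmt.Println(string(b))
}
```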

Mapping Metadata

  • ParsingRE: regular expression with named capture groups. The name of each group will be used as a key in the resulting JSON.
  • SensorKeyPath: indicates which component of the event represents the unique sensor identifier.
  • SensorHostnamePath: indicates which component of the event represents the hostname of the resulting Sensor in LimaCharlie.
  • EventTypePath: indicates which component of the event represents the Event Type of the resulting event in LimaCharlie.
  • EventTimePath: indicates which component of the event represents the Event Time of the resulting event in LimaCharlie.
  • IsRenameOnly: if true, indicates the field mappings defined here should only rename fields in the original event (rather than completely replacing the original event).
  • Mappings: a list of field remappings to be performed:
    • SourceField: the source component to remap.
    • DestinationField: what the SourceField should be mapped to in the final event.
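
The ParsingRE behavior described above can be sketched with Go's standard `regexp` package. The patterns, the sample log line, and the helper names (`parseWithNamedGroups`, `parseFirstMatch`, `examplePatterns`) are hypothetical and only illustrate the idea of turning named capture groups into keys, with the first matching pattern winning; the actual parsing happens in LimaCharlie and may differ in detail.

```go
package main

import (
	"fmt"
	"regexp"
)

// examplePatterns is a hypothetical ordered list of ParsingRE values.
var examplePatterns = []*regexp.Regexp{
	regexp.MustCompile(`^(?P<key>\w+)=(?P<value>\w+)$`),
	regexp.MustCompile(`^(?P<ts>\S+) (?P<host>\S+) (?P<msg>.*)$`),
}

// parseWithNamedGroups applies re to line and returns a map keyed by the
// names of the capture groups. It reports false if the line does not match.
func parseWithNamedGroups(re *regexp.Regexp, line string) (map[string]string, bool) {
	match := re.FindStringSubmatch(line)
	if match == nil {
		return nil, false
	}
	out := map[string]string{}
	for i, name := range re.SubexpNames() {
		if i == 0 || name == "" {
			continue // skip the whole-match entry and unnamed groups
		}
		out[name] = match[i]
	}
	return out, true
}

// parseFirstMatch tries each pattern in order and returns the fields from
// the first one that matches, along with its index (-1 if none match).
func parseFirstMatch(patterns []*regexp.Regexp, line string) (map[string]string, int) {
	for i, re := range patterns {
		if fields, ok := parseWithNamedGroups(re, line); ok {
			return fields, i
		}
	}
	return nil, -1
}

func main() {
	fields, idx := parseFirstMatch(examplePatterns, "2024-01-01T00:00:00Z web-1 user logged in")
	fmt.Println(idx, fields["ts"], fields["host"], fields["msg"])
}
```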

References to a field or path in the Mapping are strings of slash-separated (/) elements indicating where in a nested JSON object a given value is located. For example, an EventTypePath equal to metadata/original/action applied to the following event would result in foo being used as the Event Type:

{
    "metadata": {
        "original": {
            "action": "foo",
            "bar": "foobar"
        },
        "another": {
            "random": "field"
        }
    }
}
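
The path lookup described above can be sketched in a few lines of Go. The `lookupPath` helper is a hypothetical name introduced for this example; it is not taken from the library and only illustrates how a slash-separated path walks a nested JSON object.

```go
package main

import (
	"encoding/json"
	"fmt"
	"strings"
)

// lookupPath walks a decoded JSON object following a slash-separated path
// and returns the value found there, if any.
func lookupPath(event map[string]interface{}, path string) (interface{}, bool) {
	var current interface{} = event
	for _, key := range strings.Split(path, "/") {
		obj, ok := current.(map[string]interface{})
		if !ok {
			return nil, false // tried to descend into a non-object value
		}
		current, ok = obj[key]
		if !ok {
			return nil, false // key not present at this level
		}
	}
	return current, true
}

func main() {
	raw := `{"metadata": {"original": {"action": "foo", "bar": "foobar"}, "another": {"random": "field"}}}`
	var event map[string]interface{}
	if err := json.Unmarshal([]byte(raw), &event); err != nil {
		panic(err)
	}
	v, ok := lookupPath(event, "metadata/original/action")
	fmt.Println(v, ok) // prints "foo true"
}
```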

Indexing Metadata

  • EventsIncluded: an optional inclusion list of event types this index applies to.
  • EventsExcluded: an optional exclusion list of event types this index does not apply to.
  • Path: the path within the event whose value is to be indexed.
  • Regexp: the regular expression, with a single capture group, to extract the value to index from the Path.
  • IndexType: the type of index this should apply to.
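
To make the interplay of these fields concrete, here is a minimal sketch of an indexing rule in Go. Several things are assumptions for the sake of the example: the `indexRule` type and its helpers are hypothetical, an empty EventsIncluded list is treated as "all event types", exclusions take precedence over inclusions, and the path, event type, and index type values are placeholders. Resolving Path inside the event is left out; `extract` receives the value already found there.

```go
package main

import (
	"fmt"
	"regexp"
)

// indexRule mirrors the Indexing Metadata fields listed above.
type indexRule struct {
	EventsIncluded []string
	EventsExcluded []string
	Path           string
	Regexp         *regexp.Regexp
	IndexType      string
}

// newIndexRule compiles pattern and requires exactly one capture group.
func newIndexRule(path, pattern, indexType string, included, excluded []string) (indexRule, error) {
	re, err := regexp.Compile(pattern)
	if err != nil {
		return indexRule{}, err
	}
	if re.NumSubexp() != 1 {
		return indexRule{}, fmt.Errorf("pattern %q must have exactly one capture group", pattern)
	}
	return indexRule{
		EventsIncluded: included,
		EventsExcluded: excluded,
		Path:           path,
		Regexp:         re,
		IndexType:      indexType,
	}, nil
}

func contains(list []string, s string) bool {
	for _, item := range list {
		if item == s {
			return true
		}
	}
	return false
}

// appliesTo reports whether the rule covers eventType (assumed semantics:
// exclusions win, and an empty inclusion list means every event type).
func (r indexRule) appliesTo(eventType string) bool {
	if contains(r.EventsExcluded, eventType) {
		return false
	}
	return len(r.EventsIncluded) == 0 || contains(r.EventsIncluded, eventType)
}

// extract returns the single captured group from the value found at Path.
func (r indexRule) extract(value string) (string, bool) {
	m := r.Regexp.FindStringSubmatch(value)
	if m == nil {
		return "", false
	}
	return m[1], true
}

func main() {
	rule, err := newIndexRule("event/email", `@(.+)$`, "domain", nil, []string{"heartbeat"})
	if err != nil {
		panic(err)
	}
	if rule.appliesTo("login") {
		v, ok := rule.extract("alice@example.com")
		fmt.Println(rule.IndexType, v, ok)
	}
}
```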

Authorization

Clients are authorized to connect based on the Organization ID (OID) and Installation Key ID (IID). Deleting the IID will prevent the client from connecting with it.

Sensor IDs

USP Clients generate LimaCharlie Sensors at runtime. The ID of those sensors (SID) is generated based on the Organization ID (OID) and the Sensor Seed Key.

This implies that if you want to re-key an IID (perhaps because it was leaked), you may replace the IID with a new valid one. As long as you use the same OID and Sensor Seed Key, the generated SIDs will be stable despite the IID change.
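
The stability property can be illustrated with a deterministic derivation that takes only the OID and the Sensor Seed Key as inputs. The exact algorithm LimaCharlie uses is not described here, so the SHA-256 based `deriveSID` function below is purely a stand-in chosen for this sketch; the point is only that the IID never enters the computation.

```go
package main

import (
	"crypto/sha256"
	"fmt"
)

// deriveSID is a hypothetical derivation, NOT the real LimaCharlie algorithm.
// It hashes the OID and seed key and formats 16 bytes in a UUID-like layout.
func deriveSID(oid, seedKey string) string {
	sum := sha256.Sum256([]byte(oid + "/" + seedKey))
	return fmt.Sprintf("%x-%x-%x-%x-%x", sum[0:4], sum[4:6], sum[6:8], sum[8:10], sum[10:16])
}

func main() {
	// The IID is deliberately absent: rotating it cannot change the result.
	before := deriveSID("example-oid", "my-seed")
	after := deriveSID("example-oid", "my-seed")
	fmt.Println(before == after) // prints "true"
}
```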

Send Data

Once the handshake has been sent, the client can send events.

DataMessage

Receive Feedback

The server may then send control messages to the client.

ControlMessage

The Verb element of a ControlMessage indicates the type of feedback and content that can be expected in the rest of the message.

Flow Control

Starting with version 2, USP supports a TCP-like flow control concept. It introduces a new ControlMessage verb, ControlMessageFLOW (fl), with a WindowSize (win) component. This component is an indication from the LimaCharlie backend that the client should set a larger Window Size (the maximum number of unacknowledged messages in transit). It allows the LimaCharlie backend to bring a given client up to speed in a controlled way, without the client flooding the backend and then requiring aggressive backoff. In version 2, the client should begin execution with a maximum window size of 1 and wait for feedback from LimaCharlie on when to increase it and by how much.
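
A minimal sketch of the client-side bookkeeping this implies is shown below. The `window` type and its methods are hypothetical, the WindowSize value is assumed to be the new absolute window size rather than an increment, and the sketch is not safe for concurrent use. A real client would likely block instead of returning an error when the window is full.

```go
package main

import (
	"errors"
	"fmt"
)

var errWindowFull = errors.New("window full")

// window tracks how many unacknowledged messages may be in transit.
type window struct {
	size     uint64 // current maximum number of unacknowledged messages
	inFlight uint64 // messages sent but not yet acknowledged
}

// newWindow starts with a window size of 1, as described for version 2.
func newWindow() *window {
	return &window{size: 1}
}

// trySend reserves a slot for one message, or fails if the window is full.
func (w *window) trySend() error {
	if w.inFlight >= w.size {
		return errWindowFull
	}
	w.inFlight++
	return nil
}

// onAck releases n slots after the server acknowledges n messages.
func (w *window) onAck(n uint64) {
	if n > w.inFlight {
		n = w.inFlight
	}
	w.inFlight -= n
}

// onFlow applies a WindowSize received in a flow control message
// (assumed here to be the new absolute size).
func (w *window) onFlow(newSize uint64) {
	if newSize > 0 {
		w.size = newSize
	}
}

func main() {
	w := newWindow()
	fmt.Println(w.trySend()) // first message fits in the initial window
	fmt.Println(w.trySend()) // second one does not until the window grows
	w.onFlow(4)
	fmt.Println(w.trySend()) // fits after the server raised the window
}
```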

Acknowledging DataMessage

Each DataMessage is expected to contain a monotonically increasing Sequence Number. The server uses a ControlMessage with a Verb value of ControlMessageACK to indicate that all DataMessages with a Sequence Number equal to or lower than the acknowledged one have been received and processed.
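
One way a client might track this is to keep unacknowledged messages in order and drop every message covered by an acknowledgment, keeping the rest available (for example, so they could be resent). The `pending` and `dataMessage` types below are hypothetical and only sketch that cumulative acknowledgment logic.

```go
package main

import "fmt"

// dataMessage is a simplified stand-in for a USP DataMessage.
type dataMessage struct {
	Seq     uint64
	Payload string
}

// pending holds messages that were sent but not yet acknowledged,
// ordered by increasing sequence number.
type pending struct {
	nextSeq uint64
	unacked []dataMessage
}

// add assigns the next sequence number to payload and records the message.
func (p *pending) add(payload string) dataMessage {
	p.nextSeq++
	m := dataMessage{Seq: p.nextSeq, Payload: payload}
	p.unacked = append(p.unacked, m)
	return m
}

// ack drops every message with a sequence number equal to or lower than seq
// and returns how many messages were released.
func (p *pending) ack(seq uint64) int {
	released := 0
	for released < len(p.unacked) && p.unacked[released].Seq <= seq {
		released++
	}
	p.unacked = p.unacked[released:]
	return released
}

func main() {
	p := &pending{}
	p.add("event-a")
	p.add("event-b")
	p.add("event-c")
	fmt.Println(p.ack(2), len(p.unacked)) // prints "2 1"
}
```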

Custom Formatting

Data sent via USP can be formatted in many different ways. Data is processed in a specific order as a pipeline:

  1. Regular Expression with named capture groups parsing a string into a JSON object.
  2. Built-in (in the cloud) LimaCharlie parsers that apply to specific Platform values (like carbon_black).
  3. The various "extractors" defined, like EventTypePath, EventTimePath, SensorHostnamePath and SensorKeyPath.
  4. Custom Mappings directives provided by the client.
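
As an illustration of the last step, the sketch below applies a list of field remappings to an event, with and without IsRenameOnly. It reflects one reading of the description in Mapping Metadata: with IsRenameOnly the original event is kept and the mapped fields are renamed, and without it the result contains only the mapped fields. The `fieldMapping` type and `applyMappings` function are hypothetical, and for brevity they handle top-level fields only rather than slash-separated paths.

```go
package main

import "fmt"

// fieldMapping mirrors one entry of the Mappings list.
type fieldMapping struct {
	SourceField      string
	DestinationField string
}

// applyMappings returns a new event with the remappings applied.
// The input event is never modified.
func applyMappings(event map[string]interface{}, mappings []fieldMapping, isRenameOnly bool) map[string]interface{} {
	out := map[string]interface{}{}
	if isRenameOnly {
		// Start from a copy of the original event and rename within it.
		for k, v := range event {
			out[k] = v
		}
	}
	for _, m := range mappings {
		v, ok := event[m.SourceField]
		if !ok {
			continue // source field absent, nothing to map
		}
		if isRenameOnly {
			delete(out, m.SourceField)
		}
		out[m.DestinationField] = v
	}
	return out
}

func main() {
	event := map[string]interface{}{"src": "10.0.0.1", "note": "kept only when renaming"}
	mappings := []fieldMapping{{SourceField: "src", DestinationField: "source_ip"}}

	fmt.Println(applyMappings(event, mappings, true))
	fmt.Println(applyMappings(event, mappings, false))
}
```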