Implementation of the BARE message protocol in Ruby
BARE is a simple efficient binary encoding. Its primary advantage over json is its structured nature and smaller messages sizes. Messages are smaller because they do not describe themselves (no key names). This means the same message schema MUST be present on the sender and receiver. Messages are around 37%-60% smaller than json equivalents.
This implementation is complete but hasn't be rigorously tested for compatibility with another implementation. Please file an issue here on github if you find a bug. Feel free to submit a PR with your own fixes or improvements, just be sure to run the tests.
# Install locally
gem install bare-rb
# Add to Gemfile
gem 'bare-rb'
And then just require the gem when you want to use it:
# Define your schema: here a variable length array of unsigned 1 byte integers
require 'bare-rb'
schema = Bare.Array(Bare.U8)
output = Bare.encode([1,2,3,4], schema)
puts output.inspect
=> "\x04\x01\x02\x03\x04"
# You must know what schema was used to encode data to decode it
Bare.decode(output, schema)
=> [1, 2, 3, 4]
If you need to create referential types you can use symbols as placeholders. You'll also need to wrap your object in a call to Bare.Schema.
require 'bare-rb'
# Schema accepts a hash of symbols starting with a capital letter to bare types.
# You'll need to specify which type in a schema you are referring to when encoding/decoding.
# Type 1 is an int, Type 2 is a reference to Type 1 plus a string.
schema = Bare.Schema({
Type2: Bare.Struct({
t1: :Type1,
name: Bare.String
}),
Type1: Bare.Int
})
# Notice below we specify schema[:Type2] as it now ambiguous to just supply schema
output = Bare.encode({t1: 5, name: "Some Name"}, schema[:Type2])
puts schema.to_s
=> "type Type2 { t1: Type1\n name: string\n}\ntype Type1 int\n"
# ./test3.schema
type Customer {
name: string
email: string
orders: []{
orderId: i64
quantity: i32
}
metadata: map[string]data
}
require 'bare-rb'
schema = Bare.parse_schema('./test3.schema')
# Schema returns a hash so you must select which type in the hash you want
# {:Customer => Bare.Struct(...) }
msg = {name: "和製漢字",
email: "n8 AYT u.northwestern.edu",
orders: [{orderId: 5, quantity: 11},
{orderId: 6, quantity: 2},
{orderId: 123, quantity: -5}],
metadata: {"Something" => "\xFF\xFF\x00\x01".b, "Else" => "\xFF\xFF\x00\x00\xAB\xCC\xAB".b}
}
encoded = Bare.encode(msg, schema[:Customer])
decoded = Bare.decode(encoded, schema[:Customer])
msg == decoded
# True
This package can generate random bare schema + inputs to go with them. You can use this to test your own non-ruby bare package. Eg.
require 'bare-rb'
schema, bin, type_in_schema = Bare.generative_test("./schema.bare", "./input.bin")
puts type_in_schema
==> :someTypeInSchema
Schema.bare and input.bin will be written to the file system. Then you can write a test in your native language
with open("./input.bin", 'rb') as bin:
with open("./schema.bare", 'rb') as schema:
schema_text = schema.read()
binary = bin.read()
encoded_decoded_binary = Bare.encode(Bare.decode(binary, schema, :someTypeInSchema), :someTypeInSchema))
assert encoded_decoded_binary = binary
All public interfaces are semantically versioned. The major version number will be bumped if I change them.
The public interfaces are
- Bare.encode
- Bare.decode
- Bare.parse_schema
- Bare.generative_test (returns a random schema + binary suitable for it + which type in schema the binary is for)
- BareException (base exception class ONLY)
- Bare.Schema Bare.Int, Bare.Struct etc...
- The to_s method on schemas. Converts schema back into text format. The exact text output for a given schema may change without a major or minor version bump BUT the schemas will be equivalent.
There are internal encode/decodes one each class like int and struct. They could change without notice. Use Bare.encode/decode.
- uint
- int
- u8 u16 u32 36
- i8 i16 i32 i64
- f32 f64
- bool
- enum
- string
- data
- data
- void
- optional
- array
- array
- map type -> type
- union (type1 | type2 | type3)
- struct
- schema
Variable length unsigned integer
schema = Bare.Uint
output = Bare.encode(5, schema)
puts output.inspect
=> "\x05".b
Bare.decode(output, schema)
=> 5
Variable length signed integer
schema = Bare.Int
output = Bare.encode(-2, schema)
puts output.inspect
=> "\x03".b
Bare.decode(output, schema)
=> -2
U8, U16, U32, U64
schema = Bare.U16
output = Bare.encode(9, schema)
puts output.size
=> 2
Bare.decode(output, schema)
=> 9
I8, I16, I32, I64
schema = Bare.I16
output = Bare.encode(-9, schema)
puts output.size
=> 2
Bare.decode(output, schema)
=> -9
F32, F64
schema = Bare.F32
output = Bare.encode(-13.3, schema)
puts output.size
=> 4
Bare.decode(output, schema)
=> -13.3
Ruby true or false
schema = Bare.Bool
output = Bare.encode(false, schema)
puts output.inspect
=> "\x00\x00"
Bare.decode(output, schema)
=> false
Accepts a hash from ints to ruby objects. Ints must be positive.
schema = Bare.Enum({1 => "abc", 5 => :cow, 16382 => 123})
output = Bare.encode(:cow, schema)
puts output.inspect
=> "\x05"
Bare.decode(output, schema)
=> :cow
Accepts a string and if it isn't UTF-8 attempts to convert it using rubies msg.encode("UTF-8").b
.
This may fail for some encodings. If it does you may want to use the data type instead.
schema = Bare.String
output = Bare.encode("You need to construct additional pylons", schema)
puts output.bytes.inspect
=> [39, 89, 111, 117, 32, 110, 101, 101, 100, 32, 116, 111, 32, 99, 111, 110, 115, 116, 114, 117, 99, 116, 32, 97, 100, 100, 105, 116, 105, 111, 110, 97, 108, 32, 112, 121, 108, 111, 110, 115]
Bare.decode(output, schema)
=> "You need to construct additional pylons"
Binary data of a known length
schema = Bare.DataFixedLen(5) # length is 5
output = Bare.encode("\x00\x01\x02\x03\x04".b, schema)
puts output.inspect
=> "\x00\x01\x02\x03\x04"
Bare.decode(output, schema)
=> "\x00\x01\x02\x03\x04"
Binary data of a variable length
schema = Bare.Data
output = Bare.encode("\x00\x01\x02\x03\x04".b, schema)
puts output.inspect
=> "\x05\x00\x01\x02\x03\x04"
Bare.decode(output, schema)
=> "\x00\x01\x02\x03\x04"
"A type with zero length. It is useful to create user-defined types which alias void to create discrete options in a tagged union which do not have any underlying storage." -source
schema = Bare.Void
output = Bare.encode(nil, schema)
puts output.inspect
=> ""
Bare.decode(output, schema)
=> nil
schema2 = Bare.Union({0 => Bare.Uint, 1 => Bare.Void})
output = Bare.encode({type: Bare.Void, value: nil}, schema2)
puts output.inspect
=> "\x01" # Notice only the type encoding from union, no other data to represent void
decoded = Bare.decode(output, schema2)
puts decode.inspect
=> { value: nil, type: #<BareTypes::Void:0x00007ffe9a20b868> }
decoded[:type] == Bare.Void
=> true
A type which may or may not be present. Pass nil to indicate the value if not present on the normal type of the value if it is.
schema = Bare.Optional(Bare.U8)
output = Bare.encode(5, schema)
puts output.inspect
=> "\xFF\x05"
Bare.decode(output, schema)
=> 5
schema = Bare.Optional(Bare.U8)
output = Bare.encode(nil, schema)
puts output.inspect
=> "\x00"
Bare.decode(output, schema)
=> nil
Array of a set type of a set length
schema = Bare.ArrayFixedLen(Bare.U8, 3)
output = Bare.encode([5,3,1], schema)
puts output.inspect
=> "\x05\x03\x01"
Bare.decode(output, schema)
=> [5,3,1]
Mapping from one type to another. First arg is from type, second is to type. When decoded it returns a hash.
Ruby hashes have order, if a key is specified more than one in the input or in encoded data the final value is used.
schema = Bare.Map(Bare.Uint, Bare.String)
output = Bare.encode({3 => "Ut-oh"}, schema)
puts output.inspect
=> "\x01\x03\x05Ut-oh"
Bare.decode(output, schema)
=> {3=>"Ut-oh"}
schema = Bare.Map(Bare.Uint, Bare.String)
output = Bare.encode({3 => "Ut-oh", 3 => "Noway"}, schema)
puts output.inspect
=> "\x01\x03\x05Noway"
Unions can represent one type in a set of types. When encoding a union you must specify what type you are encoding as a hash.
Unions also must enumerate their types as positive integers.
Unions cannot have the same type twice.
schema = Bare.Union({1 => Bare.Uint, 5 => Bare.U16})
output = Bare.encode({type: Bare.U16, value: 6}, schema)
puts output.inspect
=> "\x05\x06\x00"
decoded = Bare.decode(output, schema)
puts decoded.inspect
=> {:value=>5, :type=>#<BareTypes::U16:0x00007ffe9511ab70>}
decoded[:type] == Bare.U16
=> true
Structs are a collection of named fields. They are encoded and decoded as hashes. Order of keys in the hash matters.
Keys of these hashes must be a ruby symbol like :this
schema = Bare.Struct({int: Bare.I8, uint: Bare.Uint })
output = Bare.encode({int: -5, uint: 8}, schema)
puts output.bytes.inspect
=> [251, 8]
Bare.decode(output, schema)
=> {:int=>-5, :uint=>8}
A schema is a hash of symbols to bare types.
schema = Bare.Schema({
Type2: Bare.Struct({
t1: :Type1,
name: Bare.String
}),
Type1: Bare.Int
})
Schemas can contain references to one key within another key (See :Type1
above).
schema = Bare.Schema({ Type2: Bare.Struct({ t1: :Type1, name: Bare.String }), Type1: Bare.Int })
puts schema.to_s
>>> "type Type2 { t1: Type1\n name: string\n}\ntype Type1 int\n"
This package can generate random schema with random binaries. This is useful if you're testing another bare implementation.
schema, binary1, type_in_schema = Bare.generative_test
puts schema
==>
type XZsLhtB7Q8n3e4 data
type McRM71 { QJBfn8L54J4Mt4b0TFfZYB22NQ5W0636MM6M0gMVhMBM7d0U4Wrxj7Y3E6lu4Ze2Ftw1mu92: ATBAh2lMsHI8
QM: Ka
Ul83j2o4lZKq8rA: f32
BQUPwKc: u64
U9N: ZzhZ
}
type N7Vz7 f32
type Djv981Ym5 f64
type QG [30]data
type ZzhZ u8
type Ka i8
type B6vHGS2 [31]data
type EK0aM i32
type ATBAh2lMsHI8 data
type ZkCCc data
# see pretty random (:
puts type_in_schema
==> :XZsLhtB7Q8n3e4
ruby_object = Bare.decode(binary, schema, type_in_schema)
binary2 = Bare.encode(ruby_object, schema, type_in_schema)
assert_equal binary, binary
Includes all of the following
Occurs when void is used inappropriately.
Occurs when too few args are passed to a type (eg.a struct with 0 fields)
Occurs when too a value can't be encoded in the maximum allowed space of a uint/int.
Map keys must use a primitive type which is not data or data.
Occurs when an enum is passed two keys with the same value ie { 0 => :abc, 1 => :abc }
Occurs in many cases when the data provided to encode doesn't match the schema.
Bools can only be encoded as a single byte of exactly 0x00 or 0x01. Anything else is an error.
Your schema is infinitely large
type Type1 {
fielda: Type2
}
type Type2 {
fieldb: Type1
fieldc: u8,
}
A schema can be recursive though. This is allowed
type Type1 {
fielda: Type2
}
type Type2 {
fieldb: option<Type1>
fieldc: u8,
}
Since fieldb
is optional the schema isn't a fixed size but can terminate.
We essentially search your schema for a cycle with the following considered edges:
- References in struct fields
- Unions with a single variant
- Arrays types
Maps are not included because they may have length 0. I think it's possible to create a union where both variants lead to the same type (thus the cycle is missed) but I haven't tested this yet. So it may not give the correct error.