/sybil

columnar storage + NoSQL OLAP engine

Primary LanguageGoOtherNOASSERTION

Sybil is an append only analytics datastore with no up front table schema requirements; just log JSON records to a table and run queries. Written in Go, sybil is designed for fast full table scans of multi-dimensional data on one or more machines.

more documentation is available on the wiki and in the repo.

if sybil by itself is uninteresting (who wants to run command line queries, anyways?), sybil is a supported backend for snorkel

advantages

  • Easy to setup and get data inside sybil - just pipe JSON on stdin to sybil
  • Supports histograms (and percentiles), standard deviations and time series roll ups
  • Runs fast full table queries (performance notes)
  • Lower disk usage through per column compression schemes
  • Serverless design with controlled memory usage
  • Per table retention policies (specify max age and/or size of tables)
  • Timestamp resistant query cache (optional) that avoids recomputation

disadvantages

  • JOINS not supported
  • No UPDATE operation on data - only writes

installation

go get github.com/logv/sybil

additional information