Probabilistic data structures for large or streaming data sets.
This module exports the following sketches:
- Bloom - Bloom filters test for (approximate) set membership.
- CountMin - Count-min sketches estimate frequency counts for streaming values.
- CountMinMean - Count-min-mean sketches modify the estimates of count-min sketches to account for bias.
- NGram - The n-gram sketch counts all n-character substrings (n defaults to 2) in text data.
- StreamSummary - The StreamSummary sketch tracks top-k frequent values.
- TDigest - The t-digest estimates a variable-width histogram for quantile and CDF estimation.
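
To make the first two entries in the list above more concrete, here is a minimal, self-contained sketch of how a Bloom filter and a count-min sketch work in principle. It is not the datalib-sketch API: the names `hash`, `SimpleBloom`, and `SimpleCountMin`, their parameters, and the seeded FNV-1a hashing scheme are all assumptions made for illustration, and a production implementation would likely choose its sizes and hash functions more carefully.

```javascript
// Illustrative only: these names are hypothetical and are NOT the datalib-sketch API.

// FNV-1a 32-bit string hash; the seed is mixed into the starting value
// to get several different hash functions (a simplification).
function hash(str, seed) {
  let h = (0x811c9dc5 ^ seed) >>> 0;
  for (let i = 0; i < str.length; i++) {
    h ^= str.charCodeAt(i);
    h = Math.imul(h, 0x01000193) >>> 0;
  }
  return h;
}

// Bloom filter: each value sets k hashed bit positions in an m-bit array.
class SimpleBloom {
  constructor(m, k) {
    this.m = m;
    this.k = k;
    this.bits = new Uint8Array(Math.ceil(m / 8));
  }
  add(value) {
    for (let i = 0; i < this.k; i++) {
      const b = hash(String(value), i) % this.m;
      this.bits[b >> 3] |= 1 << (b & 7);
    }
  }
  // false means definitely absent; true means probably present.
  has(value) {
    for (let i = 0; i < this.k; i++) {
      const b = hash(String(value), i) % this.m;
      if ((this.bits[b >> 3] & (1 << (b & 7))) === 0) return false;
    }
    return true;
  }
}

// Count-min sketch: d rows of w counters; the estimate is the minimum
// counter across rows, so hash collisions can only inflate it.
class SimpleCountMin {
  constructor(w, d) {
    this.w = w;
    this.d = d;
    this.rows = Array.from({ length: d }, () => new Uint32Array(w));
  }
  add(value, count = 1) {
    for (let i = 0; i < this.d; i++) {
      this.rows[i][hash(String(value), i) % this.w] += count;
    }
  }
  estimate(value) {
    let min = Infinity;
    for (let i = 0; i < this.d; i++) {
      min = Math.min(min, this.rows[i][hash(String(value), i) % this.w]);
    }
    return min;
  }
}

// Usage
const bloom = new SimpleBloom(1024, 3);
bloom.add("apple");
console.log(bloom.has("apple")); // true: an added value is never reported absent

const cm = new SimpleCountMin(256, 4);
["a", "b", "a", "c", "a"].forEach((v) => cm.add(v));
console.log(cm.estimate("a")); // at least 3: the estimate never undercounts
```

Both structures trade exactness for fixed memory: the Bloom filter can return false positives but never false negatives, and the count-min estimate can be too high but never too low.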
To use datalib-sketch in the browser, you need to build the datalib-sketch.js and datalib-sketch.min.js files. We assume that you have npm installed.
- Run `npm install` in the datalib-sketch folder to install dependencies.
- Run `npm run build`. This will invoke browserify to bundle the source files into datalib-sketch.js, and then uglify-js to create the minified datalib-sketch.min.js.