/probably

Probabilistic Data Structures in Python (originally presented at PyData 2013)

Primary LanguagePythonMIT LicenseMIT

Probably: Probabilistic Data Structures for Realtime Analytics

Package containing some useful probabilistic data structures:

  • BloomFilter
  • CountMinSketch
  • CountdownBloomFilter
  • HyperLogLog (HLL)
  • TemporalDailyBloomFilter

Build with:

python setup.py build_ext --inplace