Pinned Repositories
atum
A dynamic data completeness and accuracy library at enterprise scale for Apache Spark
enceladus
Dynamic Conformance Engine
hyperdrive
Extensible streaming ingestion pipeline on top of Apache Spark
spline
Data Lineage Tracking And Visualization Solution
Aho_Corasick
Aho-Corasick algorithm implementations (search & replace)
benedeki
General repo
ModryZivot
nba_enricher
Gathers NBA players and their stats, then scans tweets mentioning top players and adds extra info to them.
OccurrenceCounter
OccurrenceCounter is a generic class that continuously counts the number of occurrences of each kind of item. The class can return the top and bottom items at any moment: the top items are those with the highest occurrence count, the bottom items those with the lowest.
sandbox
benedeki's Repositories
benedeki/Aho_Corasick
Aho-Corasick algorithm implementations (search & replace)
benedeki/benedeki
General repo
benedeki/ModryZivot
benedeki/nba_enricher
Gathers NBA players and their stats, then scans tweets mentioning top players and adds extra info to them.
benedeki/OccurrenceCounter
OccurrenceCounter is a generic class that continuously counts the number of occurrences of each kind of item. The class can return the top and bottom items at any moment: the top items are those with the highest occurrence count, the bottom items those with the lowest.
benedeki/sandbox
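
The OccurrenceCounter description above can be illustrated with a minimal Python sketch. This is not the repository's code: the method names `add`, `top`, and `bottom`, the `n` parameter, and the choice of Python are all assumptions made for illustration, and the real class may differ in language, API, and how it handles ties.

```python
from collections import Counter
from collections.abc import Hashable
from typing import Generic, List, Tuple, TypeVar

T = TypeVar("T", bound=Hashable)


class OccurrenceCounter(Generic[T]):
    """Hypothetical sketch of a generic occurrence counter (assumed API)."""

    def __init__(self) -> None:
        # Maps each item kind to the number of times it has been seen.
        self._counts: Counter = Counter()

    def add(self, item: T) -> None:
        # Record one more occurrence of the item.
        self._counts[item] += 1

    def top(self, n: int = 1) -> List[Tuple[T, int]]:
        # The n items with the highest occurrence count.
        return self._counts.most_common(n)

    def bottom(self, n: int = 1) -> List[Tuple[T, int]]:
        # The n items with the lowest occurrence count.
        return sorted(self._counts.items(), key=lambda kv: kv[1])[:n]


counter: OccurrenceCounter = OccurrenceCounter()
for word in ["a", "b", "a", "c", "a", "b"]:
    counter.add(word)

print(counter.top(1))
print(counter.bottom(1))
```

The sketch can be queried between any two `add` calls, which matches the "at any moment" wording. It re-sorts on every `bottom` call, which is fine for small inputs; an implementation meant for continuous counting at scale would likely keep an ordered structure instead.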