buryat's Stars
donnemartin/system-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
meilisearch/meilisearch
A lightning-fast search API that fits effortlessly into your apps, websites, and workflow
metabase/metabase
The simplest, fastest way to get business intelligence and analytics to everyone in your company :yum:
open-guides/og-aws
📙 Amazon Web Services — a practical guide
herrbischoff/awesome-macos-command-line
Use your macOS terminal shell to do awesome things.
rlabbe/Kalman-and-Bayesian-Filters-in-Python
Kalman Filter book using Jupyter Notebook. Focuses on building intuition and experience, not formal proofs. Includes Kalman filters,extended Kalman filters, unscented Kalman filters, particle filters, and more. All exercises include solutions.
PrefectHQ/prefect
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
questdb/questdb
QuestDB is an open source time-series database for fast ingest and SQL queries
trinodb/trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
haifengl/smile
Statistical Machine Intelligence & Learning Engine
airbnb/knowledge-repo
A next-generation curated knowledge sharing platform for data scientists and other technical professions.
tdunning/t-digest
A new data structure for accurate on-line accumulation of rank-based statistics such as quantiles and trimmed means
facebookarchive/LogDevice
Distributed storage for sequential data
tonymorris/fp-course
Functional Programming Course
vas3k/vas3k.club
No bullshit IT community with private membership
stanch/reftree
Automatically generated diagrams and animations for Scala data structures
saddle/saddle
SADDLE: Scala Data Library
geohot/twitchchess
like twitchslam, for chess
plynx-team/plynx
PLynx is a domain agnostic platform for managing reproducible experiments and data-oriented workflows.
delight-im/Knowledge
Random pieces of knowledge — with anecdotes and quotes
groupon/spark-metrics
A library to expose more of Apache Spark's metrics system
barzan/dbseer
DBSeer
SerenityOS/yaksplained
All the SerenityOS Yaks, explained
maropu/spark-sql-flow-plugin
Visualize column-level data lineage in Spark SQL
grdl/git-get
A better way to clone, organize and manage multiple git repositories
swoop-inc/spark-records
Bulletproof Apache Spark jobs with fast root cause analysis of failures.
buryat/loglog
LogLog and HyperLogLog algorithms implementations
hammerlab/magic-rdds
Miscellaneous functionality for manipulating Apache Spark RDDs.
isarn/isarn-sketches
Sketching data structures for scala, including t-digest
jfmyers9/blogs