ttelfer's Stars
nicosuave/buenavista
A Proxy Server for DuckDB & Postgres in Python with support for querying SQLMesh metrics
openai/swarm
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by the OpenAI Solution team.
ucbepic/docetl
A system for agentic LLM-powered data processing and ETL
warpstreamlabs/bento
Fancy stream processing made operationally mundane. This repository is a fork of the original project before the license was changed.
mikekelly/AgentK
An autoagentic AGI that is self-evolving and modular.
duckdb/pg_duckdb
DuckDB-powered Postgres for high performance apps & analytics.
abiosoft/colima
Container runtimes on macOS (and Linux) with minimal setup
k-korn/misc-scripts
Miscellaneous scripts
zarf-dev/zarf
DevSecOps for Air Gap & Limited-Connection Systems. https://zarf.dev/
readysettech/readyset
Readyset is a MySQL and Postgres wire-compatible caching layer that sits in front of existing databases to speed up queries and horizontally scale read throughput. Under the hood, Readyset caches the results of cached SELECT statements and incrementally updates these results over time as the underlying data changes.
AlexIoannides/llm-regression
Exploring the classical regression capabilities of LLMs.
eakmanrq/sqlframe
Turning PySpark Into a Universal DataFrame API
Sanofi-Public/emrflow
EMRFlow is designed to simplify the process of running PySpark jobs on Amazon EMR (Elastic Map Reduce).
mikebrady/shairport-sync
AirPlay and AirPlay 2 audio player
linkedin/openhouse
Open Control Plane for Tables in Data Lakehouse
ArroyoSystems/arroyo
Distributed stream processing engine in Rust
paradedb/paradedb
Postgres for Search and Analytics
walthowd/husbzb-firmware
Nortek GoControl HUSBZB-1 / EM3581 Firmware update image
bytewax/bytewax
Python Stream Processing
AlexIoannides/pyspark-example-project
Implementing best practices for PySpark ETL jobs and applications.
dokan-dev/dokany
User-mode file system library for Windows with FUSE wrapper
Aircoookie/WLED
Control WS2812B and many more types of digital RGB LEDs with an ESP8266 or ESP32 over WiFi!
DataThirstLtd/databricksConnectDocker
Docker Images with Databricks Connect Ready to go
guillermo-navas-palencia/optbinning
Optimal binning: monotonic binning with constraints. Support batch & stream optimal binning. Scorecard modelling and counterfactual explanations.
uber/fiber
Distributed Computing for AI Made Simple
databricks/xgb-regressor
MLflow XGBoost Regressor
amitkaps/hackermath
Introduction to Statistics and Basics of Mathematics for Data Science - The Hacker's Way
yandex/odyssey
Scalable PostgreSQL connection pooler
tomkerkhove/promitor
Bringing Azure Monitor metrics where you need them.
kedacore/keda
KEDA is a Kubernetes-based Event Driven Autoscaling component. It provides event-driven scaling for any container running in Kubernetes.