sprohaska-vouch's Stars
typst/typst
A new markup-based typesetting system that is powerful and easy to learn.
duckdb/duckdb
DuckDB is an analytical in-process SQL database management system
Mozilla-Ocho/llamafile
Distribute and run LLMs with a single file.
google/cadvisor
Analyzes resource usage and performance characteristics of running containers.
apache/arrow
Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
debezium/debezium
Change data capture for a variety of databases. Please log issues at https://issues.redhat.com/browse/DBZ.
grafana/pyroscope
Continuous Profiling Platform. Debug performance issues down to a single line of code
tobymao/sqlglot
Python SQL Parser and Transpiler
longhorn/longhorn
Cloud-Native distributed storage built on and for Kubernetes
ibis-project/ibis
the portable Python dataframe library
red-data-tools/YouPlot
A command line tool that draw plots on the terminal.
dlt-hub/dlt
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
apache/paimon
Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.
davidgasquez/awesome-duckdb
🦆 A curated list of awesome DuckDB resources
ryanoasis/powerline-extra-symbols
:arrow_forward: Extra glyphs for your powerline separators
openebs/mayastor
Dynamically provision Stateful Persistent Replicated Cluster-wide Fabric Volumes & Filesystems for Kubernetes that is provisioned from an optimized NVME SPDK backend data storage stack.
tabular-io/iceberg-kafka-connect
BauplanLabs/quack-reduce
A playground for running duckdb as a stateless query engine over a data lake
denglend/decode345
Honeywell 345 Mhz decoding
getindata/dbt-flink-adapter
Adapter for dbt that executes dbt pipelines on Apache Flink
dagster-io/awesome-dagster
All things awesome related to Dagster!
rustyconover/duckdb-shellfs-extension
DuckDB extension allowing shell commands to be used for input and output.
outerbounds/metaflow-tools
Tools and utilities for operating Metaflow in production
anna-geller/prefect-streaming
Example project demonstrating deployment patterns for real-time streaming workflows with Prefect 2.0
datarootsio/terraform-aws-ecs-dagster
A terraform module that deploys Dagster to AWS, using ECS.
anelendata/dbt-ksql
dbt ksqlDB adapter
PayLead/dagster-nomad
Nomad launcher/executor for Dagster
hotio/sysinfo.sh
Bash script that tries to give as much info as possible about your Linux system.
moj-analytical-services/splink_graph
pyspark-parallelised functions producing graph-theoretical metrics in connected component clusters for use in record-linkage (or other domains)
slopp/dagster_kafka_demo
Micro-batch processing of streams using Dagster and Kafka