IgnorantWalking's Stars
spiceai/spiceai
A self-hostable CDN for databases. Spice provides a unified SQL query interface and portable runtime to locally materialize, accelerate, and query datasets from any database, data warehouse, or data lake.
wso2/reference-architecture
The Reference Architecture for Agility is a technology-neutral logical architecture based on a disaggregated cloud-based model.
newrelic/nr1-learn-nrql
NR1 learn NRQL helps New Relic Customers quickly learn our custom query language - NRQL
simonw/llm
Access large language models from the command-line
python-jsonschema/check-jsonschema
A CLI and set of pre-commit hooks for jsonschema validation with built-in support for GitHub Workflows, Renovate, Azure Pipelines, and more!
z3z1ma/alto
Alto is a versatile data integration tool that allows you to easily run Singer plugins, build and cache PEX files encapsulating those plugins, and create a data reservoir whereby you can extract once and replay to as many destinations as you want.
terrastruct/d2
D2 is a modern diagram scripting language that turns text to diagrams.
skyzh/mini-lsm
A tutorial of building an LSM-Tree storage engine in a week!
sematic-ai/sematic
An open-source ML pipeline development platform
dlt-hub/dlt
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
gunnarmorling/1brc
1️⃣🐝🏎️ The One Billion Row Challenge -- A fun exploration of how quickly 1B rows from a text file can be aggregated with Java
ept/hermitage
What are the differences between the transaction isolation levels in databases? This is a suite of test cases which differentiate isolation levels.
dagster-io/dagster
An orchestration platform for the development, production, and observation of data assets.
dagster-io/dagster-open-platform
Dagster Labs' open-source data platform, built with Dagster.
getzola/zola
A fast static site generator in a single binary with everything built-in. https://www.getzola.org
readysettech/readyset
Readyset is a MySQL and Postgres wire-compatible caching layer that sits in front of existing databases to speed up queries and horizontally scale read throughput. Under the hood, ReadySet caches the results of cached select statements and incrementally updates these results over time as the underlying data changes.
mit-pdos/noria
Fast web applications through dynamic, partially-stateful dataflow
jamsocket/plane
A distributed system for running WebSocket services at scale.
TobikoData/sqlmesh
Efficient data transformation and modeling framework that is backwards compatible with dbt.
bitpicky/dbt-sugar
dbt-sugar is a CLI tool that allows users of dbt to have fun and ease performing actions around dbt models
Canner/vulcan-sql
Data API Framework for AI Agents and Data Apps
sodadata/soda-core
:zap: Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io
ets-labs/python-dependency-injector
Dependency injection framework for Python
json-schema-org/understanding-json-schema
A website aiming to provide more accessible documentation for JSON schema.
erika-e/dbt-tips
Collection of dbt Tips and Tricks
elementary-data/dbt-data-reliability
dbt package that is part of Elementary, the dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
Snowflake-Labs/dbt_constraints
This package generates database constraints based on the tests in a dbt project
datonic/datadex
📦 Serverless and local-first Open Data Platform
rilldata/rill
Rill is a tool for effortlessly transforming data sets into powerful, opinionated dashboards using SQL. BI-as-code.
z3z1ma/dbt-feature-flags
Feature Flags in dbt models