treff7es's Stars
apache/superset
Apache Superset is a Data Visualization and Data Exploration Platform
derailed/k9s
🐶 Kubernetes CLI To Manage Your Clusters In Style!
plotly/dash
Data Apps & Dashboards for Python. No JavaScript Required.
bloomberg/memray
Memray is a memory profiler for Python
dagster-io/dagster
An orchestration platform for the development, production, and observation of data assets.
trinodb/trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
datahub-project/datahub
The Metadata Platform for your Data and AI Stack
sqlfluff/sqlfluff
A modular SQL linter and auto-formatter with support for multiple dialects and templated code.
apache/zeppelin
Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.
apache/pinot
Apache Pinot - A realtime distributed OLAP datastore
amundsen-io/amundsen
Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.
andialbrecht/sqlparse
A non-validating SQL parser module for Python
SchemaStore/schemastore
A collection of JSON schema files including full API
grantjenks/python-diskcache
Python disk-backed cache (Django-compatible). Faster than Redis and Memcached. Pure-Python.
google/zetasql
ZetaSQL - Analyzer Framework for SQL
fabiocaccamo/python-benedict
📘 dict subclass with keylist/keypath support, built-in I/O operations (base64, csv, html, ini, json, pickle, plist, query-string, toml, xls, xml, yaml), s3 support and many utilities.
reata/sqllineage
SQL Lineage Analysis Tool powered by Python
cloudflare/hellogopher
Hellogopher: "just clone and make" your conventional Go project
macbre/sql-metadata
Uses tokenized query returned by python-sqlparse and generates query metadata
linkedin/kafka-tools
A collection of tools for working with Apache Kafka.
laughingman7743/PyAthena
PyAthena is a Python DB API 2.0 (PEP 249) client for Amazon Athena.
lelit/pglast
PostgreSQL Languages AST and statements prettifier: master branch covers PG10, v2 branch covers PG12, v3 covers PG13, v4 covers PG14, v5 covers PG15, v6 covers PG16, v7 covers PG17
lyft/presto-gateway
A load balancer / proxy / gateway for prestodb
linkedin/cruise-control-ui
Cruise Control Frontend (CCFE): Single Page Web Application to Manage Large Scale of Kafka Clusters
linkedin/transport
A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Apache Hive, and Presto.
ananthdurai/schemata
Schema modelling framework for decentralised domain-driven ownership of data.
trustpilot/beat-exporter
Elastic beat-exporter for Prometheus
ananthdurai/airflow-training
Airflow training for the crunch conf
project-thirdeye/thirdeye
ThirdEye is an integrated tool for realtime monitoring of time series and interactive root-cause analysis. It enables anyone inside an organization to collaborate on effective identification and analysis of deviations in business and system metrics. ThirdEye supports the entire workflow from anomaly detection, over root-cause analysis, to issue resolution and post-mortem reporting.
sspaeti-com/awesome-dagster
A curated list of dagster code snippets for data engineers