tejasmanohar's Stars
getredash/redash
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
gravitational/teleport
The easiest and most secure way to access and protect all of your infrastructure.
rrweb-io/rrweb
Record and replay the web.
PrefectHQ/prefect
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
airbytehq/airbyte
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
temporalio/temporal
Temporal service
great-expectations/great_expectations
Always know what to expect from your data.
datahub-project/datahub
The Metadata Platform for your Data Stack
delta-io/delta
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
MaterializeInc/materialize
The data warehouse for operational workloads.
open-metadata/OpenMetadata
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.
stream-labs/desktop
Free and open source streaming software built on OBS and Electron.
amundsen-io/amundsen
Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.
andialbrecht/sqlparse
A non-validating SQL parser module for Python
lightdash/lightdash
Self-serve BI to 10x your data team ⚡️
google/differential-privacy
Google's differential privacy libraries.
OpenLineage/OpenLineage
An Open Standard for lineage metadata collection
re-data/re-data
re_data - fix data issues before your users & CEO would discover them 😊
opticdev/optic
OpenAPI linting, diffing and testing. Optic helps prevent breaking changes, publish accurate documentation and improve the design of your APIs.
dbt-labs/metricflow
MetricFlow allows you to define, build, and maintain metrics in code.
dataform-co/dataform
Dataform is a framework for managing SQL based data operations in BigQuery
delta-io/delta-sharing
An open protocol for secure data sharing
mozilla/moz-sql-parser
DEPRECATED - Let's make a SQL parser so we can provide a familiar interface to non-sql datastores!
snowflakedb/spark-snowflake
Snowflake Data Source for Apache Spark.
fivetran/benchmark
Benchmark data warehouses under Fivetran-like conditions
salto-io/salto
Salto enables you to manage your business applications' configuration in code
getmetamapper/metamapper
Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data.
lucidsoftware/piezo
Piezo is a set of tools for operating a quartz scheduling cluster.
hightouchio/airflow-provider-hightouch
Airflow operators, hooks, and sensors for interacting with the Hightouch API
jerryjliu/llama_index
LlamaIndex (formerly GPT Index) is a data framework for your LLM applications