pgvillena's Stars
ClickHouse/ClickHouse
ClickHouse® is a real-time analytics DBMS
cube-js/cube
📊 Cube — The Semantic Layer for Building Data Applications
imohitmayank/jaal
Your interactive network visualizing dashboard
aws-samples/finetune-deploy-bert-with-amazon-sagemaker-for-hugging-face
aws-samples/codenator-automatic-code-generation-and-execution-using-llm
mage-ai/mage-ai
🧙 Build, run, and manage data pipelines for integrating and transforming data.
grai-io/grai-core
dbt-labs/dbt-athena
The athena adapter plugin for dbt (https://getdbt.com)
aws-samples/dbt-glue
This repository contains the dbt-glue adapter
aws/aws-lakeformation-best-practices
tomasfarias/airflow-dbt-python
A collection of Airflow operators, hooks, and utilities to elevate dbt to a first-class citizen of Airflow.
dbt-labs/dbt-redshift
dbt-redshift contains all of the code enabling dbt to work with Amazon Redshift
aws-samples/amazon-redshift-devops-blog
sqlalchemy/alembic
A database migrations tool for SQLAlchemy.
meltano/meltano
Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to writing, maintaining, and scaling your own API integrations.
opengeospatial/geoparquet
Specification for storing geospatial vector data (point, line, polygon) in Parquet
EthanRBrown/rrad
Real, Random Address Data (RRAD)
aws-samples/amazon-redshift-auto-testing-with-data-api
This repository provides a method to automate repeated testing of queries with Amazon Redshift Data API
terricain/aioboto3
Wrapper to use boto3 resources with the aiobotocore async backend
mahmoud/glom
☄️ Python's nested data operator (and CLI), for all your declarative restructuring needs. Got data? Glom it! ☄️
sqlalchemy-redshift/sqlalchemy-redshift
Amazon Redshift SQLAlchemy Dialect
aws/aws-mwaa-local-runner
This repository provides a command line interface (CLI) utility that replicates an Amazon Managed Workflows for Apache Airflow (MWAA) environment locally.
MarquezProject/marquez
Collect, aggregate, and visualize a data ecosystem's metadata
OpenLineage/OpenLineage
An Open Standard for lineage metadata collection
aws/aws-cdk
The AWS Cloud Development Kit is a framework for defining cloud infrastructure in code
aws-samples/aws-analytics-reference-architecture
aws-samples/aws-cdk-examples
Example projects using the AWS CDK
projectmesa/mesa
Mesa is an open-source Python library for agent-based modeling, ideal for simulating complex systems and exploring emergent behaviors.
QuantConnect/Lean
Lean Algorithmic Trading Engine by QuantConnect (Python, C#)
debezium/debezium
Change data capture for a variety of databases. Please log issues at https://issues.redhat.com/browse/DBZ.