jbrowland's Stars
tiangolo/full-stack-fastapi-template
Full stack, modern web application template. Using FastAPI, React, SQLModel, PostgreSQL, Docker, GitHub Actions, automatic HTTPS and more.
cloud-custodian/cloud-custodian
Rules engine for cloud security, cost optimization, and governance, DSL in yaml for policies to query, filter, and take actions on resources
faif/python-patterns
A collection of design patterns/idioms in Python
meltano/squared
Where the Meltano team runs Meltano! Get it???
meltano/meltano
Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to writing, maintaining, and scaling your own API integrations.
aws-solutions-library-samples/guidance-for-sql-based-etl-with-apache-spark-on-amazon-eks
A guidance that provides declarative data processing capability, and workflow orchestration automation to help your business users (such as analysts and data scientists) access their data and create meaningful insights without the need for manual IT processes.
rochacbruno/python-project-template
DO NOT FORK, CLICK ON "Use this template" - A github template to start a Python Project - this uses github actions to generate your project based on the template.
GoogleCloudPlatform/education-data-platform
Education Data Platform (EDP) is a reference architecture followed by end-to-end blueprints, scripts and a suite of Terraform modules for Google Cloud Platform (GCP), designed to automate the creation, governance and observability of a modern and robust data repository for educational institutions, looking into becoming a data-driven organization.
bentoml/bentoctl
Fast model deployment on any cloud 🚀
awesome-spark/awesome-spark
A curated list of awesome Apache Spark packages and resources.
spotify/scio
A Scala API for Apache Beam and Google Cloud Dataflow.
scala-exercises/scala-exercises
The easy way to learn Scala.
mtdvio/every-programmer-should-know
A collection of (mostly) technical things every software developer should know about
high-performance-spark/high-performance-spark-examples
Examples for High Performance Spark
kelseyhightower/kubernetes-the-hard-way
Bootstrap Kubernetes the hard way. No scripts.
awslabs/data-on-eks
DoEKS is a tool to build, deploy and scale Data & ML Platforms on Amazon EKS
aws-samples/emr-serverless-samples
Example code for running Spark and Hive jobs on EMR Serverless.
nebari-dev/nebari
🪴 Nebari - your open source data science platform
andresionek91/bootcamp-turma-6-data-platform
Data Platform com AWS CDK
terraform-aws-modules/terraform-aws-ecs
Terraform module to create AWS ECS resources 🇺🇦
nicor88/aws-ecs-airflow
Run Airflow in AWS ECS(Elastic Container Service) using Fargate tasks
rodalbuyeh/pyspark-k8s-boilerplate
Boilerplate for PySpark on Cloud Kubernetes
aws/chalice
Python Serverless Microframework for AWS
mspnp/spark-monitoring
Monitoring Azure Databricks jobs
microsoft/code-with-engineering-playbook
This is the playbook for "code-with" customer or partner engagements
getredash/redash
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
Azure/data-product-analytics
Template to deploy a Data Product for analytics and data science use-cases into a Data Landing Zone of the Data Management & Analytics Scenario (former Enterprise-Scale Analytics). The Data Product template can be used by cross-functional teams to create insights and products for external users.
Azure/config-driven-data-pipeline
Azure-Samples/modern-data-warehouse-dataops
DataOps for the Modern Data Warehouse on Microsoft Azure. https://aka.ms/mdw-dataops.
cal-itp/data-infra
Cal-ITP data infrastructure