bobhaffner's Stars
apache/airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
DataExpert-io/data-engineer-handbook
This is a repo with links to everything you'd ever want to learn about data engineering
python/mypy
Optional static typing for Python
11ty/eleventy
A simpler site generator. Transforms a directory of templates (of varying types) into HTML.
debezium/debezium
Change data capture for a variety of databases. Please log issues at https://issues.redhat.com/browse/DBZ.
great-expectations/great_expectations
Always know what to expect from your data.
PostgresApp/PostgresApp
The easiest way to get started with PostgreSQL on the Mac
apache/hudi
Upserts, Deletes And Incremental Processing on Big Data.
rafaelpadilla/Object-Detection-Metrics
Most popular metrics used to evaluate object detection algorithms.
databricks/koalas
Koalas: pandas API on Apache Spark
alexdebrie/awesome-dynamodb
List of resources for learning about modeling, operating, and using Amazon DynamoDB
boringPpl/data-engineer-roadmap
Learning from multiple companies in Silicon Valley. Netflix, Facebook, Google, Startups
apache/iceberg-python
Apache PyIceberg
airbnb/SpinalTap
Change Data Capture (CDC) service
G-Research/spark-extension
A library that provides useful extensions to Apache Spark and PySpark.
JakobMiksch/geospatial-cli
A collection of geospatial programs with commandline interface
awslabs/data-solutions-framework-on-aws
An open-source framework that simplifies implementation of data solutions.
karayaman/lichess-with-a-real-board
Lichess.org client for real life chess boards.
johnculkin/UnofficialListOfPublicAWSRoadmaps
Unofficial list of Public AWS Roadmaps. Roadmaps are a great way to get a peek at what is being planned, view/make comments, and keep up with recently delivered updates.
sbalnojan/easy-functional-data-engineering
A cool simple example of functional data engineering
aws/aws-glue-databrew-jupyter-extension
dataintoresults/data-brewery
Data Brewery is an ETL (Extract-Transform-Load) program that connect to many data sources (cloud services, databases, ...) and manage data warehouse workflow.
sbalnojan/FDE-airflow-tutorial
Functional Data Engineering tutorial in Python & Airflow.
dacort/ci-cd-serverless-spark
Demo for GitHub Universe 2022
cloudshiftstrategies/tropoform
A Terraform like utility to manage AWS Cloud Formation stacks with troposphere
NateSolon/chessdata
Tools for analyzing chess data
bobhaffner/chess_positions
dustywhite7/lightning_talk_folium
A short talk on using Folium for mapping in Python, and associated files
iworkinpixels/OPPO
An FM synthesizer written in Python!
mfcallahan/angular-cli-esri-map-unit-testing
This repository demonstrates an approach for unit testing an Angular 11 application which uses the esri-loader to lazy load ArcGIS API for JavaScript modules. This example leverages Angular's built-in dependency injection and implements a Facade pattern in order to improve code testability.