Pinned Repositories
airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
airflow
AirFlow is a system to programmatically author, schedule and monitor data pipelines.
airflow-workshop-dataengconf-sf-2017
awesome-apache-airflow
Curated list of resources about Apache Airflow
blackjack-python
A text based blackjack game in python
etl-with-airflow
ETL best practices with airflow, with examples
hqlparser
A small prototype for parsing the Hive Query Language in Python
markdown_science
Learn how to use markdown for science
pygenie
artwr's Repositories
artwr/airflow
AirFlow is a system to programmatically author, schedule and monitor data pipelines.
artwr/awesome-apache-airflow
Curated list of resources about Apache Airflow
artwr/adventofcode2021
artwr/black
The uncompromising Python code formatter
artwr/challenges
PyBites Code Challenges
artwr/clean-url-chrome-extension
A very simple extension to open links while removing URL parameters
artwr/cookiecutter-pypackage
Cookiecutter template for a Python package.
artwr/cpython
The Python programming language
artwr/diffy
artwr/dotfiles
A small set of dotfiles with minor productivity enhancements
artwr/example-node-ops
Example node project with nginx, haproxy, and pool-hall
artwr/hmstools
A python package with useful utilities for interaction with the Apache Hive Metastore
artwr/iceberg
Iceberg is a table format for large, slow-moving tabular data
artwr/incubator-superset
Superset is a data exploration platform designed to be visual, intuitive, and interactive
artwr/ISO-3166-Countries-with-Regional-Codes
ISO 3166-1 country lists merged with their UN Geoscheme regional codes in ready-to-use JSON, XML, CSV data sets
artwr/kite
Kite SDK
artwr/md2googleslides
Generate Google Slides from markdown
artwr/mensor
A dynamic graph-based metric computation engine.
artwr/omniduct
A toolkit providing a uniform interface for connecting to and extracting data from a wide variety of (potentially remote) data stores (including HDFS, Hive, Presto, MySQL, etc).
artwr/open-source-archetypes
A field guide to open source project archetypes
artwr/pandas
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
artwr/papermill
📚 Parameterize, execute, and analyze notebooks
artwr/pgcli
Postgres CLI with autocompletion and syntax highlighting
artwr/Pweave
Pweave is a scientific report generator and a literate programming tool for Python. It can capture the results and plots from data analysis and works well with numpy, scipy and matplotlib.
artwr/rasa_core
machine learning based dialogue engine for AI assistants
artwr/README
My operating manual inspired by https://github.com/KatieLo/README
artwr/spark-distcp
A re-implementation of Hadoop DistCP in Apache Spark
artwr/spark-standalone-cluster-on-docker
Learn Apache Spark in Scala, Python (PySpark) and R (SparkR) by building your own cluster with a JupyterLab interface on Docker. :zap:
artwr/sqlglot
Python SQL Parser and Transpiler
artwr/tdqcp