Pinned Repositories
airflow
Apache Airflow
cassandra-guide
Source code for Cassandra: The Definitive Guide, 2nd Ed
Cross-Project-Datasets
Repo for Cross Project Datasets of Interest
Hands-On-Big-Data-Modeling
Hands-On-Big-Data-Modeling, Published by Packt
Spark-The-Definitive-Guide
Spark: The Definitive Guide's Code Repository
tech-interview-handbook
💯 Algorithms study materials, behavioral content and tips for rocking your coding interview
Udacity_Data_Engineering_ND
mmarich1's Repositories
mmarich1/amazon-redshift-utils
Amazon Redshift Utils contains utilities, scripts and view which are useful in a Redshift environment
mmarich1/awesome-cloud-native
A curated list for awesome cloud native tools, software and tutorials. - https://jimmysong.io/awesome-cloud-native/
mmarich1/awesome-pipeline
A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin
mmarich1/aws-cli
Universal Command Line Interface for Amazon Web Services
mmarich1/aws-glue-libs
AWS Glue Libraries are additions and enhancements to Spark for ETL operations.
mmarich1/aws-glue-samples
AWS Glue code samples
mmarich1/Cookbook
The Data Engineering Cookbook
mmarich1/corp
Assets related to the operation of Fishtown Analytics.
mmarich1/cstore_fdw
Columnar store for analytics with Postgres, developed by Citus Data. Check out the mailing list at https://groups.google.com/forum/#!forum/cstore-users or join our slack channel at https://slack.citusdata.com
mmarich1/data-engineer-roadmap
Roadmap to becoming a data engineer in 2021
mmarich1/data-pipelines-with-apache-airflow
Code for Data Pipelines with Apache Airflow
mmarich1/dbeaver
Free universal database tool and SQL client
mmarich1/docker-install
Docker installation script
mmarich1/facebook-python-business-sdk
An SDK built to facilitate application development for Facebook Ads API.
mmarich1/gbfs
Documentation for the General Bikeshare Feed Specification, a standardized data feed for bike share system availability
mmarich1/greenhouse-api-docs
Documentation for Greenhouse Software's APIs
mmarich1/handson-ml
A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in python using Scikit-Learn and TensorFlow.
mmarich1/handson-ml2
A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, Keras and TensorFlow 2.
mmarich1/moby
Moby Project - a collaborative project for the container ecosystem to assemble container-based systems
mmarich1/mondrian
Mondrian is an Online Analytical Processing (OLAP) server that enables business users to analyze large quantities of data in real-time.
mmarich1/PACKT-Python3-OOP-3rd
mmarich1/Programming-in-Scala-3rd
Code examples from the book 'Programming in Scala' (3rd ed) by Martin Odersky
mmarich1/pybikes
bike sharing + python = pybikes
mmarich1/Python-3-Object-Oriented-Programming-Third-Edition
Python 3 Object-Oriented Programming – Third Edition, published by Packt
mmarich1/PythonDataScienceHandbook
Python Data Science Handbook: full text in Jupyter Notebooks
mmarich1/pythonds
Problem Solving with Algorithms and Data Structures using Python
mmarich1/quickstart-ct-clickstream-analytics
AWS Quick Start Team
mmarich1/sampleproject
A sample project that exists for PyPUG's "Tutorial on Packaging and Distributing Projects"
mmarich1/setup.py
📦 A Human's Ultimate Guide to setup.py.
mmarich1/spark-redshift
Redshift data source for Apache Spark