Pinned Repositories
airbyte
Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
AlgorithmVisualizer
Algorithm Visualizer
bad-data-guide
An exhaustive reference to problems seen in real-world data along with suggestions on how to resolve them.
courses
Course materials for the Data Science Specialization: https://www.coursera.org/specialization/jhudatascience/1
dataproduct_deck
dataproducts
datasharing
The Leek group guide to data sharing
segment
tap-itunes
jgooly's Repositories
jgooly/segment
jgooly/tap-itunes
jgooly/airbyte
Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
jgooly/AlgorithmVisualizer
Algorithm Visualizer
jgooly/bad-data-guide
An exhaustive reference to problems seen in real-world data along with suggestions on how to resolve them.
jgooly/courses
Course materials for the Data Science Specialization: https://www.coursera.org/specialization/jhudatascience/1
jgooly/dataproduct_deck
jgooly/dataproducts
jgooly/datasharing
The Leek group guide to data sharing
jgooly/dbt
Data build tool.
jgooly/docs.getdbt.com
The code behind docs.getdbt.com
jgooly/explr_data_project1
jgooly/getting-and-cleaning-data-project
jgooly/go
The Open Source Data Science Masters
jgooly/hadoop-mapreduce-udacity
jgooly/hdbscan
A high performance implementation of HDBSCAN clustering.
jgooly/InstaPy
📷 Instagram Like/Comment/Follow Automation Script
jgooly/jgooly.github.io
jgooly/machine-learning-project
jgooly/midpoint_part_1
jgooly/oh-my-zsh
A delightful community-driven (with 1,000+ contributors) framework for managing your zsh configuration. Includes 200+ optional plugins (rails, git, OSX, hub, capistrano, brew, ant, php, python, etc), over 140 themes to spice up your morning, and an auto-update tool so that makes it easy to keep up with the latest updates from the community.
jgooly/Probabilistic-Programming-and-Bayesian-Methods-for-Hackers
aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)
jgooly/probability_cheatsheet
A comprehensive 10-page probability cheatsheet that covers a semester's worth of introduction to probability.
jgooly/r-programming-assignment-2
Repository for Programming Assignment 2 for R Programming on Coursera
jgooly/RepData_PeerAssessment1
Peer Assessment 1 for Reproducible Research
jgooly/stat-cookbook
The probability and statistics cookbook
jgooly/stat212b
Topics Course on Deep Learning UC Berkeley
jgooly/zingg
Scalable fuzzy matching for data mastering, deduplication and entity resolution.