mutabazi
(big|small|open) #data #nerd ≡ #AI, #datascience, #FOSS, #p2p, #policy, #globaldev || Statistician @worldbank
Washington, DC, USA
mutabazi's Stars
pcloudcom/console-client
hearmecode/slides
Slides + What can I do after taking Lesson 1? Lesson 2? ... etc
plotly/dash
Data Apps & Dashboards for Python. No JavaScript Required.
mptrepanier/spark-saturday-advanced-hail
dmatrix/spark-saturday
Workshop for Spark and Databricks
OpenRefine/OpenRefine
OpenRefine is a free, open source power tool for working with messy data and improving it
dnielsen/vegnonveg
microsoft/WSL
Issues found on WSL
databricks/spark-deep-learning
Deep Learning Pipelines for Apache Spark
hi-primus/optimus
:truck: Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
thomasnield/oreilly_getting_started_with_sql
Database files for the O'Reilly book "Getting Started with SQL: A hands on approach for beginners" http://goo.gl/z3zG54
adventuresinML/adventures-in-ml-code
This repository holds all the code for the site http://www.adventuresinmachinelearning.com
ben/python-2-3-exercises
Exercises for Porting Python class
TalkingData/Fregata
A light weight, super fast, large scale machine learning library on spark .
gtkcyber/drillworkshop
Learn how to quickly explore your data with Apache Drill
gtkcyber/griffon-vm
Griffon Data Science Virtual Machine
cgivre/data-exploration-with-apache-drill
Data Exploration with Apache Drill
mame/quine-relay
An uroboros program with 100+ programming languages
adbreind/spark-zeppelin-17-1
zackchase/mxnet-the-straight-dope
An interactive book on deep learning. Much easy, so MXNet. Wow. [Straight Dope is growing up] ---> Much of this content has been incorporated into the new Dive into Deep Learning Book available at https://d2l.ai/.
WhiteFangBuck/workshop
intel-analytics/BigDL-Tutorials
Step-by-step Deep Leaning Tutorials on Apache Spark using BigDL
dennyglee/databricks
Repository of sample Databricks notebooks
fivethirtyeight/data
Data and code behind the articles and graphics at FiveThirtyEight
Azure/vagrant-azure
Enable Vagrant to manage virtual machines in Microsoft Azure
awslabs/deeplearning-cfn
Distributed Deep Learning on AWS Using CloudFormation (CFN), MXNet and TensorFlow
worldbank/ml4dev
Machine Learning for Development: A method to Learn and Identify Earth Features from Satellite Images
Quartz/bad-data-guide
An exhaustive reference to problems seen in real-world data along with suggestions on how to resolve them.
vincentarelbundock/countrycode
R package: Convert country names and country codes. Assigns region descriptors.
japila-books/apache-spark-internals
The Internals of Apache Spark