Pinned Repositories
Agile_Data_Code
Chapter-wise code for Agile Data the O'Reilly book
awesome-machine-learning
A curated list of awesome Machine Learning frameworks, libraries and software.
big_data_for_chimps
A Seriously Fun guide to Big Data Analytics in Practice
BigDataR_Examples
Data Science and Machine Learning Examples for Data Science Linux
btc_pandas_analysis
this is an ipython notebook using pandas library for a basic bitcoin trading analysis
bytecoin
CryptoNote protocol implementation
command-line-one-liners
Command line one-liners
content
Official content for Harvard CS109
crazy-data-circles
Files for the intro to D3 workshop
NLTK_2.0_Cookbook
IPython Notebook (and converted .py) examples based on those presented in Jacob Perkin's NLTK 2.0 Cookbook text
Gwillink's Repositories
Gwillink/NLTK_2.0_Cookbook
IPython Notebook (and converted .py) examples based on those presented in Jacob Perkin's NLTK 2.0 Cookbook text
Gwillink/BigDataR_Examples
Data Science and Machine Learning Examples for Data Science Linux
Gwillink/crosscompute-tutorials
Interactive notebooks you can use to learn how to analyze lots of data
Gwillink/data_science_fun_pack
Meta-repository of big data tools -- source and essential plugins for hadoop, pig, wukong, storm, kafka etc.
Gwillink/ipython_d3_mashup
iPython Notebook mashed up with d3.js for data analysis
Gwillink/machine_learning
Python coded examples and documentation of machine learning algorithms.
Gwillink/machineLearning
POC IDS anomaly detection engine built with iPython notebook, matplotlib, pandas, numpy, scikit-learn, d3.js, hyperloglog implementation, PYCON 2013 Intro and Advanced Machine Learning Tutorial Notebooks
Gwillink/MapReduce-course-2013s
MapReduce course at the University of Maryland for Spring 2013
Gwillink/MapReduceAlgorithms
Data-Intensive Text Processing with MapReduce
Gwillink/mapreducepatterns
Repository for MapReduce Design Patterns (O'Reilly 2012) example source code
Gwillink/mass-scraping
Quickly download and scrape websites on a massive scale.
Gwillink/Mining-the-Social-Web
The official online compendium for Mining the Social Web (O'Reilly, 2011)
Gwillink/probability
Notes, problems, simulations developed while following the MIT Open Courseware class "Probability Systems Analysis
Gwillink/Stanford-Data-Mining-Analytics-Stats-202
My coursework for Stanford's Statistical Aspects of Data Mining Course (Stats 202) in iPython Notebooks instead of R
Gwillink/tutorial_notebooks
IPython notebooks for the workshops provided by Montreal Python.
Gwillink/tutorials
Continuum's public tutorial repository
Gwillink/webscraping
collection of python webscraping tools