Pinned Repositories
.dotfiles
my dotfiles
corpus_preprocessing
Scripts to preprocess bodies of texts to get at 'most important' phrases or most common phrases
divvy_data_challenge
Scripts used in data analysis for the Divvy data challenge
luigi
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
ProgrammingAssignment2
Repository for Programming Assignment 2 for R Programming on Coursera
runfast
scavro
An SBT plugin for automatically calling Avro code generation and a thin scala wrapper for reading and writing Avro files
sentiment_suite
Functions for basic sentiment analysis on texts
spark-jobserver
REST job server for Apache Spark
luigi
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
deathbymochi's Repositories
deathbymochi/corpus_preprocessing
Scripts to preprocess bodies of texts to get at 'most important' phrases or most common phrases
deathbymochi/.dotfiles
my dotfiles
deathbymochi/divvy_data_challenge
Scripts used in data analysis for the Divvy data challenge
deathbymochi/luigi
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
deathbymochi/ProgrammingAssignment2
Repository for Programming Assignment 2 for R Programming on Coursera
deathbymochi/runfast
deathbymochi/scavro
An SBT plugin for automatically calling Avro code generation and a thin scala wrapper for reading and writing Avro files
deathbymochi/sentiment_suite
Functions for basic sentiment analysis on texts
deathbymochi/spark-jobserver
REST job server for Apache Spark