Pinned Repositories
canabalt
canabalt score analysis
chucknorrisfacts
Scraping Chuck Norris Facts from chucknorrisfacts.com
hockeyhacking
national_debt
Scrapes the national debt and tweets it
raindrop
real-time twitter analytics using streaming API and MongoDB
steve_jobs_tribute_messages
Analysis of Steve Jobs tribute messages submitted to Apple
top_1000_sites
Code to download and produce a datafile of Google's top 1000 sites
tweetParser
Parses raw twitter JSON from stdin using python. I'm only extracting a few fields for quick processing in PIG. Still a lot of work to do. Currently, it extracts id, timestamp, client program, author, and tweet text. I'll add more fields such as geo, if requested. The filenames for the output and bad tweets are currently hardcoded for my testing. I'll make this more dynamic shortly.
zip-code-data-hacking
sourcing publicly available files, generate useful zip code-county data
neilkod's Repositories
neilkod/steve_jobs_tribute_messages
Analysis of Steve Jobs tribute messages submitted to Apple
neilkod/zip-code-data-hacking
sourcing publicly available files, generate useful zip code-county data
neilkod/tweetParser
Parses raw twitter JSON from stdin using python. I'm only extracting a few fields for quick processing in PIG. Still a lot of work to do. Currently, it extracts id, timestamp, client program, author, and tweet text. I'll add more fields such as geo, if requested. The filenames for the output and bad tweets are currently hardcoded for my testing. I'll make this more dynamic shortly.
neilkod/hockeyhacking
neilkod/raindrop
real-time twitter analytics using streaming API and MongoDB
neilkod/top_1000_sites
Code to download and produce a datafile of Google's top 1000 sites
neilkod/chucknorrisfacts
Scraping Chuck Norris Facts from chucknorrisfacts.com
neilkod/canabalt
canabalt score analysis
neilkod/national_debt
Scrapes the national debt and tweets it
neilkod/2012_mb_corporate_run
Hacking results of 2012 mercedes benz corporate run
neilkod/abides
abides - a system for automatically posting tweets stored on a web adress
neilkod/coursera
Script for downloading Coursera.org videos and naming them.
neilkod/getting_started_with_d3
This is the code repository for the book "Getting Started With D3"
neilkod/lake_okochobee_level
extracting and analyzing water level for Lake Okochobee
neilkod/numbers_from_tweets
numbers_from_tweets
neilkod/oow-vote-hacking
Detecting relationships between voters and session creators
neilkod/replyTracker
monitors replies to a twitter account.
neilkod/commonTweeps
Finds and reports the overlap between two twitter members
neilkod/corporate5k
Analysis of times from Mereces Benz Corporate 5k(Ft. Lauderdale)
neilkod/fruit-ninja-scores
an analysis of fruit ninja scores, downloaded from twitter
neilkod/gardenhose-microslurp
bootstrap scripts for getting a micro ec2 instance piping gardenhose to s3
neilkod/gcr_tires_scraper
scraping gcrtires locations
neilkod/jaccard_sim
messing around with jaccard similarity
neilkod/ScipySuperpack
Recent builds of Numpy, Scipy, Matplotlib, iPython and PyMC for OSX
neilkod/utahpollen
scraping utah pollen data for historical purposes and tweeting
neilkod/bunny1
bunny1 is a tool that lets you write smart bookmarks in python and then share them across all your browsers and with a group of people or the whole world. It was developed at Facebook and is widely used there.
neilkod/neilkod.github.io
neilkod/neiss_2011_datamart
dimensionalizing and creating a data mart from the NEISS 2011 dataset
neilkod/presto
Official home of Presto, the distributed SQL query engine for big data
neilkod/probability
Probability Module, made by Jacob Kodner