Pinned Repositories
amazon-sagemaker-examples
This is a fork of AWS Sagemaker examples, not only learning how to use AWS, but also go through the notebooks and familiarize myself with other AI applications.
awsarmy
a simple distribution framework to distribute work to a dozen of AWS boxes using python flask,boto..etc.
binwangREPO
first repository bin created for github
blog
this is a repo that contains the source code captured in my personal blog datafireball.com
chrome_notes
a chrome plugin to take notes and storage the notes locally
Cordova-Examples
A collection of Cordova/Ionic/etc demos.
cryptography
this is a repository where I store code for the Coursera class cryptography from Standford University and all the related fun staff :)
dejavu
Audio fingerprinting and recognition in Python
hadoop_raspberrypi
setting up hadoop on raspberry pi
sc1x
supply chain fundamentals from MIT EDX
datafireball's Repositories
datafireball/hadoop_raspberrypi
setting up hadoop on raspberry pi
datafireball/docker-selenium-hub
docker image for selenium server with headless firefox
datafireball/docker-selenium-node
datafireball/docker_scrapy
a scrapy template with bare minimum effort to be able to get the html of a list of urls
datafireball/getout
this is a python library to extract outlinks for a given URL
datafireball/namemapping
A name mapping library by Dan and Bin to cluster company names using Yahoo Boss API
datafireball/nutch-selenium
datafireball/nutch-selenium-grid-plugin
A Nutch 2.2.1 plugin which allows users to shuffle off the responsibility for retrieving pages to a selenium hub/node spoke system. This allows Nutch to rely on Selenium/Firefox to fetch and load javascript/content; while keeping Nutch in charge of what it does best: crawling and further parsing.
datafireball/rgetout
A R package to get all the outlinks for a given URL