dnafrance's Stars
d3/d3
Bring data to life with SVG, Canvas and HTML. :bar_chart::chart_with_upwards_trend::tada:
spotify/luigi
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
ipython/ipython
Official repository for IPython itself. Other repos in the IPython organization contain things like the website, documentation builds, etc.
OpenRefine/OpenRefine
OpenRefine is a free, open source power tool for working with messy data and improving it
spyder-ide/spyder
Official repository for Spyder - The Scientific Python Development Environment
axa-group/nlp.js
An NLP library for building bots, with entity extraction, sentiment analysis, automatic language identify, and so more
google/physical-web
The Physical Web: walk up and use anything
Quartz/bad-data-guide
An exhaustive reference to problems seen in real-world data along with suggestions on how to resolve them.
nborwankar/LearnDataScience
Open Content for self-directed learning in data science
hadley/ggplot2-book
ggplot2: elegant graphics for data analysis
ramnathv/rCharts
Interactive JS Charts from R
databricks/spark-csv
CSV Data Source for Apache Spark 1.x
ipython-books/cookbook-code
[DEPRECATED] See the new edition:
justmarkham/DAT4
General Assembly's Data Science course in Washington, DC
ramnathv/htmlwidgets
HTML Widgets for R
microsoft/node-v0.12
Enable Node.js to use Chakra as its JavaScript engine.
rcongiu/Hive-JSON-Serde
Read - Write JSON SerDe for Apache Hive.
justmarkham/python-reference
Python Quick Reference
Esri/gis-tools-for-hadoop
The GIS Tools for Hadoop are a collection of GIS tools for spatial analysis of big data.
ShinyEd/intro-stats
Shiny apps for introductory statistics
quux00/hive-json-schema
Tool to generate a Hive schema from a JSON example doc
yhat/DataGotham2013
chrisclark/PythonForDataScience
PythonForDataScience
karthik/testdat
A package to run unit tests on tabular data
AlexJF/fabric-scripts
Bunch of Fabric scripts I've created
mine-cetinkaya-rundel/useR-2015
Slides and demo materials for the "Using R, RStudio, and Docker for introductory statistics teaching" talk useR2015.
WhiteFangBuck/strata-2016-singapore
rirwin/theft-market
Infrastructure for analyzing historical real estate data
blprnt/artapis
tonyreddy/Apache-MultiNode-Insatallation-Shellscript
Apache Hadoop MultiNode Insatallation Shellscript