nriyer
Director of Data Science @DataSociety www.datasociety.com ---Passionate about spreading the love for data science
Pinned Repositories
aas
Code to accompany Advanced Analytics with Spark from O'Reilly Media
apache-spark-sparkr-ml
A simple demo of implementing a Random Forest based classifier, using Spark R with Apache Spark, and same in Java. There are some issues that I see using Spark R in this type of activity, and using the Java MLLib is a superior solution .
awesome-cto
A curated and opinionated list of resources for Chief Technology Officers, with the emphasis on startups
awesome-leading-and-managing
Awesome List of resources on leading people and being a manager. Geared toward tech, but potentially useful to anyone.
awesome-readme
A curated list of awesome READMEs
courses
Course materials for the Data Science Specialization: https://www.coursera.org/specialization/jhudatascience/1
csvkit
A suite of utilities for converting to and working with CSV, the king of tabular file formats.
venga_practicum
nriyer's Repositories
nriyer/venga_practicum
nriyer/aas
Code to accompany Advanced Analytics with Spark from O'Reilly Media
nriyer/apache-spark-sparkr-ml
A simple demo of implementing a Random Forest based classifier, using Spark R with Apache Spark, and same in Java. There are some issues that I see using Spark R in this type of activity, and using the Java MLLib is a superior solution .
nriyer/awesome-cto
A curated and opinionated list of resources for Chief Technology Officers, with the emphasis on startups
nriyer/awesome-leading-and-managing
Awesome List of resources on leading people and being a manager. Geared toward tech, but potentially useful to anyone.
nriyer/awesome-readme
A curated list of awesome READMEs
nriyer/courses
Course materials for the Data Science Specialization: https://www.coursera.org/specialization/jhudatascience/1
nriyer/csvkit
A suite of utilities for converting to and working with CSV, the king of tabular file formats.
nriyer/project_1
big data project 1
nriyer/project_2_bigdata
nriyer/data-science-at-the-command-line
Data Science at the Command Line
nriyer/drip
Fast JVM launching without the hassle of persistent JVMs.
nriyer/enlighten-apply
Example code and materials that illustrate applications of SAS machine learning techniques.
nriyer/git
Git Source Code Mirror - This is a publish-only repository and all pull requests are ignored. Please follow Documentation/SubmittingPatches procedure for any of your improvements.
nriyer/golearn
Machine Learning for Go
nriyer/gwu-cloud-workshop
Materials for an Intro to Amazon Web Services
nriyer/homebrew
:beer: The missing package manager for OS X.
nriyer/httpie
Modern command line HTTP client – user-friendly curl alternative with intuitive UI, JSON support, syntax highlighting, wget-like downloads, extensions, etc. https://httpie.org
nriyer/indy
Looking at the subject of equity vs. equality in the field of education, using data from WorldBank
nriyer/NLP
This is repository includes text mining and natural language processing files and codes.
nriyer/project_3
nriyer/spark-csv
CSV data source for Spark SQL and DataFrames
nriyer/star_schema
nriyer/time_series_notebooks
A notebook to show the initial EDA of Time Series Data
nriyer/UBCadv-r
Note sharing for a discussion group around the Advanced R Programming book (http://adv-r.had.co.nz/)
nriyer/warehousing-course
Materials for GWU/GWSB ISTM 6211-11, Data Warehousing / OLAP