Pinned Repositories
big_data_for_chimps
A Seriously Fun guide to Big Data Analytics in Practice
datasets
Datasets that I generally use for trainings and workshops
Datasets-1
Machine learning datasets used in tutorials on MachineLearningMastery.com
Flight_delay_prediction_web_app
A big data web application to predict USA airline traffic delay with Python, Flask, Apache Spark, Kafka, MongoDB, ElasticSearch, d3.js, scikit-learn, MLlib and Apache Airflow.
free-programming-books
:books: Freely available programming books
lc
A list of 160+ leetcode questions grouped by their common patterns
nlp-datasets
Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)
SparkInternals
Notes talking about the design and implementation of Apache Spark
standards
Standards and guidelines at Gilt
SynapsePySparkWordCount
Create Spark Job Definition
Lambda ML's Repositories
LambdaML/big_data_for_chimps
A Seriously Fun guide to Big Data Analytics in Practice
LambdaML/big-data-code
Source code for Big Data: Principles and best practices of scalable realtime data systems
LambdaML/GameOfLife
Example project of "game of life" with Unity.
LambdaML/grafana-spark-dashboards
Scripts for generating Grafana dashboards for monitoring Spark jobs
LambdaML/hired-challenges
Coding challenges for Hired.com signup. They require completion of the first 2, then 1 of the final 3.
LambdaML/iOSPorts
A collection of libraries such as OpenSSL, Cyrus SASL, OpenLDAP, and PCRE which have been ported to the iPhone/iOS platform.
LambdaML/learning-spark-examples
Examples for learning spark
LambdaML/linkedin-zookeeper
This project provides utilities and wrappers around ZooKeeper
LambdaML/maven-profiling-logger
Profiling logger for Maven
LambdaML/nlptutorial
A Tutorial about Programming for Natural Language Processing
LambdaML/showthedocs
LambdaML/spark-pivot-examples
spark pivot examples
LambdaML/spark-streaming-testbed
Set of applications to test the performance of Spark Streaming
LambdaML/split-apply-combine
Presentation about the split-apply-combine strategy in Data Science and Python
LambdaML/syllabus
Syllabus for the Spring 2015 Data Engineering class at CU Boulder by Prof. Ken Anderson
LambdaML/zeppelin-authentication
Simple authentication for Zeppelin