Pinned Repositories
airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
apache-spark-examples
Apache Spark Examples
awesome-machine-learning
A curated list of awesome Machine Learning frameworks, libraries and software.
aws-image-upload
Image Upload to AWS S3
CECL
chef-cookbook-spark
A Chef cookbook for deploying Spark
code_snippets
hadoop-book
Example source code accompanying O'Reilly's "Hadoop: The Definitive Guide" by Tom White
Spark-Summit-June-2016
SparkContents
Books, Research Papers, Videos, Presentations and Certification for BigData
erdcpatel's Repositories
erdcpatel/credit-risk-modelling
Credit risk analysis using Python and ML
erdcpatel/dr-elephant
Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark
erdcpatel/machine-learning
Notebooks with machine learning examples
erdcpatel/TensorFlow-Examples
TensorFlow Tutorial and Examples for beginners
erdcpatel/Linear-Regression-Workshop
This is the code for the "Introduction to Data Science and How to Do Linear Regression" workshop session by Jalaj Thanaki at IIT-Bombay on 24th August, 2017
erdcpatel/spark-two-migration
erdcpatel/spark-summit-2017-SanFrancisco
Spark Summit 2017 San Francisco
erdcpatel/spark-summit-east-2017
erdcpatel/spark-py-notebooks
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
erdcpatel/Probabilistic-Programming-and-Bayesian-Methods-for-Hackers
aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)
erdcpatel/awesome-machine-learning
A curated list of awesome Machine Learning frameworks, libraries and software.
erdcpatel/spark2.0-examples
Examples of Spark 2.0
erdcpatel/high-performance-spark-examples
Examples for High Performance Spark
erdcpatel/databricks-spark-training
erdcpatel/mastering-apache-spark-book
Towards mastery of Apache Spark 2.0
erdcpatel/github-cheat-sheet
A list of cool features of Git and GitHub.
erdcpatel/spark-workshop
Project to get you prepared software-wise for the Spark Workshop
erdcpatel/scala-workshop
Scala Workshop (mostly notes before they shape well)
erdcpatel/Spark-Summit-June-2016
erdcpatel/apache-spark-examples
Apache Spark Examples
erdcpatel/data-science-your-way
Ways of doing Data Science Engineering and Machine Learning in R and Python
erdcpatel/hadoop-book
Example source code accompanying O'Reilly's "Hadoop: The Definitive Guide" by Tom White
erdcpatel/spark-parquet-decimal-bug
Demonstrates that parquet silently discards values in some cases
erdcpatel/spark-parquet-nested-types
erdcpatel/learning-spark-examples
Examples for learning Spark
erdcpatel/tpch-spark
TPC-H queries in Spark SQL using the native DataFrames API
erdcpatel/spark-cs100.1x
Coursework for CS100.1x, Introduction to Big Data with Apache Spark
erdcpatel/structured_data_processing_spark_sql
Code and setup information for the "Structured data processing with Spark SQL" session
erdcpatel/fastdataprocessingwithsparkexamples
Examples for Fast Data Processing with Spark
erdcpatel/chef-cookbook-spark
A Chef cookbook for deploying Spark