Pinned Repositories
Build-a-Cloud-Data-Warehouse-Using-AWS-S3AndRedShift
In this project, we will build an ETL pipeline that extracts their data from S3, stages them in Redshift, and transforms data into a set of dimensional tables for their analytics team to continue finding insights in what songs their users are listening to.
Build-a-Data-Lake-Using-AWS-S3-Spark-Cluster
In this project, We'll build an ETL pipeline for a data lake. The data resides in S3, in a directory of JSON logs on user activity on the app, as well as a directory with JSON metadata on the songs in the app. We will load data from S3, process the data into analytics tables using Spark, and load them back into S3. Then deploy this Spark process on a cluster using AWS.
Data-Modeling-with-Apache-Cassandra
In this project, we’ll model user activity data for a music streaming app called Sparkify. Then create a noSQL database and ETL pipeline designed to optimize queries for understanding what songs users are listening to. We’ll model our data in Apache Cassandra to allow for specific queries provided by the analytics team at Sparky.
Data-Modeling-with-Postgres
In this project, we’ll model user activity data for a music streaming app called Sparkify. Then create a relational database and ETL pipeline designed to optimize queries for understanding what songs users are listening to. In PostgreSQL, we will also define Fact and Dimension tables and insert data into your new tables.
airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
COVID-19
Novel Coronavirus (COVID-19) Cases, provided by JHU CSSE
ML-Week-3-CyberSecurity
adversarial
Code and hyperparameters for the paper "Generative Adversarial Networks"
d3
Bring data to life with SVG, Canvas and HTML. :bar_chart::chart_with_upwards_trend::tada:
Data-Pipelines-with-Airflow
In this project, we’ll work on the music streaming company’s data infrastructure by creating and automating a set of data pipelines. We’ll configure and schedule data pipelines with Airflow and monitor and debug production pipelines.
vighneshanap's Repositories
vighneshanap/Data-Pipelines-with-Airflow
In this project, we’ll work on the music streaming company’s data infrastructure by creating and automating a set of data pipelines. We’ll configure and schedule data pipelines with Airflow and monitor and debug production pipelines.
vighneshanap/Build-a-Cloud-Data-Warehouse-Using-AWS-S3AndRedShift
In this project, we will build an ETL pipeline that extracts their data from S3, stages them in Redshift, and transforms data into a set of dimensional tables for their analytics team to continue finding insights in what songs their users are listening to.
vighneshanap/Build-a-Data-Lake-Using-AWS-S3-Spark-Cluster
In this project, We'll build an ETL pipeline for a data lake. The data resides in S3, in a directory of JSON logs on user activity on the app, as well as a directory with JSON metadata on the songs in the app. We will load data from S3, process the data into analytics tables using Spark, and load them back into S3. Then deploy this Spark process on a cluster using AWS.
vighneshanap/Data-Modeling-with-Apache-Cassandra
In this project, we’ll model user activity data for a music streaming app called Sparkify. Then create a noSQL database and ETL pipeline designed to optimize queries for understanding what songs users are listening to. We’ll model our data in Apache Cassandra to allow for specific queries provided by the analytics team at Sparky.
vighneshanap/Data-Modeling-with-Postgres
In this project, we’ll model user activity data for a music streaming app called Sparkify. Then create a relational database and ETL pipeline designed to optimize queries for understanding what songs users are listening to. In PostgreSQL, we will also define Fact and Dimension tables and insert data into your new tables.
vighneshanap/airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
vighneshanap/pandas
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
vighneshanap/COVID-19
Novel Coronavirus (COVID-19) Cases, provided by JHU CSSE
vighneshanap/image_classifier_template
This is the code for "Image Classifier Example" by Siraj Raval on Youtube
vighneshanap/ML-Week-3-CyberSecurity
vighneshanap/adversarial
Code and hyperparameters for the paper "Generative Adversarial Networks"
vighneshanap/ML-in-Finance
Stock Prediction
vighneshanap/fastai-v3
Starter app for fastai v3 model deployment on Render
vighneshanap/image-classifier-tensorflowjs
vighneshanap/keras-flask-deploy-webapp
:smiley_cat: Pretty&simple image classifier app template. Deploy your own trained model or pre-trained model (VGG, ResNet, Densenet) to a web app using Flask in 10 minutes.
vighneshanap/python-variables-readme-data-science-intro-000
vighneshanap/python-practice-with-datatypes-data-science-intro-000
vighneshanap/learn-co-sandbox
vighneshanap/DataCamp-Python-Programmer-Public
My DataCamp Certifications
vighneshanap/flaskSaaS
A great starting point to build your SaaS in Flask & Python, with Stripe subscription billing 🚀
vighneshanap/h2o-tutorials
Tutorials and training material for the H2O Machine Learning Platform
vighneshanap/d3
Bring data to life with SVG, Canvas and HTML. :bar_chart::chart_with_upwards_trend::tada:
vighneshanap/Learn_Data_Science_in_3_Months
This is the Curriculum for "Learn Data Science in 3 Months" By Siraj Raval on Youtube
vighneshanap/pydata-book
Materials and IPython notebooks for "Python for Data Analysis" by Wes McKinney, published by O'Reilly Media