vighneshanap

Azure Certified Data Engineer Associate based in London, UK

DataOrcLondon

Pinned Repositories

Build-a-Cloud-Data-Warehouse-Using-AWS-S3AndRedShift
In this project, we will build an ETL pipeline that extracts their data from S3, stages them in Redshift, and transforms data into a set of dimensional tables for their analytics team to continue finding insights in what songs their users are listening to.
Language:Jupyter Notebook10
Build-a-Data-Lake-Using-AWS-S3-Spark-Cluster
In this project, We'll build an ETL pipeline for a data lake. The data resides in S3, in a directory of JSON logs on user activity on the app, as well as a directory with JSON metadata on the songs in the app. We will load data from S3, process the data into analytics tables using Spark, and load them back into S3. Then deploy this Spark process on a cluster using AWS.
Language:Jupyter Notebook10
Data-Modeling-with-Apache-Cassandra
In this project, we’ll model user activity data for a music streaming app called Sparkify. Then create a noSQL database and ETL pipeline designed to optimize queries for understanding what songs users are listening to. We’ll model our data in Apache Cassandra to allow for specific queries provided by the analytics team at Sparky.
Language:Jupyter Notebook10
Data-Modeling-with-Postgres
In this project, we’ll model user activity data for a music streaming app called Sparkify. Then create a relational database and ETL pipeline designed to optimize queries for understanding what songs users are listening to. In PostgreSQL, we will also define Fact and Dimension tables and insert data into your new tables.
Language:Jupyter Notebook10
airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Language:Python00
COVID-19
Novel Coronavirus (COVID-19) Cases, provided by JHU CSSE
00
ML-Week-3-CyberSecurity
Language:Jupyter Notebook11
adversarial
Code and hyperparameters for the paper "Generative Adversarial Networks"
Language:Python00
d3
Bring data to life with SVG, Canvas and HTML. :bar_chart::chart_with_upwards_trend::tada:
Language:JavaScript00
Data-Pipelines-with-Airflow
In this project, we’ll work on the music streaming company’s data infrastructure by creating and automating a set of data pipelines. We’ll configure and schedule data pipelines with Airflow and monitor and debug production pipelines.
Language:Python00

vighneshanap's Repositories

vighneshanap/Data-Pipelines-with-Airflow
In this project, we’ll work on the music streaming company’s data infrastructure by creating and automating a set of data pipelines. We’ll configure and schedule data pipelines with Airflow and monitor and debug production pipelines.
Language:Python
vighneshanap/Build-a-Cloud-Data-Warehouse-Using-AWS-S3AndRedShift
In this project, we will build an ETL pipeline that extracts their data from S3, stages them in Redshift, and transforms data into a set of dimensional tables for their analytics team to continue finding insights in what songs their users are listening to.
Language:Jupyter Notebook1
vighneshanap/Build-a-Data-Lake-Using-AWS-S3-Spark-Cluster
In this project, We'll build an ETL pipeline for a data lake. The data resides in S3, in a directory of JSON logs on user activity on the app, as well as a directory with JSON metadata on the songs in the app. We will load data from S3, process the data into analytics tables using Spark, and load them back into S3. Then deploy this Spark process on a cluster using AWS.
Language:Jupyter Notebook1
vighneshanap/Data-Modeling-with-Apache-Cassandra
In this project, we’ll model user activity data for a music streaming app called Sparkify. Then create a noSQL database and ETL pipeline designed to optimize queries for understanding what songs users are listening to. We’ll model our data in Apache Cassandra to allow for specific queries provided by the analytics team at Sparky.
Language:Jupyter Notebook1
vighneshanap/Data-Modeling-with-Postgres
In this project, we’ll model user activity data for a music streaming app called Sparkify. Then create a relational database and ETL pipeline designed to optimize queries for understanding what songs users are listening to. In PostgreSQL, we will also define Fact and Dimension tables and insert data into your new tables.
Language:Jupyter Notebook1
vighneshanap/airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
vighneshanap/pandas
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
vighneshanap/COVID-19
Novel Coronavirus (COVID-19) Cases, provided by JHU CSSE
vighneshanap/image_classifier_template
This is the code for "Image Classifier Example" by Siraj Raval on Youtube
vighneshanap/ML-Week-3-CyberSecurity
Language:Jupyter Notebook11
vighneshanap/adversarial
Code and hyperparameters for the paper "Generative Adversarial Networks"
vighneshanap/ML-in-Finance
Stock Prediction
Language:Jupyter Notebook
vighneshanap/fastai-v3
Starter app for fastai v3 model deployment on Render
vighneshanap/image-classifier-tensorflowjs
Language:JavaScript
vighneshanap/keras-flask-deploy-webapp
:smiley_cat: Pretty&simple image classifier app template. Deploy your own trained model or pre-trained model (VGG, ResNet, Densenet) to a web app using Flask in 10 minutes.
Language:Python
vighneshanap/python-variables-readme-data-science-intro-000
Language:Jupyter Notebook
vighneshanap/python-practice-with-datatypes-data-science-intro-000
Language:Jupyter Notebook
vighneshanap/learn-co-sandbox
vighneshanap/DataCamp-Python-Programmer-Public
My DataCamp Certifications
Language:Jupyter Notebook
vighneshanap/flaskSaaS
A great starting point to build your SaaS in Flask & Python, with Stripe subscription billing 🚀
Language:Python
vighneshanap/h2o-tutorials
Tutorials and training material for the H2O Machine Learning Platform
Language:Jupyter Notebook
vighneshanap/d3
Bring data to life with SVG, Canvas and HTML. :bar_chart::chart_with_upwards_trend::tada:
Language:JavaScript
vighneshanap/Learn_Data_Science_in_3_Months
This is the Curriculum for "Learn Data Science in 3 Months" By Siraj Raval on Youtube
vighneshanap/pydata-book
Materials and IPython notebooks for "Python for Data Analysis" by Wes McKinney, published by O'Reilly Media
Language:Jupyter Notebook

vighneshanap

Pinned Repositories

Build-a-Cloud-Data-Warehouse-Using-AWS-S3AndRedShift

Build-a-Data-Lake-Using-AWS-S3-Spark-Cluster

Data-Modeling-with-Apache-Cassandra

Data-Modeling-with-Postgres

airflow

COVID-19

ML-Week-3-CyberSecurity

adversarial

d3

Data-Pipelines-with-Airflow

vighneshanap's Repositories

vighneshanap/Data-Pipelines-with-Airflow

vighneshanap/Build-a-Cloud-Data-Warehouse-Using-AWS-S3AndRedShift

vighneshanap/Build-a-Data-Lake-Using-AWS-S3-Spark-Cluster

vighneshanap/Data-Modeling-with-Apache-Cassandra

vighneshanap/Data-Modeling-with-Postgres

vighneshanap/airflow

vighneshanap/pandas

vighneshanap/COVID-19

vighneshanap/image_classifier_template

vighneshanap/ML-Week-3-CyberSecurity

vighneshanap/adversarial

vighneshanap/ML-in-Finance

vighneshanap/fastai-v3

vighneshanap/image-classifier-tensorflowjs

vighneshanap/keras-flask-deploy-webapp

vighneshanap/python-variables-readme-data-science-intro-000

vighneshanap/python-practice-with-datatypes-data-science-intro-000

vighneshanap/learn-co-sandbox

vighneshanap/DataCamp-Python-Programmer-Public

vighneshanap/flaskSaaS

vighneshanap/h2o-tutorials

vighneshanap/d3

vighneshanap/Learn_Data_Science_in_3_Months

vighneshanap/pydata-book