sparkify
There are 12 repositories under sparkify topic.
brunowdev/sparkify
This is the final project for the Data Scientist Nanodegree, where our goal is to predict churn for a fictional streaming service called Sparkify.
abduygur/churn-prediction-using-spark
Churn Prediction using PySpark
alessiococchieri/BDA-project-sparkify
This Git repo showcases my analysis of Sparkify dataset with PySpark on Apache Spark cluster mode and JupyterLab on Docker. The goal was to identify at-risk customers and develop retention strategies. The analysis tested multiple machine learning models and uncovered insights into customer behavior and churn patterns.
fpcarneiro/Data-Warehouse
Students will build an ETL pipeline that extracts data from S3, stages them in Redshift, and transforms data into a set of dimensional tables for their analytics team.
SimplifyData/Cloud-Data-Warehouse-with-Redshift-AWS
Cloud Data Warehouse of Sparkify Data using Redshift
cdumen/Sparkify_Churn_Prediction
Sparkify project for predicting customer loyality.
fpcarneiro/data-lake
Udacity Data Engineer Nanodegree: Project Data Lake
fpcarneiro/Data-Modeling-with-Cassandra
Project: Data Modeling with Cassandra
Guli-Y/Sparkify-s3-Spark-s3
ETL script for reading data from s3, processing them using Spark and loading them back to s3 for data analysis team
Mcamin/User-Churn-Prediction
Data Analysis in Spark to Identify Customer Churn for a fictional music service.
Guli-Y/SparkifyRedshift
a ETL pipeline for extracting data from s3, staging themon Redshift and transforming them into fact and dimensional tables for song play analysis
pratikwatwani/ETL-pipeline-for-Sparkify
An ETL model designed using Postgres SQL for Sparkify database 🗄, modeling user activity data to create a database and ETL pipeline🔀 for a music streaming app 🎼.