Pinned Repositories
AppAcHomeworks-2021
dbt-learn-on-demand
dbt fundamentals training course: https://courses.getdbt.com/courses/fundamentals
espn-ffb-historical-records-etl
Airflow DAG to run ETL process to populate a BigQuery db with historical records from the Jayhawk Keeper League fantasy football league. Pipeline also outputs data as a CSV delivered via email.
li-learning-terraform-3087701
This repo is for the Linkedin Learning course: Learning Terraform
reddit-api-pipeline
An ELT pipeline to pull post data from Reddit's r/dataengineering subreddit and push to S3 and Snowflake. Once in Snowflake, data is then transformed via dbt (not orchestrated in these scripts)
sparkify-airflow-etl
Airflow DAG to run ETL process to populate Redshift db with Sparkify data from S3 data sources
sparkify-s3-datalake
An ETL pipeline to process data via Spark and create a S3 datalake for (fictional) music app Sparkify with data on song/artist/etc. entities and user listening behavior. Project from Udacity's Data Eng nanodegree program.
sparkifydb-apachecassandra
Create an Apache Cassandra db and an ETL pipeline to populate the db with user behavior data from a (fictional) Sparkify music streaming app. Project from Udacity's Data Engineering Nanodegree program
sparkifydb-postgres
An ETL pipeline to create + populate a Postgres db named sparkifydb for (fictional) music app with data on song/artist/etc. entities and user listening behavior. Project from Udacity's Data Eng nanodegree program
sparkifydb-redshift
An ETL pipeline to create + populate a Redshift db for (fictional) music app Sparkify with data on song/artist/etc. entities and user listening behavior. Project from Udacity's Data Eng nanodegree program
mimoyer21's Repositories
mimoyer21/espn-ffb-historical-records-etl
Airflow DAG to run ETL process to populate a BigQuery db with historical records from the Jayhawk Keeper League fantasy football league. Pipeline also outputs data as a CSV delivered via email.
mimoyer21/AppAcHomeworks-2021
mimoyer21/dbt-learn-on-demand
dbt fundamentals training course: https://courses.getdbt.com/courses/fundamentals
mimoyer21/li-learning-terraform-3087701
This repo is for the Linkedin Learning course: Learning Terraform
mimoyer21/reddit-api-pipeline
An ELT pipeline to pull post data from Reddit's r/dataengineering subreddit and push to S3 and Snowflake. Once in Snowflake, data is then transformed via dbt (not orchestrated in these scripts)
mimoyer21/sparkify-airflow-etl
Airflow DAG to run ETL process to populate Redshift db with Sparkify data from S3 data sources
mimoyer21/sparkify-s3-datalake
An ETL pipeline to process data via Spark and create a S3 datalake for (fictional) music app Sparkify with data on song/artist/etc. entities and user listening behavior. Project from Udacity's Data Eng nanodegree program.
mimoyer21/sparkifydb-apachecassandra
Create an Apache Cassandra db and an ETL pipeline to populate the db with user behavior data from a (fictional) Sparkify music streaming app. Project from Udacity's Data Engineering Nanodegree program
mimoyer21/sparkifydb-postgres
An ETL pipeline to create + populate a Postgres db named sparkifydb for (fictional) music app with data on song/artist/etc. entities and user listening behavior. Project from Udacity's Data Eng nanodegree program
mimoyer21/sparkifydb-redshift
An ETL pipeline to create + populate a Redshift db for (fictional) music app Sparkify with data on song/artist/etc. entities and user listening behavior. Project from Udacity's Data Eng nanodegree program