Pinned Repositories
activity-events-calendar
airflow-aws-sparkify
An ETL pipeline on Airflow that extracts files from S3, loads them into staging tables in Redshift, then transforms the data and loads the results back into Redshift
amadeus-challenge-python
Technical exercise proposed for Amadeus interview process.
amadeus-challenge-scala
Technical exercise proposed for Amadeus interview process. (First contact with Scala & Spark)
apache-beam-101
Playing around with Apache Beam in Python
apache-iceberg-playground
sparkify-etl-postgresql
ETL from data sources stored in JSON to a PostgreSQL database. Written in Python
miguruiz's Repositories
miguruiz/amadeus-challenge-scala
Technical exercise proposed for Amadeus interview process. (First contact with Scala & Spark)
miguruiz/sparkify-etl-postgresql
ETL from data sources stored in JSON to a PostgreSQL database. Written in Python
miguruiz/activity-events-calendar
miguruiz/airflow-aws-sparkify
An ETL pipeline on Airflow that extracts files from S3, loads them into staging tables in Redshift, then transforms the data and loads the results back into Redshift
miguruiz/amadeus-challenge-python
Technical exercise proposed for Amadeus interview process.
miguruiz/apache-beam-101
Playing around with Apache Beam in Python
miguruiz/apache-iceberg-playground
miguruiz/assignment-booking-analysis
Assignment for data engineering candidates to create a tool to provide commercial insights based on booking data
miguruiz/devops-intro-project
Project files for Intro to DevOps class
miguruiz/machine-learning-factory-demand
Uses the time-series prediction library Prophet, NLP, and other machine learning algorithms to predict the demand of a factory.
miguruiz/docker-ci-cd-playground
Containerized frontend, built with Docker, that is tested every time a PR is opened or modified. Once the tests pass and the PR is merged to master, the app is deployed to AWS Elastic Beanstalk. The automation is done through GitHub Actions.
miguruiz/docker-sbt
miguruiz/immigration-etl
Drafting an ETL in Jupyter notebooks that cleans and models US immigration data
miguruiz/kafka-chicago-trains
miguruiz/language-markdown
Add support for Markdown to Atom (including GitHub Flavored, Markdown Extra, CriticMark, YAML/TOML front-matter, and R Markdown), and smart behavior to lists.
miguruiz/odin-recipes
miguruiz/ohmyzsh-espanso-and-aliases
Custom version of the aliases plugin for enriched, combined with common-aliases
miguruiz/rdf-to-property-parser
Parser for RDF data to property
miguruiz/robot-java
Line-follower robot with speech recognition to identify where to stop. Developed for Lego NXT
miguruiz/spark-streaming-seniors
miguruiz/sparkify-aws-emr-pyspark
Testing PySpark on AWS EMR with the Sparkify dataset
miguruiz/sparkify-etl-aws-redshift
ETL from data sources stored in JSON to an AWS Redshift database. Written in Python
miguruiz/sparking-curiosity
Presentation about Spark
miguruiz/sql-spark-connector
Apache Spark Connector for SQL Server and Azure SQL
miguruiz/terraform-playground
Playing with Terraform