Pinned Repositories
airflow-pyspark-emr
This project demonstrates how to process data stored in a data lake, transforming it into an OLAP-optimized structure with PySpark. The PySpark job runs on AWS EMR, and the data pipeline is orchestrated by Apache Airflow, including infrastructure creation and EMR cluster termination.
airflow-tutorial
Apache Airflow tutorial
apache_airflow_on_eks
datasprints-open-spaces
Repository for the code demoed in the talk
DeepLearning101Keras
Movile Tech Meetup - 03/07/2019 - Rio de Janeiro - Presentation
delta-lake-on-glue-quickstart
This is a quick start guide for the Delta Lake (delta.io) Python Spark connector, running on AWS Glue.
ExploringLego
Data Analysis Project using Python Pandas
flame
Flame :fire: Opinionated Flask & MongoDB backend boilerplate.
hudi-on-glue-quick-start
AWS Glue PySpark - Apache Hudi Quick Start Guide
PythonImageUtils
GabrielAmazonas's Repositories
GabrielAmazonas/airflow-pyspark-emr
This project demonstrates how to process data stored in a data lake, transforming it into an OLAP-optimized structure with PySpark. The PySpark job runs on AWS EMR, and the data pipeline is orchestrated by Apache Airflow, including infrastructure creation and EMR cluster termination.
GabrielAmazonas/hudi-on-glue-quick-start
AWS Glue PySpark - Apache Hudi Quick Start Guide
GabrielAmazonas/datasprints-open-spaces
Repository for the code demoed in the talk
GabrielAmazonas/flame
Flame :fire: Opinionated Flask & MongoDB backend boilerplate.
GabrielAmazonas/delta-lake-on-glue-quickstart
This is a quick start guide for the Delta Lake (delta.io) Python Spark connector, running on AWS Glue.
GabrielAmazonas/aws-cft-samples
Sample AWS CloudFormation template configurations in YAML
GabrielAmazonas/code-pipeline-python-test
GabrielAmazonas/pyspark-emr-s3-datalake
GabrielAmazonas/airflow-dags
GabrielAmazonas/airflow-eks-helm-chart
Airflow helm chart for AWS EKS
GabrielAmazonas/airflow2-lambda-ingestion
Using Apache Airflow for Integration with AWS Lambda - Examples for ETL Workloads
GabrielAmazonas/argon-dashboard
Argon - Dashboard for Bootstrap 5 by Creative Tim
GabrielAmazonas/aws-dms-user-guide
The open source version of the AWS DMS docs. You can submit feedback & requests for changes by submitting issues in this repo or by making proposed changes & submitting a pull request.
GabrielAmazonas/azure-sql-read-replica-terraform
Sample IaC with Terraform for Azure SQL Database, with Hyperscale secondary replicas
GabrielAmazonas/dbt-poc
GabrielAmazonas/docs
AWS Amplify Documentation
GabrielAmazonas/docs-1
The open-source repo for docs.github.com
GabrielAmazonas/fiap-cartola-notebooks
GabrielAmazonas/fiap-challenge-1
GabrielAmazonas/FIAP-Challenge-1-Answers
GabrielAmazonas/fiap-spark-scratches
GabrielAmazonas/glue-iceberg-quickstart
GabrielAmazonas/material-dashboard-react
React version of Material Dashboard by Creative Tim
GabrielAmazonas/next.js
The React Framework
GabrielAmazonas/puc-rio-mvp-sp1
MVP for PUC RIO's Data Science & Analytics - Sprint 1
GabrielAmazonas/pucrio-dsa-dataengineering
PUC Rio - Data Science & Analytics Specialization - Data Engineering Sprint Project - Weightlifting Olympic History
GabrielAmazonas/pucrio-mvp-sp2
GabrielAmazonas/slim-bigdata-docker
Adapted from fabiogjardim/bigdata_docker, this project creates a docker environment with the following applications: Spark, Jupyter, Hadoop, HDFS and Hive.
GabrielAmazonas/terraform-cdk
Define infrastructure resources using programming constructs and provision them using HashiCorp Terraform
GabrielAmazonas/tradeshift-triangle
Tradeshift's Challenge