This repository contains a working example of how to set up PySpark Glue jobs that perform upserts on an AWS data lake, using Apache Hudi for the table format and Terraform for infrastructure as code (IaC).
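
As a rough illustration of what such an upsert involves, the sketch below builds the core Hudi write options and applies them through the generic Spark DataFrame writer. It is not taken from this repository's jobs: the table name, column names, and target path are hypothetical placeholders, and a job that uses the Glue connector may write through a different API (see the article linked in this README).

```python
def hudi_upsert_options(table_name, record_key, precombine_field, partition_field):
    """Build a minimal set of Hudi write options for an upsert.

    All argument values are supplied by the caller; the names used in the
    usage example below are hypothetical.
    """
    return {
        "hoodie.table.name": table_name,
        # Column that uniquely identifies a record.
        "hoodie.datasource.write.recordkey.field": record_key,
        # When two rows share a key, the row with the larger value here wins.
        "hoodie.datasource.write.precombine.field": precombine_field,
        "hoodie.datasource.write.partitionpath.field": partition_field,
        # "upsert" updates existing keys and inserts new ones.
        "hoodie.datasource.write.operation": "upsert",
        "hoodie.datasource.write.table.type": "COPY_ON_WRITE",
    }


def upsert(df, target_path, options):
    """Write a Spark DataFrame to a Hudi table at target_path.

    Assumes the Hudi libraries are available to the Spark session.
    Hudi upserts use save mode "append".
    """
    df.write.format("hudi").options(**options).mode("append").save(target_path)


# Hypothetical usage inside a job, where `df` is an existing DataFrame:
# opts = hudi_upsert_options("orders", "order_id", "updated_at", "order_date")
# upsert(df, "s3://my-bucket/hudi/orders/", opts)
```

The precombine field matters for upserts: when the incoming batch holds several rows with the same record key, Hudi keeps the one with the greatest precombine value.
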
- Read this article: https://aws.amazon.com/pt/blogs/big-data/writing-to-apache-hudi-tables-using-aws-glue-connector/
- In the AWS Glue Studio console, search for "AWS Glue Connector for Apache Hudi" and choose the AWS Glue Connector for Apache Hudi link.