Project for running a PySpark job, creating an EMR Serverless application, and scheduling an AWS Glue crawler.
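
As a rough illustration of how the three pieces could fit together, here is a minimal sketch using boto3. It is not taken from this project. The function names, resource names, the release label `emr-6.15.0`, the nightly cron schedule, and the separate IAM role ARNs are all assumptions made for the example. The builder functions only assemble request arguments, so they can be checked without AWS access; `deploy` is the only part that talks to AWS.

```python
"""Sketch: request payloads for EMR Serverless and a scheduled Glue crawler."""


def emr_application_request(name, release_label="emr-6.15.0"):
    # Keyword arguments for the emr-serverless create_application call.
    # The release label is an assumed example value.
    return {"name": name, "releaseLabel": release_label, "type": "SPARK"}


def spark_job_request(application_id, emr_role_arn, script_uri):
    # Keyword arguments for start_job_run; script_uri is the S3 location
    # of the PySpark entry point script.
    return {
        "applicationId": application_id,
        "executionRoleArn": emr_role_arn,
        "jobDriver": {"sparkSubmit": {"entryPoint": script_uri}},
    }


def crawler_request(name, glue_role_arn, database, s3_path,
                    schedule="cron(0 2 * * ? *)"):
    # Keyword arguments for glue create_crawler. The default schedule
    # (02:00 UTC daily) is an assumed example.
    return {
        "Name": name,
        "Role": glue_role_arn,
        "DatabaseName": database,
        "Targets": {"S3Targets": [{"Path": s3_path}]},
        "Schedule": schedule,
    }


def deploy(script_uri, emr_role_arn, glue_role_arn, database, s3_path):
    # Imported lazily so the builders above work without boto3 installed.
    import boto3

    emr = boto3.client("emr-serverless")
    glue = boto3.client("glue")

    # Hypothetical resource names, chosen for this example only.
    app = emr.create_application(**emr_application_request("pyspark-app"))
    app_id = app["applicationId"]
    emr.start_application(applicationId=app_id)
    emr.start_job_run(**spark_job_request(app_id, emr_role_arn, script_uri))
    glue.create_crawler(
        **crawler_request("output-crawler", glue_role_arn, database, s3_path)
    )
    return app_id
```

Keeping the payload builders separate from the AWS calls makes the configuration easy to review and unit test; the actual setup in this project may be organized differently.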