/dataproc-spark-processing

Contains PySpark jobs for batch processing from GCS to BigQuery and from GCS to GCS, plus a bash script that runs the end-to-end Dataproc process: creating a cluster, submitting the jobs, and deleting the cluster.
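
The create, submit, delete lifecycle described above can be sketched roughly as follows. This is a Python illustration, not the repository's bash script: the function name `run_pipeline`, the `runner` parameter, and the cluster name, region, and `gs://` path in the usage example are all hypothetical placeholders. The only things assumed from the real tooling are the `gcloud dataproc clusters create`, `gcloud dataproc jobs submit pyspark`, and `gcloud dataproc clusters delete` commands.

```python
import subprocess
from typing import Callable, List, Optional, Sequence


def run_pipeline(
    cluster: str,
    region: str,
    job_uris: Sequence[str],
    runner: Optional[Callable[[List[str]], object]] = None,
) -> None:
    """Create a Dataproc cluster, submit each PySpark job, then delete the cluster.

    `runner` receives each command as an argument list. By default it executes
    the command with subprocess; pass a different callable for a dry run.
    """
    if runner is None:
        # Default: actually invoke gcloud and fail fast on a non-zero exit code.
        runner = lambda cmd: subprocess.run(cmd, check=True)

    runner(["gcloud", "dataproc", "clusters", "create", cluster,
            f"--region={region}"])
    try:
        for uri in job_uris:
            runner(["gcloud", "dataproc", "jobs", "submit", "pyspark", uri,
                    f"--cluster={cluster}", f"--region={region}"])
    finally:
        # Always tear the cluster down, even if a job failed, to avoid
        # paying for an idle cluster. --quiet skips the confirmation prompt.
        runner(["gcloud", "dataproc", "clusters", "delete", cluster,
                f"--region={region}", "--quiet"])


if __name__ == "__main__":
    # Dry run: print the commands instead of executing them.
    # All names below are hypothetical placeholders.
    run_pipeline(
        cluster="demo-cluster",
        region="us-central1",
        job_uris=["gs://example-bucket/jobs/gcs_to_bigquery.py"],
        runner=lambda cmd: print(" ".join(cmd)),
    )
```

Wrapping the job submissions in `try`/`finally` is one design choice among several; it ensures the delete step runs even when a job exits with an error.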

Primary Language: Python
