okzapradhana/dataproc-spark-processing
Contains PySpark jobs to do batch processing from GCS to BigQuery & GCS to GCS and also bash script to perform end to end Dataproc process from creating cluster, submitting jobs and delete cluster.
Python
Contains PySpark jobs to do batch processing from GCS to BigQuery & GCS to GCS and also bash script to perform end to end Dataproc process from creating cluster, submitting jobs and delete cluster.
Python