dataproc-clusters
There are 7 repositories under dataproc-clusters topic.
dwaiba/dataproc-terraform
Dataproc Customisable HA cluster debian-9 with zookeeper,kafka ,BigQuery and other tools/jobs with Terraform
AnveshaM/Enhancing-performance-of-big-data-machine-learning-models-on-Google-Cloud-Platform
The project is focused on parallelising pre-processing, measuring and machine learning in the cloud, as well as the evaluation and analysis of the cloud performance.
lucianocoelho-28/dio-desafio-dataproc-gcp
Digital Innovation One - Desafio GCP Dataproc. O desafio consiste em efetuar um processamento de dados utilizando o produto Dataproc do GCP. Esse processamento irá efetuar a contahem das palavras de um livro e informar quantas vezes cada palavra aparece no mesmo.
liang-sarah/voter-turnout
Logistic regression modeling of swing state voter turnout to support U.S. political campaign proposals
Tinmarian/Airflow2.0-De-0-a-Heroe
Repositorio para realizar el curso en Udemy llamado "Airflow2.0 De 0 a Héroe", de la academia "Datapath".
redvg/dataproc-pyspark-mapreduce
GCP Dataproc mapreduce sample with PySpark
redvg/dataproc-pyspark-monte-carlo
Monte Carlo simulations with PySpark on GCP Cloud Dataproc clusters