google-cloud-dataproc
There are 12 repositories under google-cloud-dataproc topic.
kubeflow/spark-operator
Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
GoogleCloudPlatform/flink-on-k8s-operator
[DEPRECATED] Kubernetes operator for managing the lifecycle of Apache Flink and Beam applications.
GoogleCloudDataproc/initialization-actions
Run in all nodes of your cluster before the cluster starts - lets you customize your cluster
GoogleCloudDataproc/spark-bigquery-connector
BigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables.
GoogleCloudDataproc/hadoop-connectors
Libraries and tools for interoperability between Hadoop-related open-source software and Google Cloud Platform.
GoogleCloudDataproc/cloud-dataproc
Cloud Dataproc: Samples and Utils
GoogleCloudDataproc/custom-images
Tools for creating Dataproc custom images
kumgaurav/BigQuerySpark3Scala12
A sample demo to check latest spark, big query connector and scala 2.12
Eu-Bitwise/spark-json-streaming
Streaming JSON data to Spark or Google Cloud Dataproc.
jonathanAmancioSales/Hadoop_Dataproc_Google_Cloud_Platform_DIO
Projeto do Curso "Criando um Ecossistema Hadoop Totalmente Gerenciado com Google Cloud Dataproc" do Bootcamp Data Engineer da Digital Innovation One
VagnerBellacosa/030_CriandoUmEcossistemaHadoopTotalmenteGerenciadoComGoogleCloudDataproc
Sua missão será criar um ecossistema de Big Data usando o Google Cloud Platform (GCP). Para isso, o expert te ensinará a configurar o Google Cloud Dataproc, um Hadoop totalmente gerenciado, usando seus créditos gratuitos da GCP.