Pinned Repositories
bdutil
[DEPRECATED] Script used to manage Hadoop and Spark instances on Google Compute Engine
cloud-dataproc
Cloud Dataproc: Samples and Utils
custom-images
Tools for creating Dataproc custom images
flink-bigquery-connector
BigQuery integration to Apache Flink's Table API
hadoop-connectors
Libraries and tools for interoperability between Hadoop-related open-source software and Google Cloud Platform.
hive-bigquery-storage-handler
Hive Storage Handler for interoperability between BigQuery and Apache Hive
initialization-actions
Run in all nodes of your cluster before the cluster starts - lets you customize your cluster
jupyterhub-dataprocspawner
spark-bigquery-connector
BigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables.
spark-spanner-connector
Cloud Spanner Connector for Apache Spark
Google Cloud Dataproc's Repositories
GoogleCloudDataproc/initialization-actions
Run in all nodes of your cluster before the cluster starts - lets you customize your cluster
GoogleCloudDataproc/spark-bigquery-connector
BigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables.
GoogleCloudDataproc/hadoop-connectors
Libraries and tools for interoperability between Hadoop-related open-source software and Google Cloud Platform.
GoogleCloudDataproc/cloud-dataproc
Cloud Dataproc: Samples and Utils
GoogleCloudDataproc/bdutil
[DEPRECATED] Script used to manage Hadoop and Spark instances on Google Compute Engine
GoogleCloudDataproc/custom-images
Tools for creating Dataproc custom images
GoogleCloudDataproc/hive-bigquery-storage-handler
Hive Storage Handler for interoperability between BigQuery and Apache Hive
GoogleCloudDataproc/spark-spanner-connector
Cloud Spanner Connector for Apache Spark
GoogleCloudDataproc/flink-bigquery-connector
BigQuery integration to Apache Flink's Table API
GoogleCloudDataproc/jupyterhub-dataprocspawner
GoogleCloudDataproc/hive-bigquery-connector
A library enabling BigQuery as Hive storage handler
GoogleCloudDataproc/dataproc-jupyter-plugin
GoogleCloudDataproc/dataproc-jdbc-connector
GoogleCloudDataproc/dataprocmagic
GoogleCloudDataproc/spark-bigtable-connector
GoogleCloudDataproc/.allstar
GoogleCloudDataproc/.github
GoogleCloudDataproc/dataproc-spark-connect-python