gcp-dataproc
There are 14 repositories under the gcp-dataproc topic.
prakashdontaraju/google-cloud-ecommerce
ecommerce GCP Streaming pipeline ― Cloud Storage, Compute Engine, Pub/Sub, Dataflow, Apache Beam, BigQuery and Tableau; GCP Batch pipeline ― Cloud Storage, Dataproc, PySpark, Cloud Spanner and Tableau
askmrsinh/spark-stocksim
Monte Carlo stock simulation using Apache Spark.
emanuelegiona/CC2019
Project for Cloud Computing course (A.Y. 2018/2019)
prodriguezdefino/dataproc-workflowtemplate-cloudfunction
Implements a work queue for Dataproc Workflow Template executions
tansudasli/spark-sandbox
Apache Spark sandbox on GCP and Amazon EMR.
aeronaut2001/Car-Insurance-Cold-Calls-Data-Analysis
Car Insurance Cold Calls Data Analysis using Apache Hive
aeronaut2001/Marketing-Campaign-Data-Analysis
Marketing Campaign Data Analysis Using Apache Spark (PySpark)
aeronaut2001/Movie-Rating-Analysis
Movie Rating Analysis using Apache Spark (PySpark)
bug-data/Big_Data_First_Project
First project for Big Data course held at Roma Tre University
ElhNour/large-scale-data-management-spark
Process large amounts of data and implement complex data analyses using Spark. The dataset, made available by Google, includes data about a cluster of 12500 machines and the activity on this cluster over 29 days.
nrohit78/PigHive_StackExhangeData
Data is fetched from StackExchange, transformed using Pig, queried and stored in Hive. Additionally, the TF-IDF of the top 10 users is calculated using Hive.
RickLeite/Hadoop-Google-DataProc-DIOstudy
Hadoop Google DataProc DIO study
visalvo/projectScalable
Project for Scalable and Cloud Programming Course - 2018/19 UNIBO.