spark-clusters
There are 17 repositories under the spark-clusters topic.
PiercingDan/spark-Jupyter-AWS
A guide on how to set up Jupyter with PySpark painlessly on AWS EC2 clusters, with S3 I/O support
airscholar/Japan-visa-data-engineering
This project provides end-to-end data processing and visualization of visa numbers in Japan using PySpark and Plotly. The Spark clusters are set up within a Docker container on Azure.
radanalyticsio/oshinko-cli
Command-line interface for a Spark cluster management app
bruler/kub-setup
Local Kubernetes-based ML setup
cameres/emr-spark-jupyter
Repository/Tutorial for initializing Jupyter Notebook and a Spark cluster on Amazon EMR
conema/spark-terraform
This project creates a Hadoop and Spark cluster on AWS with Terraform
s8sg/spark-py-submit
A Python library to submit Spark jobs to a YARN cluster on different distributions (currently CDH, HDP)
gioenn/sparkutils
A collection of scripts to easily start HDFS and Spark clusters
nikhilsu/Product-review-analysis-Spark-MongoDB
Performs various product review analyses on an Amazon dataset using Apache Spark and MongoDB
hypnosapos/sparknetes
Spark on Kubernetes PoCs
kthakore/spark-notebook-dsp-template
Template for Spark Data Science Projects
reddy-s/spark-container
Docker image to deploy a Spark cluster in containers
Seyzz/SparkScalaCluster
Research on setting up and using a Spark Standalone multi-node cluster.
surbhardwaj/AWS
Stuff done on AWS. Gathers the steps for creating a Spark cluster on EC2.
kumarvna/terraform-azurerm-hdinsight
Terraform module to create Azure HDInsight, a managed, full-spectrum, open-source analytics service. This module creates Apache Hadoop, Apache Spark, Apache HBase, Interactive Query (Apache Hive LLAP) and Apache Kafka clusters.
monyedavid/spark-cluster
Spark cluster management with Docker