hadoop-cluster
There are 155 repositories under the hadoop-cluster topic.
big-data-europe/docker-hadoop
Apache Hadoop docker image
groda/big_data
Big Data essentials: Hadoop, MapReduce, Spark. Explore tutorials and demos in Jupyter notebooks—most are self-contained and live, ready to run with a click.
Impetus/jumbune
Jumbune is an open source big data APM and data quality management platform for data clouds. The enterprise feature offering is available at http://jumbune.com. More details of the open source offering are at,
Segence/docker-hadoop
A Docker container with a full Hadoop cluster setup with Spark and Zeppelin
sergevs/ansible-cloudera-hadoop
Ansible playbook to deploy Cloudera Hadoop components to the cluster
Wittline/apache-spark-docker
Dockerizing an Apache Spark Standalone Cluster
rainmaple/WIFI_BussinessBigDataAnalyseSystem
A system designed to analyse big data collected from Wi-Fi probes
hokstack/hok-helm
HokStack - Run Hadoop Stack on Kubernetes
hadoop-sandbox/hadoop-sandbox
A fully functional Hadoop YARN cluster as a docker-compose deployment.
mikeroyal/Apache-Ignite-Guide
Apache Ignite Guide
waltherg/distributable_docker_sql_on_hadoop
Toy Hadoop cluster combining various SQL-on-Hadoop variants
hyeonsangjeon/dataplatform
Hadoop 3.2 in single or cluster mode with the gotty web terminal, Spark, Jupyter with PySpark, Hive, and other ecosystem components.
manuparra/MasterDegreeCC_Practice
Workshop for the UGR Professional Master's in Computer Science. Cloud Computing course.
lyingbo/hadoop-cluster-docker
Run Hadoop Cluster within Docker Containers
pfisterer/apache-knox-docker
Dockerfile for running Apache Knox (http://knox.apache.org/) in Docker
hadoop-sandbox/hadoop-sandbox-images
Docker image builds for Hadoop sandbox.
MitaliBhiwande/Clustering-Algorithms
Collection of various clustering algorithms, including K-means, HAC, and DBSCAN. Also includes a Hadoop MapReduce implementation of the K-means algorithm.
Shwetabhdixit/Hadoop-2.7.3-Installation-Guide-for_windows
A storage reference to a comprehensive guide on installing Hadoop on Windows
aimanamri/raspberry-pi4-hadoop-spark-cluster
Self-documentation of learning distributed data storage, parallel processing, and Linux OS using Apache Hadoop, Apache Spark, and Raspbian OS. The project sets up a 3-node cluster on Raspberry Pi 4 boards, installs HDFS, and runs Spark processing jobs via YARN.
HxnDev/Finding-Average-Temperature-of-Each-Year-using-Hadoop-HDFS
In this task, we had to calculate the average temperature for each year from the given dataset using Hadoop HDFS. We had to create a MapReduce function to perform this task.
jinho-yoo-jack/HadoopCluster
Based on Docker
MengmSun/hadoop-in-docker
Hadoop cluster in Docker, created with docker-compose. Create a Hadoop cluster in less than 5 minutes.
AnalyticsApps/LogAnalyzer
Analyses customer logs for big data components such as HDFS, Hive, HBase, Yarn, MapReduce, Storm, Spark, Spark 2, Knox, Ambari Metrics, Nifi, Accumulo, Kafka, Flume, Oozie, Falcon, Atlas & Zookeeper.
balajic06/Big_Data
This project shows how to perform spatio-temporal hot-spot analysis using Apache Spark.
chriskery/hadoop-operator
Kubernetes operator for managing the lifecycle of Apache Hadoop Yarn Tasks on Kubernetes.
malabz/HAlign-2
A multiple sequence alignment tool
mitre/clusterconf
Manage Hadoop cluster configurations
PacktPublishing/Big-Data-Processing-with-Hadoop---A-Complete-Reference-Guide
Design, build, and execute effective big data strategies with advanced Hadoop concepts
roboxue/YarnVision
UI for Hadoop Resource Manager
tugrulhkarabulut/hadoop-movie-rating-prediction
Movie rating prediction application
vitobellini/bigdata-cluster
BigData Cluster with Docker
AsmaZgo/distribution_and_scripts
A repository of scripts that help create a distributed big data ecosystem on the Grid5000 platform.
conema/spark-terraform
This project creates a Hadoop and Spark cluster on Amazon AWS with Terraform
huy-dataguy/HadoopSphere
Containerized Hadoop cluster with Spark, Hive, Pig, HBase, and Zookeeper for scalable Big Data processing using Docker.
mitre/webhdfs
Interface with WebHDFS Service in a Cluster-Neutral Way
peyaa/bigdata-platform-on-k8s
Deploy a big data platform on Kubernetes