hadoop-mapreduce
There are 793 repositories under the hadoop-mapreduce topic.
mahmoudparsian/data-algorithms-book
MapReduce, Spark, Java, and Scala for Data Algorithms Book
bytedance/CloudShuffleService
Cloud Shuffle Service (CSS) is a general-purpose remote shuffle solution for compute engines, including Spark, Flink, and MapReduce.
touero/ctenopharyngodon-idella
Distributed crawling of data from all Chinese universities, using Hadoop and MapReduce.
groda/big_data
Tutorials on Big Data essentials: Hadoop, MapReduce, Spark.
vim89/datapipelines-essentials-python
Simplified ETL process in Hadoop using Apache Spark. Includes a complete ETL pipeline for a data lake: SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations.
maniram-yadav/Big_DataHadoop_Projects
Big data projects implemented by Maniram Yadav
seraogianluca/k-means-mapreduce
K-Means algorithm implementation with Hadoop and Spark for the Cloud Computing course of the MSc AIDE at the University of Pisa.
caizkun/mapreduce-examples
A collection of MapReduce problems and solutions
anjalysam/Hadoop
How to install Hadoop on Google Colab and how to run MapReduce jobs in Hadoop
jmaister/wordcount
Hadoop MapReduce word counting with Java
absnaik810/CloudComputing
Projects done in the Cloud Computing course.
jyzhangchn/FBDP-project2
Chinese text mining | Public opinion analysis | Hadoop | Java | MapReduce
arshdeepbahga/cloud-computing-solutions-architect-book-code
Source code for the examples in the book Cloud Computing Solutions Architect: A Hands-On Approach by Arshdeep Bahga and Vijay Madisetti
MoustafaAMahmoud/BigDataInDepth
Data Engineering Course
benedekh/bigdata-projects
Student projects in the Big Data field.
lucas91batista/twitter-hashtag-graph
Twitter + Flume + Hadoop (HDFS, MapReduce) + Neo4j + Python
pfisterer/apache-hadoop-helm
Helm chart for Apache Hadoop using multi-arch docker images
Keerthivasan13/CSCI572-Information_Retrieval_And_Web_Search_Engines
Search Engine projects
QiushiSun/Distributed-Computing-Systems
Spring 2021 (Distributed Computing Systems): Distributed Systems and Programming
James-QiuHaoran/distributed-computing-platform-mapreduce
This repository contains a simple Hadoop-like (MapReduce) distributed computing platform implemented in Java. It is extended from a course project at UIUC that was awarded best Java implementation, and it is open-sourced for reference.
SAKET-SK/Semester6-SPPU-Data-Analysis-Lab
I installed Hadoop on a virtual machine, and all assignments were performed on Ubuntu. Refer to this repo to complete the Hadoop assignments. A stable internet connection is recommended while working through them.
FirasKahlaoui/hadoop-docker-spark
Report: Docker-Hadoop installation and data analysis with Spark (Scala)
waltherg/distributable_docker_sql_on_hadoop
Toy Hadoop cluster combining various SQL-on-Hadoop variants
Areesha-Tahir/Hadoop-MapReduce-Sentiment-Analysis-Through-Keywords
A MapReduce program to conduct sentiment analysis of a keyword from a list of comments.
hyeonsangjeon/dataplatform
Hadoop 3.2 in single-node or cluster mode with the gotty web terminal, Spark, Jupyter with PySpark, Hive, and other ecosystem tools.
giovannigarifo/bigdata
Code samples, summaries, cheatsheets and other study material for Hadoop MapReduce and Apache Spark
pasqualesalza/elephant56
A Genetic Algorithms framework for Hadoop MapReduce.
guillaume6pl/mr_pagerank
Computing PageRank with Hadoop MapReduce
imsanjoykb/PySpark-Bootcamp
My practice and projects on PySpark
manasbundele/big-data-projects
A select few projects related to Big Data Analytics and Management. They range from small to large, and all are interesting ones.
shask9/Matrix-Multiplication-Hadoop
Hadoop MapReduce program to compute multiplication of two sparse matrices
suselong/bigData-30-Days
Big data study notes for complete beginners
LMAPcoder/Hadoop-on-Colab
Installation and configuration of Hadoop on Google Colaboratory
MariaDukmak/Hadopy
Easy parallel map-reduce command line tool