rdds
There are 13 repositories under rdds topic.
roshankoirala/pySpark_tutorial
Implementation of Spark code in Jupyter notebook. Topics include: RDDs and DataFrame, exploratory data analysis (EDA), handling multiple DataFrames, visualization, Machine Learning
akshitvjain/realtime-twitter-trends-analytics
A big data project to develop a real-time data pipeline for analyzing the popularity and sentiments of trending topics on Twitter.
aiwithqasim/pyspark_bigdata
Getting started with PySpark for Big data analysis
TrainingByPackt/Big-Data-Processing-with-Apache-Spark-eLearning
Efficiently tackle large datasets and perform big data analysis with Spark and Python
Ayoub-etoullali/Activites-Pratiques-BigData
MapReduce Job Development, RDDs Programming, Medical Data Management, Sales Analysis, And Efficient Data Integration For Big Data Analysis. Spark: Big Data Processing, SQOOP Integration, And Spark Structured Streaming For Real-Time Data.
thiagoneye/course-pyspark
Pyspark studies.
AjmalSarwary/IoT---assignment-IBM-Data-Science-Specialization
This assignment was part of an IoT motion sensor App running on a watch, predicting actions of the individual wearing the watch based on his arm movements; this IoT Analytics assignments is one of a series of data pipeline coding challenges in the IBM course Scalable Data Science.
DavideAG/BigData
Spark, RDDs and Map Reduce applications related to the BigData @Polito course (2019-2020). A set of personal notes are already provided.
drewm8080/data_mining_spark_rdds
Data Mining using Spark Rdds
lakshay2k/Spark_Playground
Here I play with the services offered by Apache Spark and try to learn them in more depth.
mdarm/map-reduce-project
Project on MapReduce for the Μ111 - Big Data Management course, NKUA, Spring 2023.
quadrantofsola/PySpark_RDD
Analysis of Clinical Trial Dataset using PySpark RDD implementation.
Thanaraklee/PySpark-Big-Data-RDD-Operations
This project illustrates Apache Spark RDD operations, from creation and transformation to actions and results, enhancing users' understanding of distributed data processing.