sparkr
There are 30 repositories under sparkr topic.
awesome-spark/awesome-spark
A curated list of awesome Apache Spark packages and resources.
cluster-apps-on-docker/spark-standalone-cluster-on-docker
Learn Apache Spark in Scala, Python (PySpark) and R (SparkR) by building your own cluster with a JupyterLab interface on Docker. :zap:
jadianes/spark-r-notebooks
R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
awesome-spark/learn-by-examples
Real-world Spark pipelines examples
tomaztk/Azure-Databricks
Azure Databricks - Advent of 2020 Blogposts
manuparra/taller_SparkR
Taller SparkR para las Jornadas de Usuarios de R
microsoft/A-TALE-OF-THREE-CITIES
Analyzing the safety (311) dataset published by Azure Open Datasets for Chicago, Boston and New York City using SparkR, SParkSQL, Azure Databricks, visualization using ggplot2 and leaflet. Focus is on descriptive analytics, visualization, clustering, time series forecasting and anomaly detection.
manuparra/MasterDatCom_BDCC_Practice
Practice and Workshop on BigData and Cloud Computing using Docker Containers and OpenNebula. HDFS, hadoop and spark+R
RSummerSchool/R-for-HPC-and-big-data
Slides and lab material for the talk R for HPC and big data at http://rsummer.data-analysis.at
zero323/dlt
Mirror of https://gitlab.com/zero323/dlt
cosmincatalin/cubist-regression
Fit a Cubist regression model on StackOverflow data and make predictions in a distributed manner with SparkR
jaehyeon-kim/sparkr-demo
SparkR Demo
spark-in-a-box/sparkr-build-sandbox
Docker images for testing SparkR builds
duttashi/cheatsheets
A curated list of essential cheatsheets for data analysis, visualization and machine learning using R or Python
manuparra/taller-bigdata-con-r
Taller Big Data con Apache Spark + R desde Databricks cloud
slothkong/r_on_gcloud
R workloads running at scale on Google Cloud
ukdataservice/bdas2017
Course material for the "Encounters with Big Data" course delivered by the UK Data Service at the 2017 Big Data and Analytics Summer School.
konhay/self-service-modeler
Self-service modeling analysis tool based on R language and big data. It integrates SparkR, Rserve, and Mlib machine learning libraries
lix90/Rnotes
R notebooks
d4rthm4ul/R-Cleaning-Exploration-Imputation-Visualization
This repository you are browsing contains intermediate level piece of codes which are useful for cleaning, exploratory analysis, handling of missing data points, outlier detection and different visualization techniques using graphics, ggplot2, tidycharts, ggExtra packages. Also in particular part of the script you can get basic information about SparkR package which is an R package that provides a light-weight frontend to use Apache Spark from R . Do not be shy to fork and make contribute.
MatthiasDE/spark_standalone_docker
Multiple-Node Standalone Spark with R and Python
reeantencamah/R_Linear-Regression_K-Means-Algorithm
Bi and Big Data Analytics, sparkR, Supervised and Unsupervised Machine Learning techniques The project's aim is of applying a supervised and an unsupervised machine learning technique on a dataset to test different models/scenario, interpret the results, perform predictions for each model and visualised the results.
ruz023/Spark-Desmontration
This is a demonstration of using Spark to explore large dataset, by using PySpark and SparkR. The files include loading data, data exploration and using clustering on words of Shakespeare's novels.
TIME-GATE/r-spark-service
用r、spark做的一些统计分析、机器学习实例,待传
ashish-kamboj/BigData-Analytics
Data analysis and Model building on large datasets using Hive and Spark
gomezportillo/sparkR-hadoop
Processing massive datasets in Hadoop and SparkR
jaehyeon-kim/rocker-extra
Extra docker images from rocker/tidyverse