/rstats-ed

List of courses teaching R

rstats-ed

Inspired by the Using Julia in the classroom page and suggestion by @Peter_Griffin's post on RStudio Community as well as the Learn the tidyverse page.

The goal is to create a repository where one can discover courses and learning materials for learning and teaching R.

This is a user curated list and is bound to be non-comprehensive at all times. If you have suggestions for courses to add, please submit a pull request or add an issue.

Online courses

MOOCs

  • Statistics with R Coursera Specialization: Mine Çetinkaya-Rundel, Merlise Clyde, Colin Rundel, David Banks. 5 courses: Introduction to Data and Probability, Inferential Statistics, Linear Regression and Modeling, Bayesian Statistics, and Capstone.

  • JHU Data Science Coursera Specialization: This Specialization covers the concepts and tools you'll need throughout the entire data science pipeline, from asking the right kinds of questions to making inferences and publishing results. In the final Capstone Project, you’ll apply the skills learned by building a data product using real-world data.

  • Chromebook Data Science: Chromebook Data Science (CBDS) is a free, massive open online educational program offered through Leanpub to help anyone who can read, write, and use a computer to move into data science.

Other learning resources

  • RStudio Cloud Primers: RStudio Cloud is a free, cloud based version of the RStudio IDE. Packaged within RStudio Cloud are primers, which are collections of interactive tutorials made with learnr. These primers teach the basics of R and the Tidyverse.

  • DataCamp's R offerings: DataCamp offers interactive R (and Python, Sheets, SQL, and shell) courses on topics in data science, statistics and machine learning following a "learn by doing" philosophy. Courses run interactively in the browser.

  • LinkedIn Learning offerings: LinkedIn Learning (previously lynda.com) offers beginner, intermediate, and advanced courses in R, as well as Python, SQL, Java, C, PHP, Javascript and other languages. All courses are offered as downloadable video content and are used in many universities.

University courses teaching R

2018

  • STA 112 - Better Living with Data Science - Duke University; Mine Çetinkaya-Rundel. Data Science course for first year undergraduates with little to no computing background. Combines techniques from statistics, math, computer science, and social sciences, to learn how to use data to understand natural phenomena, explore patterns, model outcomes, and make predictions. Data wrangling, exploratory data analysis, predictive modeling, data visualization, and effective communication of results. Discussions around reproducibility, data sharing, data privacy.

  • Data Science for Public Management - Brigham Young University; Andrew Heiss. Data science and statistics class for Master of Public Administration (MPA) students with little math or computing experience. Uses ModernDive, DataCamp, R for Data Science, and OpenIntro Statistics to cover tidyverse data wrangling, inference, and hypothesis testing. All projects and in-class examples use data related to public affairs, administration, and policy.

  • Data Visualization - Brigham Young University; Andrew Heiss. Data visualization class for Master of Public Administration (MPA) students with some experience with R. Uses Alberto Cairo's The Truthful Art: Data, Charts, and Maps for Communication, Kieran Healy's Data Visualization: A Practical Introduction, Claus Wilke's Fundamentals of Data Visualization, and R for Data Science to cover principles of graphic design and fundamentals of visualizing data with ggplot2.

  • Reproducible & Collaborative Data Science - UC Berkeley, Carl Boettiger. Data Science course for first year graduate students in both the natural and social sciences. A modular, flipped-classroom approach that combines reading, exercises and videos based on R for Data Science and DataCamp with more open-ended assignments to replicate, extend, and sometimes challenge key results from the scientific literature on global change. Note: an upper-division undergraduate version of the course is also being developed under the title Data Science for Global Change Ecology

  • Statistical Computing - Duke University, Colin Rundel. Statistical programming with R and its interfaces with custom code development for central statistical models. Best practices and software development for reproducible results, selecting topics from: use of markup languages, understanding data structures, design of graphics, object oriented programming, vectorized code, scoping, documenting code, profiling and debugging, building modular code, and version control- all in contexts of specific applied statistical analyses.

  • DSCI 521: Computing Platforms for Data Science - University of British Columbia, Tiffany Timbers. How to install, maintain, and use the data scientific software "stack". The Unix operating system, integrated development environments (Jupyter and RStudio), and problem solving strategies.

  • CT5102: Programming for Data Analytics - NUI Galway, Jim Duggan: A module that is part of the M.Sc. Computer Science (Data Analytics). There are twelve topics, and these will be updated as the course progresses. The course structure has main three elements: (1) Base R (Vectors, Functions, Lists, Matrices and Data Frame), (2) Data Science, with the tidyverse packages in R (ggplot2, dplyr, readr, tidyr, lubridate and stringr), (3) Advanced R, including closures, object systems (S3, S4 and RC), and building packages.

  • A Jupyter + R ( + mybinder.org) tutorial for social scientists - Goldsmiths, University of London, Caspar Addyman - A self taught or one session class taught to our Masters Level Advanced Methods class. Using an online notebook shared using MyBinder, it goes through basics of editing code and markdown in Jupyter notebooks, how to host these on MyBinder with fixed date snapshots for reproducibility. It provides very simple examples of loading local or remote data files, filtering and graphing with tidyverse and running simple statistical tests using ezANOVA

  • Microsoft Research Data Science Summer School - Microsoft Research New York, Jake Hofman. An intensive, eight-week hands-on introduction to data science for college students in the New York City area focused on increasing diversity in computer science. Students learn Git, Bash, and R, focusing on concepts in statistics, modeling, and machine learning. All coursework is available on Github. Students produce an original group research project at the end of the program. Projects from the past several years are available here, along with corresponding data and code.

2017

  • Modeling Social Data - Columbia University, Jake Hofman. One semester class for upper division undergraduate and first year graduate students that focuses on data-driven modeling for large-scale, social data. Material draws on statistics, computer science, and the social sciences. R is the primary language taught for the course, students gain experience collecting, cleaning, analyzing, and modeling with the tidyverse and related tools. All slides, code, and student-scribed notes are available on Github. Students complete a final project in small groups where work on an original research problem of their choice.