title | author |
---|---|
Materials for D-Lab's R for Data Science |
Dillon Niederhut |
This repository contains the instructor materials for the D-Lab's R intensive.
You can download the contents of this repository with:
git clone https://github.com/dlab-berkeley/r-for-data-science.git
or, by clicking the "Download Zip" button and then extracting the .zip
file.
The instructor of this workshop series will lead you through the activities for each day.
You'll see accumulated teaching notes and examples for each day's topics in the instructor folder. For your convenience, these are available as .Rmd, commented .R files, PDF documents, and HTML slides. The meta-document for this workshop series, which explains the logic behind the structure and topics, can be viewed at the D-Lab guides repository
For information on contributing to this repository, see CONTRIBUTING.md
The standard Drupal workshop descriptions and facetweet postings for this workshop series are in PUBLICITY.md
data/
: data necessary for interactive coding examplesexamples/
save_console_output.R
: R code for saving console output to pdf
instructor/
: teaching notesscripts/
feedback_cleaner.R
: used to clean data for use in Day 3regenrate_files.R
: for regenerating.R
and.pdf
files from.Rmd
This workshop series covers:
- Interacting with R
- Datatypes
- Data structures
- Reading data
- Sanitizing data
- Missing data
- Reshaping data
- Summary statistics
- Plotting
- Linear models
- Non-parametric models
- Functions
- Loops
- Parallelization
- Packages
This workshop uses the following packages:
- Amelia
- devtools
- dplyr
- foreign
- ggplot2
- parallelMap
- RCurl
- roxygen2
- stringr
- tidyr
- XML
D-Lab == Data Intensive Social Science, For All!