/reproducible-research-gesis-2021

Materials for the 2021 GESIS workshop "Tools and Workflows for Reproducible Research in the Quantitative Social Sciences"

Primary LanguageHTML

Tools and Workflows for Reproducible Research in the Quantitative Social Sciences

Materials for the 2021 GESIS workshop "Tools and Workflows for Reproducible Research in the Quantitative Social Sciences"

by Bernd Weiß, Johannes Breuer, and Arnim Bleier

Please link to the workshop GitHub repository


Workshop description

The focus of the course is on reproducible research in the quantitative social and behavioral sciences. Reproducibility here means that other researchers can fully understand and (re-)use your statistical analyses. The workflows and tools covered in this course will, ultimately, also facilitate your own work as they allow you to automate analysis and reporting tasks. The goal of this course is to introduce participants to tools and processes for reproducible research and enable them to make use of those for their own work. In addition to a conceptual introduction to the processes and key terms around reproducible research, the focus in this course will be on procedures for making a data analysis with R fully reproducible. We will cover questions of organization (e.g., folder structures, naming schemes, documentation) as well as choosing and working with the tools for reproducible research (in addition to R and RStudio): Git & GitHub, LaTeX, R Markdown, Jupyter Notebooks, and Binder).

Target group

The workshop is targeted at participants who have (at least some) experience with R and want to learn (more) about workflows and tools for making the results of their research reproducible.

Learning objectives

By the end of the course participants should be...

  • familiar with key concepts of reproducible research workflows
  • able to (start) work(ing) with tools for reproducible research, such as Git, LaTeX, R Markdown, Jupyter Notebooks, and Binder
  • able to publish reproducible computational analysis pipelines with R

Prerequisites

Participants should have some basic knowledge of R. While this is not required, participants who have experience with doing statistical analysis in R will benefit most from this course.

Timetable

Day 1

Time Topic
10:00 - 10:30 Introduction: What is reproducible research?
10:30 - 11:15 Technical basics
11:15 - 11:30 Break
11:30 - 12:30 Introduction to Git
12:30 - 13:30 Break
13:30 - 14:30 Git & RStudio
14:30 - 15:30 Data wrangling with the tidyverse
15:30 - 15:45 Break
15:45 - 17:00 Introduction to R Markdown

Day 2

Time Topic
10:00 - 11:15 Introduction to LaTeX
11:15 - 11:30 Break
11:30 - 12:30 Advanced R Markdown & LaTeX
12:30 - 13:30 Break
13:30 - 14:45 Jupyter Notebooks & Binder
14:45 - 15:00 Break
15:00 - 16:00 Build your own Binder
16:00 - 17:00 Recap & Outlook

Materials

Day 1

Slides

Introduction - Organizational

Introduction - Substantive (link will work only temporarily)

Technical basics (link will work only temporarily)

Introduction to Git (link will work only temporarily)

Git & RStudio

Data Wrangling

Introduction to R Markdown

Exercises

Git & RStudio

Data Wrangling

Introduction to R Markdown

Solutions

Git & RStudio

Data Wrangling

Introduction to R Markdown

Day 2

Slides

Introduction to LaTeX (PDF slides in English, PDF slides in German)

Advanced R Markdown & LaTeX

Jupyter Notebooks & Binder

Build your own Binder and demo repository

Recap & Outlook

Exercises

Advanced R Markdown & LaTeX

Jupyter Notebooks & Binder

Solutions

Advanced R Markdown & LaTeX