/reproducible-research-gesis-2022

Materials for the 2022 GESIS Training workshop "Tools and Workflows for Reproducible Research in the Quantitative Social Sciences"

Primary LanguageHTML

Tools and Workflows for Reproducible Research in the Quantitative Social Sciences

Materials for the 2022 GESIS workshop "Tools and Workflows for Reproducible Research in the Quantitative Social Sciences"

by Johannes Breuer, Bernd Weiß, and Arnim Bleier

Please link to the workshop GitHub repository


Workshop description

The focus of the course is on reproducible research in the quantitative social and behavioral sciences. Reproducibility here means that other researchers can fully understand and (re-)use your statistical analyses. The workflows and tools covered in this course will ultimately facilitate your work as they, e.g., allow you to automate analysis and reporting tasks. This course aims to introduce participants to tools and processes for reproducible research and enable them to use those for their work. In addition to a conceptual introduction to the methods and key terms around reproducible research, this course focuses on procedures for making a data analysis with R fully reproducible. We will cover questions about project organization (e.g., folder structures, naming schemes, documentation) and choosing and working with tools such as command-line interfaces (PowerShell, Bash, etc.), RStudio and R Markdown, Git and GitHub, Jupyter Notebooks and Binder.

Target group

The workshop is targeted at participants who have (at least some) experience with R and want to learn (more) about workflows and tools for making the results of their research reproducible.

Learning objectives

By the end of the course participants should:

  • have gained important insights into key concepts of reproducible research and recommended best practices
  • be able to work with state-of-the art frameworks and tools, such as R Markdown, Jupyter, Git, and Binder
  • be able to publish reproducible computational analysis pipelines with R

Prerequisites

Participants should have some basic knowledge of R. While this is not required per se, participants who have experience doing statistical analysis in R will benefit most from this course. To facilitate applying the methods covered in the course to their work, we recommend that participants ensure to install all necessary software on their computers before the start of the course.

Timetable & content

Day 1

Time Topic Slides Exercises Solutions
10:00 - 11:00 Introduction HTML, PDF - -
11:00 - 12:00 Computer literacy HTML, PDF see slides see create-project.sh
12:00 - 13:00 Lunch Break - - -
13:00 - 15:00 Introduction to R Markdown HTML, PDF HTML HTML
15:00 - 15:15 Coffee Break - - -
15:15 - 17:00 Git & GitHub - Part I HTML, PDF see slides -

Day 2

Time Topic Slides Exercises Solutions
09:30 - 10:30 Git & RStudio HTML, PDF HTML HTML
10:30 - 10:45 Coffee Break - - -
10:45 - 11:45 Jupyter Notebooks & Binder PDF Project -
11:45 - 12:30 Build your own Binder - PDF Project
12:30 - 13:30 Lunch Break - - -
13:30 - 14:00 Sharing & publishing HTML, PDF - -
14:00 - 14:15 Coffee Break - - -
14:15 - 15:45 Git & GitHub - Part II HTML, PDF see slides -
15:45 - 17:00 Recap & Outlook HTML, PDF - -

Acknowledgments

The R Markdown parts of this workshop were created using the R packages xaringan, unilur, and woRkshoptools. The materials are based on an earlier version of this workshop and a similar course by Frederik Aust and Johannes Breuer.