Materials for the 2022 GESIS workshop "Tools and Workflows for Reproducible Research in the Quantitative Social Sciences"
by Johannes Breuer, Bernd Weiß, and Arnim Bleier
Please link to the workshop GitHub repository
The focus of the course is on reproducible research in the quantitative social and behavioral sciences. Reproducibility here means that other researchers can fully understand and (re-)use your statistical analyses. The workflows and tools covered in this course will ultimately facilitate your work as they, e.g., allow you to automate analysis and reporting tasks. This course aims to introduce participants to tools and processes for reproducible research and enable them to use those for their work. In addition to a conceptual introduction to the methods and key terms around reproducible research, this course focuses on procedures for making a data analysis with R
fully reproducible. We will cover questions about project organization (e.g., folder structures, naming schemes, documentation) and choosing and working with tools such as command-line interfaces (PowerShell
, Bash
, etc.), RStudio and R Markdown
, Git
and GitHub, Jupyter Notebooks
and Binder
.
The workshop is targeted at participants who have (at least some) experience with R
and want to learn (more) about workflows and tools for making the results of their research reproducible.
By the end of the course participants should:
- have gained important insights into key concepts of reproducible research and recommended best practices
- be able to work with state-of-the art frameworks and tools, such as
R Markdown
,Jupyter
,Git
, and Binder - be able to publish reproducible computational analysis pipelines with
R
Participants should have some basic knowledge of R
. While this is not required per se, participants who have experience doing statistical analysis in R
will benefit most from this course. To facilitate applying the methods covered in the course to their work, we recommend that participants ensure to install all necessary software on their computers before the start of the course.
Time | Topic | Slides | Exercises | Solutions |
---|---|---|---|---|
10:00 - 11:00 | Introduction | HTML, PDF | - | - |
11:00 - 12:00 | Computer literacy | HTML, PDF | see slides | see create-project.sh |
12:00 - 13:00 | Lunch Break | - | - | - |
13:00 - 15:00 | Introduction to R Markdown | HTML, PDF | HTML | HTML |
15:00 - 15:15 | Coffee Break | - | - | - |
15:15 - 17:00 | Git & GitHub - Part I | HTML, PDF | see slides | - |
Time | Topic | Slides | Exercises | Solutions |
---|---|---|---|---|
09:30 - 10:30 | Git & RStudio | HTML, PDF | HTML | HTML |
10:30 - 10:45 | Coffee Break | - | - | - |
10:45 - 11:45 | Jupyter Notebooks & Binder | Project | - | |
11:45 - 12:30 | Build your own Binder | - | Project | |
12:30 - 13:30 | Lunch Break | - | - | - |
13:30 - 14:00 | Sharing & publishing | HTML, PDF | - | - |
14:00 - 14:15 | Coffee Break | - | - | - |
14:15 - 15:45 | Git & GitHub - Part II | HTML, PDF | see slides | - |
15:45 - 17:00 | Recap & Outlook | HTML, PDF | - | - |
The R Markdown
parts of this workshop were created using the R
packages xaringan
, unilur
, and woRkshoptools
. The materials are based on an earlier version of this workshop and a similar course by Frederik Aust and Johannes Breuer.