This repository provides materials for a session that is part of the I2DS Tools for Data Science workshop run at the Hertie School, Berlin in October 2023. The student-run workshop is part of the course Introduction to Data Science taught by Simon Munzert at the Hertie School, Berlin, in Fall 2023.
This session will introduce you to the intricacies of factor management with R using the "forcats" package, as well as data cleaning and tidying with the "janitor" package. Both packages are essential for efficient data manipulation and ensuring clean and consistent datasets.
The goals of this session are to:
- Equip you with conceptual knowledge about the "forcats" and "janitor" packages.
- Demonstrate various functions and utilities provided by both packages.
- Provide you with practice material on how to efficiently wrangle and clean data with both packages.
- Elena Dreyer (website, twitter)
- Luis Fernando Ramirez Ruiz (website, twitter)
- Shruti Kakade (website, twitter)
- forcats overview at forcats.tidyverse.org
- janitor package on CRAN
- R for Data Science book - part on factors with forcats
The material in this repository is made available under the MIT license.
Elena Dreyer prepared the presentation slides for "forcats" and contributed to the practice material.
Luis Fernando Ramirez Ruiz prepared the presentation slides for "janitor" and contributed to the practice material.