The meta-concepts of data analysis, workflows and projects in R.
This is a place to collaborate on resources and to discuss the meta-concepts of data analysis.
The format will likely evolve, but if you would like to contribute, feel free to open a pull request with any resources or ideas, or submit an issue for discussion.
The idea
I think an underrated skill is guessing how much complexity a project will have before you start, and picking an appropriate workflow.
— Dean Marchiori (@deanmarchiori) March 5, 2020
Resources
R packages
- drake - An R-focused pipeline toolkit for reproducibility and high-performance computing
- ProjectTemplate - A system for automating the thoughtless parts of a data analysis project
- workflowr - Organized + reproducible + shareable data science in R
- rrtools - Tools for Writing Reproducible Research in R
- orderly - Lightweight Reproducible Reporting for R
- fnmate - A function definition generator
- dflow - Automatically setup a drake project
Blog Posts
- Benefits of a function-based diet (The {drake} post) - Miles McBain
- Structuring R Projects
- Using {drake} for Machine Learning
Books
Papers
- Packaging Data Analytical Work Reproducibly Using R (and Friends)
Talks
- Community Call - Reproducible Research with R
- RMarkdown Driven Development - Emily Riederer
- Community Call: Reproducible workflows at scale with drake
Collaborators
If you would like to connect with others, add your Twitter handle or contact info here: