Learning data wrangling

I'm learning data wrangling by reading the book Data Wrangling with R by Gustavo R. Santos.

The idea of this repo is to collect the code for the book. So why keep my own version of code that is already available?

  1. Learning: I learn best when I make the code 'my own'.
  2. tidyverse: The R dialect I like best is {tidyverse}. Nothing against base R or data.table, but I prefer the tidy syntax. So I've gone through the code asking myself whether I could rewrite it using tidyverse-related packages and functions.
  3. Readability: I strive to follow the tidyverse style guide. I'm sure I don't always succeed, but I try to make the code easier to read, whether by adding comments or by using the pipe (%>%) instead of nested function calls.
  4. Helping others: It's a cliché, but if even one person reads this and finds even parts of it useful, I'm happy.
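
To make point 3 concrete, here's a small sketch of what "pipe instead of nested functions" means. This is not code from the book: it assumes {dplyr} is installed and uses the built-in mtcars dataset, and the filter threshold and column names are arbitrary choices for illustration.

```r
library(dplyr)

# Nested version: has to be read from the inside out
nested <- arrange(
  summarise(
    group_by(filter(mtcars, hp > 100), cyl),
    mean_mpg = mean(mpg)
  ),
  desc(mean_mpg)
)

# Piped version: same steps, read top to bottom
piped <- mtcars %>%
  filter(hp > 100) %>%          # keep cars with more than 100 hp
  group_by(cyl) %>%             # one group per cylinder count
  summarise(mean_mpg = mean(mpg)) %>%
  arrange(desc(mean_mpg))       # highest average mpg first
```

Both versions should give the same result; the piped one just lists the steps in the order they happen, which is why I tend to prefer it.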

Disclaimer!

This repo is not meant to replace the book in any way. You should definitely read the book. It will help you understand data wrangling in general much better than just looking at the code or playing with it.

Also, I would recommend you buy the book. Here's a direct link to Packt's (the publisher) website.