Code to Create `presidential_speech` data set
Opened this issue · 3 comments
Add code to (re)create presidential_speech
data set. This will allow the data set to be re-run as more speeches are recorded. (If we update the "official" version in the package, we may need to version it somehow.)
Do we have a preference for where this should be in the project directory structure?
The code has an R script (used for text processing), but also uses python for web scraping, and some bash scripts for moving things around. In total, it's really a small set of directories.
Do we want to add as a .zip for example, or copy over the whole directory?
By default, I'd put it in inst
(i.e., random files that get installed but not used by the package), but I think that the data-raw
directory is also a common choice. See http://r-pkgs.had.co.nz/data.html#data-extdata