DataSlingers/clustRviz

Code to Create `presidential_speech` data set

Opened this issue · 3 comments

Add code to (re)create the `presidential_speech` data set. This will allow the data set to be regenerated as more speeches are recorded. (If we update the "official" version in the package, we may need to version it somehow.)

jjn13 commented

Do we have a preference for where this should be in the project directory structure?

The code includes an R script (used for text processing), but it also uses Python for web scraping and some Bash scripts for moving things around. In total, it's really just a small set of directories.

Do we want to add it as a .zip, for example, or copy over the whole directory?

By default, I'd put it in `inst` (i.e., random files that get installed but are not used by the package), but I think the `data-raw` directory is also a common choice. See http://r-pkgs.had.co.nz/data.html#data-extdata

jjn13 commented

Sounds good, I've submitted it as #40.