/data.world-r

R client library for data.world

Primary LanguageRApache License 2.0Apache-2.0

data.world-r

CircleCI cran version

The data.world package is the main R package for working with datasets on data.world.
Using this packages users can:

  • Query and join datasets
  • Choose between SQL and SPARQL
  • Use data.world's REST API (via included dwapi package)

Getting Started

To get started, load the library and checkout the quickstart vignette.

library(data.world)
vignette("quickstart", package = "data.world")

Here is a simple example:

intro_ds <- "https://data.world/jonloyens/an-intro-to-dataworld-dataset"

sample_query <- data.world::qry_sql(paste0(
  "SELECT t.Name, t.Height, s.AssistsPerGame ",
  "FROM DataDotWorldBBallTeam as t ",
  "JOIN DataDotWorldBBallStats as s ON t.Name = s.Name ",
  "ORDER BY s.AssistsPerGame DESC"))

data.world::query(sample_query, dataset = intro_ds)

Notice that dataset is parameter required by most functions and can be provided in two formats:

  1. URL: "https://data.world/jonloyens/an-intro-to-dataworld-dataset"
  2. Path: "jonloyens/an-intro-to-dataworld-dataset"

Installation

To get the current released version from CRAN:

install.packages("data.world")

To get the current development version from GitHub:

devtools::install_github("datadotworld/data.world-r", build_vignettes = TRUE)

Configuration

First, users must obtain an API authentication token at: https://data.world/settings/advanced

IMPORTANT: For your security, do not include your API authentication token in code that is intended to be shared with others.

Configuration can be provided in 3 forms:

  • Via config file
  • Via environment variables
  • At runtime

Config files are the most convenient configuration method. Once saved configuration will be automatically restored when data.world is loaded.

From the R console:

saved_cfg <- data.world::save_config("YOUR API TOKEN")
data.world::set_config(saved_cfg)

For additional configuration options, see ?set_config

Next

Check out the quickstart and query vignette and the package documentation (?data.world).

> vignette("quickstart", package = "data.world")
> ?data.world