/ucidata

R data package containing data sets on UCI's ML repo

Primary LanguageR

ucidata - Data Sets from UC Irvine’s ML Library

R-CMD-check

The following is an R data package that features certain data sets from the Machine Learning Library at UC Irvine. These data sets have been cleaned up and provide documentation via R’s help system.

Note

Want to easily access data sets not included in this package?

Check out the {ucimlrepo} R package! The package provides an interface to download and automatically load data sets from the UC Irvine Machine Learning Repository.

Installation

You can install ucidata from github with:

# install.packages("remotes")
remotes::install_github("coatless-rpkg/ucidata")

Using data in the package

There are two ways to access the data contained within this package.

The first is to load the package itself and type the name of a data set. This approach takes advantage of R’s lazy loading mechansim, which avoids loading the data until it is used in R session. For details on how lazy loading works, please see Section 1.17: Lazy Loading of the R Internals manual.

# Load the `ucidata` package
library("ucidata")

# See the first 10 observations of the `autompg` dataset
head(autompg)

# View the help documentation for `autompg`
?autompg

The second approach is to use the data() command to load data on the fly without and type the name of a data set.

# Loading `autompg` without a `library(ucidata)` call
data("autompg", package = "ucidata")

# See the first 10 observations of the `autompg` dataset
head(autompg)

# View the help documentation for `autompg`
?autompg

Included Data Sets

The following data sets are included in the ucidata package:

Build Scripts

Want to see how each data set was imported? Check out the data-raw folder!