R package to simplify Exploratory Data Analysis

DataExplorer


Background

Exploratory Data Analysis (EDA) is the initial and an important phase of data analysis. In this phase, analysts and modelers take a first look at the data, generate relevant hypotheses, and decide on next steps. However, the EDA process can be tedious. This R package aims to automate most of the data handling and visualization, so that users can focus on studying the data and extracting insights.

Installation

The package can be installed from GitHub using the devtools package.

if (!require(devtools)) install.packages("devtools")
library(devtools)
install_github("boxuancui/DataExplorer")

To install the latest development version instead, run the following code in R.

if (!require(devtools)) install.packages("devtools")
library(devtools)
install_github("boxuancui/DataExplorer", ref="develop")
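
To confirm that the installation worked, something like the following sketch may help. It uses only base R; the helper names is_installed and installed_version are hypothetical and not part of DataExplorer.

```r
# Hypothetical helpers (not part of DataExplorer), base R only.

# TRUE if the named package can be loaded, without attaching it.
is_installed <- function(pkg) {
  requireNamespace(pkg, quietly = TRUE)
}

# Installed version as a string, or NA if the package is missing.
installed_version <- function(pkg) {
  if (is_installed(pkg)) {
    as.character(utils::packageVersion(pkg))
  } else {
    NA_character_
  }
}

installed_version("DataExplorer")
```

An NA result suggests that the install step did not complete, for example because of a network or dependency problem.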

Examples

The package is easy to use: most tasks can be done in a single line of R code. Please refer to the package manual for more information.
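
As a quick way to see which functions are available before opening the manual, the exported names can be listed from within R. This is a minimal sketch using base R only; list_exports is a hypothetical helper, and the names it returns depend on the installed version.

```r
# Hypothetical helper (not part of DataExplorer), base R only.
# Lists the functions a package exports, or character(0) if it is not installed.
list_exports <- function(pkg) {
  if (!requireNamespace(pkg, quietly = TRUE)) {
    return(character(0))
  }
  sort(getNamespaceExports(pkg))
}

list_exports("DataExplorer")

# The package index can be opened interactively with:
# help(package = "DataExplorer")
```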

To get a report for the iris dataset:

library(DataExplorer)
GenerateReport(iris)

To get a report for the diamonds dataset in the ggplot2 package:

library(DataExplorer)
library(ggplot2)
GenerateReport(diamonds)
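
In scripts, it may be useful to guard the report call so that a missing package or a renamed function produces a message rather than an error. The sketch below is one possible approach, assuming the report function is exported under the name used above. The wrapper run_report and its arguments are hypothetical and not part of DataExplorer.

```r
# Hypothetical wrapper (not part of DataExplorer).
# Calls the report function only if the package and the function are available.
run_report <- function(data, fun_name = "GenerateReport", pkg = "DataExplorer") {
  if (!requireNamespace(pkg, quietly = TRUE)) {
    message("Package ", pkg, " is not installed; skipping report.")
    return(invisible(FALSE))
  }
  if (!fun_name %in% getNamespaceExports(pkg)) {
    message(fun_name, " is not exported by ", pkg, "; check the package manual.")
    return(invisible(FALSE))
  }
  # Look up the exported function by name and call it on the data.
  report_fun <- getExportedValue(pkg, fun_name)
  report_fun(data)
  invisible(TRUE)
}

run_report(iris)
```

The wrapper returns TRUE invisibly when the report function was called and FALSE otherwise, so a calling script can decide how to proceed.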