Exploratory Data Analysis (EDA) is the initial and an important phase of data analysis. Through this phase, analysts/modelers will have a first look of the data, and thus generate relevant hypothesis and decide next steps. However, the EDA process could be a hassle at times. This R
package aims to automate most of data handling and visualization, so that users could focus on studying the data and extracting insights.
The package can be installed from github
using devtools
package.
if (!require(devtools)) install.packages("devtools")
library(devtools)
install_github("boxuancui/DataExplorer")
If you would like to get the latest development version, you may run the following code in R
.
if (!require(devtools)) install.packages("devtools")
library(devtools)
install_github("boxuancui/DataExplorer", ref="develop")
The package is extremely easy to use. Almost everything could be done in one line of R
code. Please refer to the package manuals for more information.
To get a report for the iris
dataset:
library(DataExplorer)
GenerateReport(iris)
To get a report for the diamonds
dataset in ggplot2
package:
library(DataExplorer)
library(ggplot2)
GenerateReport(diamonds)