/EDA

This document is aim to explain about statistical knowledge

Primary LanguageJupyter Notebook

Fundamental Exploratory Data Analysis

This document is aim to explain about exploratory data analysis using python. Exploratory data analysis is an activity to check fundamental of data description to ensure that our data fit to the next round.

Content

Pandas Basic

  1. Pandas Utilization
  2. Statistic Descriptive
  3. Simple Visualization

EDA Visualization

  1. Exploratory data analysis using univariate Part I (simple)
  2. Exploratory data analysis using univariate Part II (simple)
  3. Exploratory data analysis using multivariate
  4. Outlier detection
  5. Exploratory data analysis of missing value handler
  6. Exploratory data analysis categorical data -- To be continued
  7. Data reduction
  8. Numerical data transformation/normalization

Exploratory Analysis for Mining

Based on the Jiawei Han, dkk. Exploratory data analysis can be used to do preprocessing data in data mining process. There are three main process that we have to do :

  1. Data cleansing/cleaning (missing value, etc)
  2. Data integration and reduction (PCA, SVM, etc)
  3. Data transformation/normalization