/628-Module3

Primary LanguageJupyter Notebook

628-Module3

Project information

Team member:

  1. XIYU HAO (xhao33@wisc.edu)
  2. SHUWEI LIU (sliu777@wisc.edu)
  3. DUOHAN ZHANG (dzhang357@wisc.edu)
  4. XINYAN WANG (xwang2587@wisc.edu)

The purpose of this project is to perform a modern statistical analysis from the business dataset. Our methodology includes:

  • Exploratory Data Analysis: to investigate the variables' distributions along with the outliers
  • Data cleansing: to clean the data by imputing some observations which areirrational and create a new cleaned dataset
  • Statistical Model: to construct a model that effectively infers and describes the variables based on the existing data. We also conduct the experiment to select the best model
  • Shiny App: to develop a real-time web application based on our model

Content

data

  • This contains both raw and cleaned data sets we used in the analysis.

code

  • This contains R scripts for the analysis ranging from reading data, exploratory data analysis, data cleaning, data modeling as well as script for Shiny app.

image

  • This contains images obtained from the code and analysis such as figures, tables, and plots.

Executive Summary

  • The two-page document that summarizes the analysis including the introduction, data cleaning, model selection as well as the recommendations for business.

Link to Shiny App