
Multiple statistical inference analyses using one dataset

Primary Language: Jupyter Notebook

Liver Series

  • Data visualization
  • Statistical inference
  • Machine/Deep learning

Part 1

  • Data exploration - shape, dimensions, variable types
  • Data visualization - box plot, bar plot, histogram, kernel density estimation, measures of central tendency
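
As a rough illustration of the exploration step, the sketch below inspects the shape, variable types, and central tendency of a small table. It assumes pandas is available; the DataFrame, its column names (`sex`, `stage`, `bili`), and its values are hypothetical stand-ins, not rows from the actual trial data.

```python
import pandas as pd

# Hypothetical toy table; column names and values are illustrative only.
df = pd.DataFrame({
    "sex": ["f", "m", "f", "f"],        # categorical variable
    "stage": [1, 3, 4, 2],              # ordinal variable
    "bili": [1.1, 14.5, 0.6, 2.3],      # continuous variable
})

# Shape and dimensions: (number of rows, number of columns).
n_rows, n_cols = df.shape

# Variable types of each column.
column_types = df.dtypes

# Measures of central tendency for the continuous column.
mean_bili = df["bili"].mean()
median_bili = df["bili"].median()

print(n_rows, n_cols)
print(column_types)
print(mean_bili, median_bili)
```

A large gap between the mean and the median, as in this toy column, is one hint that a variable is skewed, which a histogram or box plot would make visible.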

Part 2

  • Statistical inference on all categorical variables - chi-square test
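
A chi-square test of independence between two categorical variables might look like the sketch below. It assumes SciPy is available and uses `scipy.stats.chi2_contingency`; the contingency table holds made-up counts, not results from the trial.

```python
import numpy as np
from scipy.stats import chi2_contingency

# Hypothetical 2x2 contingency table with made-up counts:
# rows = two levels of one categorical variable,
# columns = two levels of another.
observed = np.array([[30, 20],
                     [25, 25]])

# Returns the test statistic, p-value, degrees of freedom,
# and the expected counts under independence.
chi2, p_value, dof, expected = chi2_contingency(observed)

print(f"chi2={chi2:.3f}, p={p_value:.3f}, dof={dof}")
```

For 2x2 tables this function applies Yates' continuity correction by default; passing `correction=False` turns it off.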

Part 3

  • Test for Gaussian distribution - Q-Q plot
  • Statistical inference on all categorical and continuous variables - Student's t-test, ANOVA, Mann-Whitney U, Kruskal-Wallis
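
The tests listed above can be sketched with SciPy as follows. The three groups are synthetic draws from normal distributions, used only for illustration. Pairing the parametric tests with their nonparametric counterparts reflects common practice and is an assumption about the workflow, not a description of the notebooks themselves.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)

# Synthetic continuous measurements for three hypothetical groups.
group_a = rng.normal(loc=1.0, scale=0.5, size=40)
group_b = rng.normal(loc=1.3, scale=0.5, size=40)
group_c = rng.normal(loc=1.6, scale=0.5, size=40)

# Q-Q plot coordinates against a normal distribution. An r close to 1
# suggests the sample quantiles track the theoretical ones.
(theoretical_q, ordered_values), (slope, intercept, r) = stats.probplot(
    group_a, dist="norm"
)

# Two groups: parametric (Student's t-test) and nonparametric (Mann-Whitney U).
t_stat, t_p = stats.ttest_ind(group_a, group_b)
u_stat, u_p = stats.mannwhitneyu(group_a, group_b, alternative="two-sided")

# Three or more groups: parametric (one-way ANOVA) and
# nonparametric (Kruskal-Wallis).
f_stat, f_p = stats.f_oneway(group_a, group_b, group_c)
h_stat, h_p = stats.kruskal(group_a, group_b, group_c)

print(f"Q-Q r={r:.3f}")
print(f"t-test p={t_p:.4f}, Mann-Whitney p={u_p:.4f}")
print(f"ANOVA p={f_p:.4f}, Kruskal-Wallis p={h_p:.4f}")
```

To draw the Q-Q plot rather than only compute its coordinates, `stats.probplot` accepts a `plot` argument such as `matplotlib.pyplot`.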

Part 4

  • Parametric and nonparametric correlation analysis for continuous variables - Pearson, Spearman
  • Data visualization - scatterplot
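
A minimal sketch of the two correlation measures, assuming SciPy and using synthetic data with a built-in linear relationship (not trial data):

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(1)

# Synthetic pair of continuous variables with a linear relationship plus noise.
x = rng.normal(size=50)
y = 2.0 * x + rng.normal(scale=0.5, size=50)

# Pearson: parametric, measures linear association.
pearson_r, pearson_p = stats.pearsonr(x, y)

# Spearman: nonparametric, measures monotonic association using ranks.
spearman_rho, spearman_p = stats.spearmanr(x, y)

print(f"Pearson r={pearson_r:.3f} (p={pearson_p:.3g})")
print(f"Spearman rho={spearman_rho:.3f} (p={spearman_p:.3g})")
```

Spearman is often the safer choice when a Q-Q plot suggests a variable departs from normality or when outliers are present, since it depends only on ranks.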

Dataset: Primary biliary cirrhosis clinical trial conducted at the Mayo Clinic between 1974 and 1984.

Data Source: T Therneau and P Grambsch (2000), Modeling Survival Data: Extending the Cox Model, Springer-Verlag, New York. ISBN: 0-387-98784-3.