Simple demonstration of basic multivariate statistics.
Notebooks can be previewed on nbviewer.
There are four folders: classification, clustering, factor_analysis, and regression.
Each corresponds to one area of multivariate statistics.
Datasets used:
- Diabetes data - https://scikit-learn.org/stable/datasets/toy_dataset.html#diabetes-dataset
- Titanic data - https://www.kaggle.com/c/titanic/data
- Dutch data - De Deyne, Simon, et al. "Exemplar by feature applicability matrices and other Dutch normative data for semantic concepts." Behavior Research Methods 40.4 (2008): 1030-1048.
- Big Five personality test data - https://www.kaggle.com/tunguz/big-five-personality-test
Required Python packages:
- pandas
- scikit-learn (imported as sklearn)
- statsmodels
- pygam
- factor_analyzer
- plotly
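
Before opening the notebooks, it may help to confirm that the packages listed above are installed. The following is a minimal sketch rather than part of the repository: the helper name `missing_packages` and the `REQUIRED` list are illustrative assumptions, and the check uses import names (for example `sklearn`), which can differ from the names used when installing.

```python
from importlib.util import find_spec

# Import names taken from the package list above (assumed, not read from the repo).
REQUIRED = ["pandas", "sklearn", "statsmodels", "pygam", "factor_analyzer", "plotly"]


def missing_packages(names):
    """Return the top-level module names that cannot be found on the current Python path."""
    return [name for name in names if find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_packages(REQUIRED)
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All required packages are available.")
```

The check only looks for each module on the path and does not import it, so it runs quickly and has no side effects.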