GabriellaCerrai
UCT BSc, Pure Mathematics and Chemistry | Operations Analyst Graduate
DataOrbis Cape Town
Pinned Repositories
Car-Price-Logistic-Regression-Model
Code for a logistic regression model predicting the price of a car using the following predictor variables: body, mileage, engine volume, engine type, registration and year purchased.
R_Machine_Learning_Classification
This project classifies handwritten images as the numbers 'one', 'seven' or 'eight'. The models use the Cross Entropy loss function and the 'tanh' activation function with h2o deeplearning, as well as Trees.
R_regression_analysis2
This regression analysis predicts the LBM (lean body mass) of athletes and tests whether LBM differs between males and females. The analysis uses the following explanatory variables: Sex (0: males, 1: females), Ht (height in cm), Wt (weight in kg), WCC (white cell count), Hg (hemoglobin) and Hc (hematocrit).
Simple-K-Means-Clustering
This document clusters a wine data set. The analysis uses the 'Colour Intensity' and 'Flavinoids' variables in the data set and is a simple demonstration of how the K-Means method can be used to cluster real-world data.
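For readers unfamiliar with the method, here is a minimal sketch of K-Means (Lloyd's algorithm) on two variables. It is not the repository's code: the function name `kmeans`, the first-k-points initialisation, and the six data points are all assumptions made for illustration, and the points are invented rather than taken from the wine data set.

```python
import math

def kmeans(points, k, iterations=100):
    """Lloyd's algorithm on a list of (x, y) tuples; returns (centroids, labels)."""
    # Assumed deterministic initialisation: the first k points are the starting centroids.
    centroids = list(points[:k])
    labels = None
    for _ in range(iterations):
        # Assignment step: each point joins its nearest centroid.
        new_labels = [
            min(range(k), key=lambda j: math.dist(p, centroids[j]))
            for p in points
        ]
        # Update step: each centroid moves to the mean of its members.
        for j in range(k):
            members = [p for p, lab in zip(points, new_labels) if lab == j]
            if members:  # an empty cluster keeps its previous centroid
                centroids[j] = tuple(sum(c) / len(members) for c in zip(*members))
        if new_labels == labels:  # assignments stopped changing: converged
            break
        labels = new_labels
    return centroids, labels

# Invented two-variable observations, standing in for two numeric columns.
data = [(1.0, 1.0), (5.0, 5.0), (1.2, 0.8), (0.8, 1.2), (5.2, 4.8), (4.8, 5.2)]
centroids, labels = kmeans(data, k=2)
print(labels)
print(centroids)
```

In practice the result depends on the starting centroids, so library implementations usually run several random initialisations and keep the best one.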
Simple-Linear-Regression-3-Ways-
This repo contains code to create a simple Linear Regression model in 3 different ways: 1) Python - using statsmodels 2) Python - using sklearn and 3) R using the linear model (lm) function. This code demonstrates how each method returns the same variable coefficients, p-values and other valuable test statistics.