I will classify customer groups into segments (conduct customer segmentation) by fitting tree models, using the DecisionTreeClassifier, RandomForestClassifier, and AdaBoostClassifier methods within the sci-kit-learn library, and the XGBClassifier method within the XGBoost package.
The dataset was obtained from Kaggle and was initially created for a data science hackathon hosted by Analytics Vidhya. There are 2 datasets, split into a training dataset and test dataset. I will be using the training dataset with 8068 observations and 11 variables.
https://www.kaggle.com/datasets/kaushiksuresh147/customer-segmentation/data
https://datahack.analyticsvidhya.com/contest/janatahack-customer-segmentation/