/ih_datamadpt0923_project_m3

Primary LanguageJupyter NotebookGNU General Public License v3.0GPL-3.0

📣 📣 📣 Diamond Price Prediction

Image

This project aims to develop regression models to predict diamond prices using historical transaction data.

💥 Project Description

In this project, we explore a dataset containing information about diamond transactions, including features such as carat weight, cut type, clarity, color, and city of transaction. Using these features, we build regression models to predict the price of diamonds.

Repository Structure

data/: This directory contains the datasets used in the project. notebooks/: Here are the Jupyter notebooks used for data analysis and model construction.

1_RandomForestRegressor.ipynb: Notebook containing data preprocessing, including data loading, removal of unnecessary columns, and one-hot encoding of categorical variables.

2_Model_Training.ipynb: Notebook where regression models are trained using Random Forest and XGBoost.

solution_1.csv: CSV file containing predictions from the Random Forest model on the test dataset.

solution_2.csv: CSV file containing predictions from the XGBoost model on the test dataset.

README.md: This file providing information about the project.

Dependencies

This project requires the following dependencies:

  • Python 3.x

  • pandas

  • scikit-learn

  • XGBoost

💡 Contributions

Contributions are welcome! If you'd like to contribute to this project, please open an issue or submit a pull request.

📁 Folder structure

└── ih_datamadpt0923_project_m3
    ├── 1_RandomForestRegressor
    ├── 2_XGBOOST
    ├── data
    │── LICENSE
    ├── solution_1 
    └── solution_2