This repository contains my solution for the Kaggle Titanic Machine Learning competition.
The Titanic competition is a classic machine learning problem where the goal is to predict whether a passenger survived or not based on various features such as age, sex, ticket class, etc.
The dataset for this competition contains information about passengers aboard the Titanic, including whether they survived or not. It's divided into two sets: training set (with labels) and test set (without labels). The data is provided in CSV format.
train.csv
: Training dataset containing features and labels.test.csv
: Test dataset containing only features.
- Python 3
- Pandas
- NumPy
- Scikit-learn
I approached this problem using a combination of exploratory data analysis, feature engineering, and machine learning algorithms such as Random Forest and Gradient Boosting.
- Clone this repository.
- Install the dependencies using
pip install -r requirements.txt
. - Download the dataset from Kaggle and place
train.csv
andtest.csv
in thedata/
directory. - Run the Jupyter notebook
titanic.ipynb
to see the data analysis and model building process. - Use the trained model to make predictions on the test set.
My final submission achieved an accuracy of X% on the Kaggle leaderboard.
This project was completed by Sadeq Soltani.
This project is licensed under the MIT License - see the LICENSE file for details.