/titanic-survival-prediction

This project tackles the Titanic challenge on Kaggle, predicting passenger survival based on variables like age, sex, and passenger class. The Jupyter notebook covers essential steps of a data science pipeline, including exploratory data analysis, data cleaning, feature engineering, and modeling. The dataset used is the Titanic dataset.

Primary language: Jupyter Notebook

Titanic-Survival-Prediction

The Titanic challenge on Kaggle is a competition in which the task is to predict whether a given passenger survived or died, based on a set of variables describing that passenger, such as age, sex, or passenger class.

My solution, presented as a Jupyter notebook, goes through the basic steps of a data science pipeline:

  • Exploratory data analysis with visualizations
  • Data cleaning
  • Feature engineering
  • Modeling

Dataset used: Titanic dataset
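
As a rough illustration of how the four pipeline steps above fit together, here is a minimal sketch. It is not the notebook's actual code. It assumes pandas and scikit-learn are installed, uses column names from the Kaggle `train.csv` layout (`Survived`, `Pclass`, `Sex`, `Age`, `SibSp`, `Parch`), and runs on a handful of invented rows rather than the real dataset. The `prepare` helper, the `IsFemale` and `FamilySize` features, and the choice of logistic regression are all assumptions made for the example; the visualizations are omitted.

```python
import pandas as pd
from sklearn.linear_model import LogisticRegression

# Toy rows invented for illustration; these are NOT real Titanic records.
# Column names follow the Kaggle train.csv layout.
toy = pd.DataFrame({
    "Survived": [0, 1, 1, 0, 1, 0],
    "Pclass":   [3, 1, 3, 3, 2, 1],
    "Sex":      ["male", "female", "female", "male", "female", "male"],
    "Age":      [22.0, 38.0, None, 35.0, 27.0, 54.0],
    "SibSp":    [1, 1, 0, 0, 0, 0],
    "Parch":    [0, 0, 0, 0, 2, 0],
})


def prepare(df):
    """Hypothetical helper: data cleaning plus feature engineering."""
    out = df.copy()
    # Data cleaning: fill missing ages with the median age.
    out["Age"] = out["Age"].fillna(out["Age"].median())
    # Feature engineering: encode sex as 0/1 and derive family size
    # (siblings/spouses + parents/children + the passenger).
    out["IsFemale"] = (out["Sex"] == "female").astype(int)
    out["FamilySize"] = out["SibSp"] + out["Parch"] + 1
    return out[["Pclass", "Age", "IsFemale", "FamilySize"]]


# Exploratory data analysis: survival rate per sex.
survival_by_sex = toy.groupby("Sex")["Survived"].mean()

# Modeling: fit a simple classifier on the prepared features.
X = prepare(toy)
y = toy["Survived"]
model = LogisticRegression(max_iter=1000).fit(X, y)
predictions = model.predict(X)
```

With the real data, `toy` would be replaced by something like `pd.read_csv("train.csv")`, and the model would be evaluated on held-out rows rather than on the rows it was trained on.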