/unit3_Project

This the final project for the unit 3 ( Classical) Machine Learning

Primary LanguageJupyter NotebookMIT LicenseMIT

unit3_Project

Project Overview

Hepatitis C is a viral infection that causes liver inflammation, sometimes leading to serious liver damage. The hepatitis C virus (HCV) spreads through contaminated blood. The infection was one of the main death causes from 1960 to 1970. In this project we will be studying a dataset of Hepatitis C infection. The dataset was donated by the Medical University Hannover (MHH); Hannover, Germany.

You will walk through an example containing some data preparation , classification task and a clustering task to end up with a conclusion part where you will be comparing the two methods and providing explanation about the observed phenomenon.

You will have enough guidance throughout the project sections and this project will be graded and reviewed by a teacher assistant as well as the chance for you to reach out to the TA via slack whenever you feel you are stuck.

The key is to know that the best way to learn is the hardest! :)

Details :

You will be using the HCV data Data Set.

  • The project is divided into 4 notebooks at the end of each you will be saving a results file (except for the final notebook).
  • The first notebook will be tackeling the data cleaning and preparation.
  • The second notebook will be dedicated to the unsupervised learning task.
  • The third notebook will be dedicated to the supervised learning task.
  • The final notebook will be dedicated to your final analysis and conclusions.

Make sure to follow the instuctions in each notebook and comment and explain your steps and analysis

Getting started

  1. Fork this repository into your Github account. To do so, click on the fork button in the upper right hand corner of a repo page.
  2. Head to the forked version on your github then clone it on your local space.
  3. Open the notebooks in order :
  • "1. Data cleaning.ipynb"
  • "2. Unsupervised task.ipynb"
  • "3. Supervised task.ipynb"
  • "4. Conclusion" In each notebook you will find the necessary steps and guidance you need for this project. Remember: only the sections which are marked as "To-Do" are the ones you should be changing. Feel free to manage your versions as you want.
  1. Once you finished your project and you are confident about the results, You have to push those details to your forked version of the project The following Git commands should be helpful for you as a reminder :
  • $git add .
  • $git commit -m "commit msg"
  • $git push origin main

Once you're done working on the project, submit the link to you repository in the platform.