THE APP IS LIVE HERE -----> https://stream2261.herokuapp.com/
An End-to-End Machine Learning Project On Early Breast Cancer Detection Using Support Vector Machine and K-Nearest Neighbor. All The Exploratory Data Analysis, Data Visualization and Model Building is in breastCancerDetection.ipynb File.
Attribute Information:
- Sample code number: id number
- Clump Thickness: 1 - 10
- Uniformity of Cell Size: 1 - 10
- Uniformity of Cell Shape: 1 - 10
- Marginal Adhesion: 1 - 10
- Single Epithelial Cell Size: 1 - 10
- Bare Nuclei: 1 - 10
- Bland Chromatin: 1 - 10
- Normal Nucleoli: 1 - 10
- Mitoses: 1 - 10
Class: (2 for benign, 4 for malignant) Malignant==> Cancerous
Benign==> Not Cancerous (Healthy)
Techniques Used
- Data Cleaning
- Data Visualization
- Machine Learning Modeling
Algortihms Used
- Logistic Regression
- Support Vector Machine
- KNN
- Naivye Bayes
- Random Forest Classifier
Model Evaluation Methods Used
- Accuracy Score
- Confusion Matrix
Packages and Tools Required:
- Pandas
- Matplotlib
- Seaborn
- Scikit Learn
- Jupyter Notebook
Package Installation
- pip install numpy
- pip install pandas
- pip install seaborn
- pip install scikit-learn
- pip install matplotlib
- pip install plotly
- pip install streamlit