Original dataset: https://archive.ics.uci.edu/ml/datasets/diabetes
Kaggle dataset: https://www.kaggle.com/uciml/pima-indians-diabetes-database
This dataset is originally from the National Institute of Diabetes and Digestive and Kidney Diseases. The objective of the dataset is to diagnostically predict whether or not a patient has diabetes, based on certain diagnostic measurements included in the dataset. Several constraints were placed on the selection of these instances from a larger database. In particular, all patients here are females at least 21 years old of Pima Indian heritage.
- Data Cleaning
- Data Visualization
- Machine Learning Modeling
  - Logistic Regression
  - KNN
  - Support Vector Machine
  - Naive Bayes
  - Random Forest Classifier
  - Decision Tree
  - XGBoost
- Logistic Regression: 77.92%
- KNN: 74.92%
- Support Vector Machine: 78.57%
- Naive Bayes: 77.27%
- Random Forest Classifier: 80.52%
- Decision Tree: 79.22%
- XGBoost: 75.32%
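
A comparison like the one above could be sketched as follows. This is a minimal illustration, not the code used in this repository: the helper name `compare_models`, the 80/20 split, the feature scaling, and all hyperparameters are assumptions, and XGBoost is left out so the sketch depends only on scikit-learn.

```python
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier


def compare_models(X, y, test_size=0.2, random_state=42):
    """Fit several classifiers on one train/test split; return test accuracy per model.

    Hypothetical helper: the split size, scaling, and default hyperparameters
    are illustrative assumptions, not the settings behind the listed results.
    """
    # Stratify so both splits keep the same share of positive cases.
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=test_size, random_state=random_state, stratify=y
    )

    # Distance- and margin-based models are wrapped with a scaler;
    # tree-based models and Naive Bayes are used on the raw features.
    models = {
        "Logistic Regression": make_pipeline(
            StandardScaler(), LogisticRegression(max_iter=1000)
        ),
        "KNN": make_pipeline(StandardScaler(), KNeighborsClassifier()),
        "Support Vector Machine": make_pipeline(StandardScaler(), SVC()),
        "Naive Bayes": GaussianNB(),
        "Random Forest Classifier": RandomForestClassifier(random_state=random_state),
        "Decision Tree": DecisionTreeClassifier(random_state=random_state),
    }

    scores = {}
    for name, model in models.items():
        model.fit(X_train, y_train)
        scores[name] = accuracy_score(y_test, model.predict(X_test))
    return scores
```

With the Kaggle CSV loaded into a pandas DataFrame `df` (its label column is `Outcome`), a call such as `compare_models(df.drop(columns="Outcome"), df["Outcome"])` returns a dictionary of accuracies. The numbers will depend on the split and settings, so they will not necessarily match the figures listed above.
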
- Clone this repository and unzip it.
- After downloading, `cd` into the `Deployment` directory.
- Create a new virtual environment with Python 3 and activate it.
- Install the required packages using `pip install -r requirements.txt`.
- Execute the command: `python manage.py runserver`
- Open http://127.0.0.1:8000/ in your browser.
- Pandas
- Matplotlib
- Seaborn
- Scikit-learn
- Jupyter Notebook
- Django
pip3 install -r requirements.txt