Predicting Diabetes

Using the Pima Indians Diabetes Database, build a Gaussian naive Bayes supervised machine learning model that predicts whether each patient in the dataset has diabetes.

The original data file is pima-data.csv; it can also be downloaded from the Pima Indians Diabetes Database.

After preprocessing using the data-preprocessing.py and plot-correlation.py programs, the cleaned data has been exported to precessed-pima.csv.
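Naive Bayes treats features as independent given the class, so strongly correlated columns are worth checking before training. The sketch below shows one way such a check could look. It is not taken from plot-correlation.py: the column names, the toy values, and the 0.95 threshold are all assumptions made for illustration.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)


# Toy columns with made-up values (hypothetical names, not the Pima data).
columns = {
    "a": [1.0, 2.0, 3.0, 4.0, 5.0],
    "b": [2.0, 4.0, 6.0, 8.0, 10.0],  # exact multiple of "a"
    "c": [5.0, 3.0, 4.0, 1.0, 2.0],
}

# Flag every pair whose absolute correlation exceeds an arbitrary threshold.
THRESHOLD = 0.95
names = list(columns)
redundant = [
    (p, q)
    for i, p in enumerate(names)
    for q in names[i + 1:]
    if abs(pearson(columns[p], columns[q])) > THRESHOLD
]
print(redundant)
```

A flagged pair is a candidate for dropping one of the two columns, since the second column adds little information the first does not already carry.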

Training and evaluation of the model are implemented in pima-prediction.py.
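For readers unfamiliar with the technique, here is a minimal from-scratch sketch of Gaussian naive Bayes with a held-out test split. It is not the code in pima-prediction.py, which may well rely on a library class such as scikit-learn's GaussianNB. The synthetic data, the 70/30 split, and the function names are assumptions for illustration only.

```python
import math
import random
from collections import defaultdict


def fit_gaussian_nb(X, y):
    """Estimate the prior and the per-feature mean/variance of each class."""
    by_class = defaultdict(list)
    for row, label in zip(X, y):
        by_class[label].append(row)
    model = {}
    for label, rows in by_class.items():
        n = len(rows)
        means = [sum(col) / n for col in zip(*rows)]
        # The small constant keeps every variance strictly positive.
        variances = [
            sum((v - m) ** 2 for v in col) / n + 1e-9
            for col, m in zip(zip(*rows), means)
        ]
        model[label] = (n / len(X), means, variances)
    return model


def predict(model, row):
    """Return the class with the highest log posterior for one row."""
    best_label, best_score = None, -math.inf
    for label, (prior, means, variances) in model.items():
        score = math.log(prior)
        for v, m, var in zip(row, means, variances):
            score += -0.5 * math.log(2 * math.pi * var) - (v - m) ** 2 / (2 * var)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


def accuracy(model, X, y):
    """Fraction of rows whose predicted label matches the true label."""
    correct = sum(predict(model, row) == label for row, label in zip(X, y))
    return correct / len(y)


# Synthetic two-feature data standing in for the real CSV (not Pima values).
rng = random.Random(0)
X = [[rng.gauss(0, 1), rng.gauss(0, 1)] for _ in range(100)]
X += [[rng.gauss(3, 1), rng.gauss(3, 1)] for _ in range(100)]
y = [0] * 100 + [1] * 100

# Shuffle the row indices, then hold out 30% as the test split (assumed ratio).
indices = list(range(len(X)))
rng.shuffle(indices)
cut = int(0.7 * len(indices))
train, test = indices[:cut], indices[cut:]

model = fit_gaussian_nb([X[i] for i in train], [y[i] for i in train])
test_accuracy = accuracy(model, [X[i] for i in test], [y[i] for i in test])
print(f"test accuracy: {test_accuracy:.2f}")
```

The accuracy printed here describes the synthetic data only and says nothing about the result on the Pima dataset.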

Achieved an accuracy of approximately 75% on the test split.

Work to improve the accuracy is ongoing.
