Using the Pima Indians Diabetes Database, build a gaussian naive bayes supervised machine learning model to accurately predict whether or not the patients in the dataset have diabetes or not?
Original data file is pima-data.csv or can be downloaded at Pima Indians Diabetes Database.
After preprocessing using the data-preprocessing.py and plot-correlation.py programs, the cleaned data has been exported to precessed-pima.csv.
The training and evaluation on the model is implemented in pima-prediction.py.
Achieved Accuracy of 75% approximately on test-spilt.
Still working on to improve accuracy..