GOAL: predict whether the cancer is benign or malignant.
BEST SCORE: 0.9956 AUC (best model - Logistic Regression)
The features:
- ID number
- Diagnosis (M = malignant, B = benign)
- radius (mean of distances from center to points on the perimeter)
- texture (standard deviation of gray-scale values)
- perimeter
- area
- smoothness (local variation in radius lengths)
- compactness (perimeter^2 / area - 1.0)
- concavity (severity of concave portions of the contour)
- concave points (number of concave portions of the contour)
- symmetry
- fractal dimension ("coastline approximation" - 1)
K-Nearest Neighbors
Test (default) | Test (7 neighbors) | |
---|---|---|
Accuracy: | 0.9649 | 0.9649 |
Precision: | 0.9524 | 0.9524 |
Recall: | 0.9524 | 0.9524 |
F1_score: | 0.9524 | 0.9524 |
Logistic Regression
Test (default) | Test (GridSearchCV) | |
---|---|---|
Accuracy: | 0.9766 | 0.9766 |
Precision: | 0.9841 | 0.9841 |
Recall: | 0.9538 | 0.9538 |
F1_score: | 0.9688 | 0.9688 |
Decision Tree
Total metrics:
Test (default) | Test (GridSearchCV) | |
---|---|---|
Accuracy: | 0.92 | 0.95 |
Best model is Logistic Regression
tuned with GridSearchCV.