The assignment focuses on bias in Artificial Intelligence. The sources of bias, detection, and mitigation are the key points covered under this assignment. The assignment is divided into 2 parts, the first one focuses on evaluation metrics and detection of bias in the system. This is performed on MNIST data, and bias detection is done for ‘1’ and ‘7’ in the dataset. The former one focuses on neural network performance for detecting the bias and mitigating the bias via Data addition and multitasking techniques.
- Data Sets
- http://yann.lecun.com/exdb/mnist/
- https://drive.google.com/file/d/1WYb3Xonb52ZPOpyN58t0WTQPXUnwDg0U/view?usp=sharing
- Google Colab
- Libraries
- Sklearn
- Numpy
- Matplotlib
- OpenCV
- access MNIST Dataset and create Training-Testing Dataset(randomizing data)
- Segregate 1’s(6000) and 7’s(500) data and labels
- Take 3 different 7’s samples of 500
- Train SVM and Neural Network Model for 3 different training sets
- Test respective models and find confusion matrix and Prediction probability
- Find Mean Accuracy and standard deviation
- Plot ROC and Precision-Recall for different training set
- Access the images from provided folders, create training and testing dataset from folders (1) “.\specs_train”, (2) “.\specs_test”, (3) “.\nonSpecs_train”, (4) .\nonSpecs_test”, and (5) “.\data”
- Resizing images to 32*32
- Training neural network of architecture [ 128 -- 128 -- 128 -- 64 -- 1 ], activation function ‘sigmoid’ used
- Finding out bias with the help of the confusion matrix
- Mitigating the bias by: e. 1. DATA method (Training using more data): You may use more data for training. Use (5) “.\data” folder for extra images. e. 2. ALGORITHMIC method: Alter loss function to incorporate more challenges. Use a multi-tasking approach to achieve your aim.
Dataset URL MNIST Data Set http://yann.lecun.com/exdb/mnist/ Face Image Dataset https://drive.google.com/file/d/1WYb3Xonb52ZPOpyN58t0WTQPXUnwDg0U/view?usp=sharing
-----------------------FOR DETAILS REFER MNIST and FACE DATASET PDF’s----------------------