Short Cuts for Deep Neural models (2023)

Smit, J.P. - TU Delft

This repository contains code to accompany a Thesis Submitted to EEMCS Faculty Delft University of Technology, in Partial Fulfilment of the Requirements for the Bachelor of Computer Science and Engineering.

Notable credits should go to JoshuaGhost for creating and maintaining ExPred: the Deep Neural model which we have studied for this thesis. The model is an implementation of the paper Explain and Predict, and then Predict Again.

Structure

This repository is a copy of the ExPred repository, with added Jupyter Notebooks. It contains subsequences mined from the FeVer dataset, a big database (90.000 items) of fact queries containing a label 'Supported' or 'Refuted'. The ExPred model draws evidence from Wikipedia pages to give queries those labels.

Interestingly, the ExPred model is not hundred percent fallible, it makes mistakes sometimes. Yet the mistakes can be structural, meaning the model is biased. We design an algorithm to point out the biggest biases of the ExPred model.

Algorithm

The algorithm was designed as follows:

Take the training dataset of FeVer as a dataset. Take ExPred as a model. Take DESQ as a subsequence mining tool.

  1. Mine sequences from the 'Refuted' queries of the dataset.
  2. Repeat with 'Supported' queries of the dataset.
  3. XOR: Combine the two subsets of sequences and remove the duplicates.
  4. Evaluate the mined sequences of both sides.
  5. Compute the model's correlation: confirm that the model output agrees with the training data
  6. Propose 'Unseen claims': claims containing the subsequence, that the model has not yet observed
  7. Perform 'Adverserial Attacks': swap the subsequence for a term that retains the meaning

Results

Click on a sequence name to get to the corresponding notebook. Each notebook contains the code needed to perform the correlation check and adverserial attacks.

  • Subsequence : subsequence that was mined from FeVer.
  • Prediction Class : corresponds to the most prevalent prediction label of the mined subsequence.
  • Relative Support : percentage of cases in which ExPred predicts the same class for the query containing the subsequence.
  • Precision : precision of the ExPred model on the queries in which it is actually right.
  • Success-rate : amount of adverserial attacks in which flipping the meaning also meant flipping the label.
Subsequence Prediction Class
is incapable of being REFUTED
has only ever been REFUTED
does not have REFUTED
is exclusively REFUTED
is not a(n) REFUTED
has yet to REFUTED
is only a(n) REFUTED
was unable to REFUTED
There is a SUPPORTED
was incapable of REFUTED

Reproducing the Research

Here are the steps for those who are interested in reproducing the research.

  1. Clone this repository
  2. Install the requirements for the ExPred model by pip install -r requirements
  3. Run the provided Jupyter Notebooks