ExplainTI

Code for the paper "Towards Explainable Table Interpretation Using Multi-view Explanations". ICDE 2023.

Primary language: Python. License: Apache-2.0.

Towards Explainable Table Interpretation Using Multi-view Explanations

ExplainTI is a framework for explaining table interpretation with multi-view explanations. It consists of two phases: tables are first converted to sequences and lightweight column graphs; then a pre-trained transformer encoder is fine-tuned to aggregate contextual information and provide explanations from three views: (i) Local Explanations: the most relevant phrases (or pairwise phrases) in columns (or column pairs) for the predictions; (ii) Global Explanations: the most influential samples from the training data, which tend to have labels similar to the input sample; (iii) Structural Explanations: a relatively small-scale but highly effective method constructs column graphs for tables, which not only aggregates contextual information but also provides explanations from a structural view via the graph attention mechanism.
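For intuition, here is a minimal sketch of the first phase: serializing a single table column into a token sequence for the transformer encoder. The serialize_column helper, the concatenation format, and the 64-token limit (cf. --column_length) are illustrative assumptions, not the repository's exact implementation.

from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")

def serialize_column(header, cells, max_length=64):
    # Illustrative assumption: concatenate the header and the cell values
    # into one sequence, truncated to --column_length tokens.
    text = " ".join([header] + [str(c) for c in cells])
    return tokenizer(text, truncation=True, max_length=max_length)

encoded = serialize_column("country", ["France", "Japan", "Brazil"])
print(encoded["input_ids"])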

Framework

The proposed ExplainTI Framework

Requirements

  • Python 3.7.11
  • PyTorch 1.9.0+cu111
  • HuggingFace Transformers 4.11.3
  • faiss 1.7.1
  • scikit-learn 1.0
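A minimal environment setup, assuming pip; PyTorch 1.9.0+cu111 should be installed separately following the official instructions for your CUDA version, and the faiss package name depends on whether you use the CPU or GPU build:

pip install transformers==4.11.3 scikit-learn==1.0 faiss-gpu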

Datasets

We used two real-world, large-scale benchmark datasets of different types for evaluation.

WikiTable

The WikiTable dataset consists of Web tables collected from Wikipedia. We used the same train/valid/test splits as TURL.

GitTable

The GitTable dataset is the first large-scale relational table corpus. We used its organism subset and split it into train, valid, and test sets with a ratio of 8:1:1.

The raw dataset is available from GitTable. The preprocessing code is in gittable.py and the preprocessed data is in data/GitTable.
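For reference, an 8:1:1 split can be reproduced roughly as follows. This is an illustrative sketch using scikit-learn, not the exact logic of gittable.py; the random seed and the table container are assumptions.

from sklearn.model_selection import train_test_split

def split_tables(tables, seed=42):
    # Hold out 20% of the tables, then split that part half-and-half
    # into valid and test, giving an 8:1:1 ratio overall.
    train, rest = train_test_split(tables, test_size=0.2, random_state=seed)
    valid, test = train_test_split(rest, test_size=0.5, random_state=seed)
    return train, valid, test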

Build the column graph

python build_graph.py [<args>] [-h | --help]

e.g.

python build_graph.py --data_name=GitTable --sample_size=16
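As a rough illustration of what a lightweight column graph looks like (the actual construction lives in build_graph.py; the node and edge scheme below is a simplifying assumption): each column becomes a node carrying a sample of at most --sample_size cell values, and columns from the same table are connected so that graph attention can aggregate contextual information.

import random

def build_column_graph(table, sample_size=16, seed=0):
    # table: dict mapping column header -> list of cell values.
    rng = random.Random(seed)
    # Nodes: one per column, keeping at most sample_size cell values.
    nodes = {h: rng.sample(cells, min(sample_size, len(cells)))
             for h, cells in table.items()}
    # Edges: fully connect columns that co-occur in the same table.
    headers = list(table)
    edges = [(a, b) for i, a in enumerate(headers) for b in headers[i + 1:]]
    return nodes, edges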

Training with ExplainTI

To train the model and get the explanations with ExplainTI:

python train.py [<args>] [-h | --help]

e.g.

python train.py --le=0.1 --ge=0.1 --se --save_model --data_name=GitTable --path=./data/GitTable --batch_size=160 --column_length=64 --num_type=1141

The meaning of the flags:

  • --data_name: the name of the dataset. e.g. WikiTable
  • --path: the local path of the dataset. e.g. ./data/WikiTable
  • --column_length: maximum length of a column after being converted to a sequence. e.g. 64
  • --n_epoch: the number of epochs. e.g. 40
  • --lr: the learning rate. e.g. 5e-5
  • --model: the language model to be fine-tuned. e.g. bert-base-uncased
  • --use_large: whether to use the large version of the model
  • --le: the loss weight of the local view. e.g. 0.05
  • --ge: the loss weight of the global view. e.g. 0.05
  • --se: a flag indicating whether to turn on the structural view
  • --window_size: the number of tokens in a window. e.g. 8
  • --top_k: the number of explanations generated for each view. e.g. 10
  • --num_type: the number of types. e.g. 255
  • --num_relation: the number of relations. e.g. 121
  • --attention_method: the aggregate method for graph attention. e.g. dot
  • --save_model: if this flag is on, then save the checkpoint to {save_path}/{name}.pt.
  • --save_path: see --save_model. e.g. ./checkpoint
  • --update_epoch: how often (in epochs) the global embedding store is updated (see the sketch after this list). e.g. 10
  • --cell_duplicated: whether to perform cell-level deduplication.
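Since faiss is listed in the requirements, the global view can be pictured as a nearest-neighbour lookup over an embedding store of training samples, refreshed every --update_epoch epochs, returning the --top_k most similar samples as global explanations. The snippet below is a hedged sketch of that idea, not the repository's exact implementation; the index type, dimensions, and variable names are assumptions.

import numpy as np
import faiss

def build_store(train_embeddings):
    # train_embeddings: float32 array of shape (n_samples, dim),
    # refreshed every --update_epoch epochs during training.
    index = faiss.IndexFlatIP(train_embeddings.shape[1])  # inner-product similarity
    faiss.normalize_L2(train_embeddings)
    index.add(train_embeddings)
    return index

def global_explanations(index, query_embedding, top_k=10):
    # Return the indices and scores of the --top_k most similar training samples.
    q = query_embedding.astype(np.float32).reshape(1, -1)
    faiss.normalize_L2(q)
    scores, ids = index.search(q, top_k)
    return ids[0], scores[0]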