MTLV (Multi-Task Learning Visualizer)

A framework to visualize the learning process of multi-task learning architectures.

One of the challenges of multi-task learning is negative transfer, which is problem dependent. To compare and investigate how each task/head/model performs with different sets of tasks, we designed MTLV to help you investigate how your design and optimization choices affect the learning of a model.

Table of Contents
1. How to clone the repo?
2. Set up the environment
3. Run baseline experiments
4. Run MTL experiments
4.1 Architecture configuration
4.2 Model configuration
4.3 Head configuration
5. MLflow Tracking

1. How to clone the repo?

As this repo includes a submodule, cloning it is slightly different:

Learn more about submodules here.

To clone the repo simply use:
git clone --recurse-submodules https://github.com/maktaf/radiologyreportBESTproject.git

If you have already cloned it without --recurse-submodules, this is how to add the submodules:

  1. Clone the repo:
    git clone https://github.com/maktaf/radiologyreportBESTproject.git
  2. Change directory into the repo:
    cd radiologyreportBESTproject
  3. Initialize and fetch the submodule. The submodule is the mimic-cxr directory, but it is empty at this point.
    First run: git submodule init
    This initializes your local configuration file.
    Then run: git submodule update
    This fetches all the data from the mimic-cxr project and checks out the appropriate commit listed in the radiologyreportBESTproject project.
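
Equivalently, the two submodule steps above can be combined into a single standard git command after cloning:

git submodule update --init --recursive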

2. Set up the environment

./setup.sh
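
Assuming setup.sh creates the env virtual environment referenced in Section 5, activate it before running experiments:

source env/bin/activate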

3. Run baseline experiments

$ python3 src/baseline/main.py run --help
Usage: main.py run [OPTIONS]

Options:
  -d, --dataset TEXT     The name of the dataset, choose between: "openI",
                         "news", "twentynewsgroup", "ohsumed", "reuters"
  -c, --classifier TEXT  Name of the classifier, choose between:
                         "randomForest", "logisticRegression", "xgboost"
  -e, --embedding TEXT   Embedding approach, choose between: "tfidf", "bow"
  --help                 Show this message and exit.

Examples:
python3 src/baseline/main.py run -d twentynewsgroup -c randomForest -e bow
python3 src/main.py run --config src/mtl/config/ohsumed/ohsumed_singlehead1.yml -g 1

4. Run MTL experiments

$ python3 src/main.py run --help
Usage: main.py run [OPTIONS]

  Read the config file and run the experiment

Options:
  -cfg, --config PATH   Configuration file.
  -g, --gpu-id INTEGER  GPU ID.
  --help                Show this message and exit.

python3 src/main.py run --config src/mtl/config/openI_1layer.yml
python3 src/main.py run --config src/mtl/config/openI_singlehead.yml --gpu-id 1

4.1 Architecture configuration

Example:

training:
  type: MTL_cls
  epoch : 25
  batch_size : 16
  use_cuda : True
  cv : True # False
  fold : 5
Architecture  Description
STL_cls       Single Task Learning (a separate model per task)
MTL_cls       Multi Task Learning (a single model for all tasks)
GMTL_cls      Grouping Multi Task Learning (a few models, each learning a group of tasks)
GMHL_cls      Grouping Multi Head Learning (a single model learning groups of tasks in separate heads)

Provide the architecture name in the type field of the training section of the config file.
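
For example, keeping the rest of the training block above unchanged, switching to a grouped multi-head model only changes the type line:

training:
  type: GMHL_cls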

4.2 Model configuration

Download any of the BioBERT versions below and place them in the model_wieghts directory. Then run the script.sh script that is already located in model_wieghts to convert the TensorFlow weights to PyTorch weights. After running this script, pytorch_model.bin is added to the same directory, and the library will automatically use it when the model name is indicated in the config files.

Example:

model: 
  bert:
    model_name: bert-base-uncased
    freeze: False
Model Name         Description
bert-base-uncased  Trained on Wikipedia and BookCorpus
bert-base-cased    Trained on Wikipedia and BookCorpus
BioBERT-Basev1-1   Based on BERT-base-cased (same vocabulary) - PubMed 1M // cased
BlueBERT-Base      Pretrained on PubMed abstracts and clinical notes (MIMIC-III) // uncased
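
Once converted, the weights load like any local Hugging Face checkpoint. A minimal sketch (the directory name is assumed to match the table above; MTLV does this automatically when the model name is set in the config):

from transformers import BertModel, BertTokenizer

# Load the converted PyTorch weights from the local directory
# (assumes pytorch_model.bin, config.json, and vocab.txt are present there).
model = BertModel.from_pretrained("model_wieghts/BioBERT-Basev1-1")
tokenizer = BertTokenizer.from_pretrained("model_wieghts/BioBERT-Basev1-1")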

4.3 Head configuration

Example:

head: 
  MTL:
    heads: MultiLabelCLS
    type: kmediod-labeldesc # givenset meanshift KDE kmediod-label, kmediod-labeldesc 
    # bandwidth: 20
    elbow: 8
    clusters: 4
    # count: 4
    # heads_index : [[0,1,2,3,4,5], [6,7,8,9,10], [11,12,13,14,15], [16,17,18,19]]

Head options:

Head  Description
STL   Single Task Learning (a separate model per task)
MTL   Multi Task Learning (a single model for all tasks)
GMTL  Grouping Multi Task Learning (a few models, each learning a group of tasks)
GMHL  Grouping Multi Head Learning (a single model learning groups of tasks in separate heads)
  1. givenset: Any given set of head groupings.
head: 
  multi-task:
    heads: MultiLabelCLS
    type: givenset
    count: 4
    heads_index : [[0,1,2,3,4,5], [6,7,8,9,10], [11,12,13,14,15], [16,17,18,19]]
  2. KDE: Groups the heads based on count. Bandwidth options: silverman, gridSearch, or any given non-negative int or float.
head: 
  multi-task:
    heads: MultiLabelCLS
    type: KDE
    bandwidth: 20
  3. meanshift: Groups the heads based on count. The bandwidth is computed automatically.
head: 
  multi-task:
    heads: MultiLabelCLS
    type: meanshift
  4. kmediod: Groups the heads based on meaning. Options for type: kmediod-label, kmediod-labeldesc. As some datasets have technical words as their labels, a sentence about the meaning of each label can be given to the model (kmediod-labeldesc) to find a more meaningful representation for clustering.
head: 
  multi-task:
    heads: MultiLabelCLS
    type: kmediod-label
    clusters: 4
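
The kmediod grouping idea can be sketched outside the framework. An illustration only (not the repo's implementation; assumes scikit-learn-extra and per-label embeddings from any encoder):

import numpy as np
from sklearn_extra.cluster import KMedoids

# Toy per-label embeddings: one vector per task label. In MTLV these
# would come from encoding the label (kmediod-label) or a sentence
# describing it (kmediod-labeldesc).
rng = np.random.default_rng(0)
label_embeddings = rng.normal(size=(20, 768))

# Cluster the labels into 4 groups; each cluster becomes one head.
km = KMedoids(n_clusters=4, metric="cosine", random_state=0).fit(label_embeddings)

# heads_index: the list of label indices assigned to each head.
heads_index = [np.where(km.labels_ == c)[0].tolist() for c in range(4)]
print(heads_index)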

5. MLflow Tracking

  1. Server: source env/bin/activate
  2. Server: mlflow ui -h $(hostname -f) -p 5000
  3. Local:
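
The local step is not spelled out here; a typical approach (an assumption, adjust user and host for your server) is an SSH port forward, after which the UI is available at http://localhost:5000:

ssh -N -L 5000:localhost:5000 user@server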

Loss configuration options:

  1. sum of head losses
loss:
  type: sumloss
  2. weighted sum of head losses. Note that you need to know how many heads you have; to figure that out, you can first run the same configuration with sumloss and check how many heads the algorithm calculated for the run.
loss:
  type: weightedsum 
  weights: [0.4, 0.2, 0.2, 0.2]
  3. average of head losses
loss:
  type: avgloss
  4. weighted average of head losses
loss:
  type: weightedavg 
  weights: [0.4, 0.2, 0.2, 0.2]
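
The four options combine the per-head losses as in this minimal Python sketch (illustrative only, not the repo's code; head_losses is a list of scalar losses, one per head):

def combine_losses(head_losses, loss_type, weights=None):
    # sumloss: plain sum of the per-head losses
    if loss_type == "sumloss":
        return sum(head_losses)
    # weightedsum: one weight per head, e.g. [0.4, 0.2, 0.2, 0.2]
    if loss_type == "weightedsum":
        return sum(w * l for w, l in zip(weights, head_losses))
    # avgloss: unweighted mean of the per-head losses
    if loss_type == "avgloss":
        return sum(head_losses) / len(head_losses)
    # weightedavg: weighted mean of the per-head losses
    if loss_type == "weightedavg":
        return sum(w * l for w, l in zip(weights, head_losses)) / sum(weights)
    raise ValueError(f"unknown loss type: {loss_type}")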

Datasets:

  1. 20newsgroup
    We used the bydate version for the experiments in the thesis.
    The original version is also available in sklearn

  2. Ohsumed
    The original version with 23 categories
    Ohsumed O10
    category description

  3. MIMIC-CXR
    Dataset Description
    Steps to Access the data

  4. Open-I
    Dataset Description
    Download
    If you specify the openI dataset in the config file, it will automatically be downloaded and preprocessed for you.
    For details, see src/mtl/datasets/openI.py

  5. Reuters
    The Reuters-21578 corpus consists of 21,578 news stories that appeared on the Reuters newswire in 1987. However, only 12,902 of these documents were manually assigned to categories. These documents are classified across 135 categories [Source](https://www.mat.unical.it/OlexSuite/Datasets/SampleDataSets-about.htm).
    The original version with 135 categories: Original Version, Information files
    Description of dataset and 90 labels
    Reuters R-10