shamanez/Self-Supervised-Embedding-Fusion-Transformer

Replication issues

Dayire opened this issue · 3 comments

Dear Authors,

Thank you for sharing your software with us. I am trying to replicate your results but I am having the following issues/comments:

  1. Am I correct to use the pre-processed files from your other repo, BERT-like-is-All-You-Need? link . If not, can you share the pre-processed data, or explain how to produce it from the raw data?
  2. If the first step is correct: I then used mosi_data as the path to the raw dataset by modifying raw_audio_text_video_dataset.py to load the data path root directly (I did not find any command-line argument for it). So far, this works for MOSI.
  3. Training runs, but sadly the loss does not decrease. I'm using a batch size of 16 for MOSI, and I have 64 GB of GPU memory (4 × 16 GB).
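For reference, the modification in point 2 was along these lines — a minimal sketch, assuming the loader can take a fixed root directory; `DATA_ROOT` and `resolve_split_path` are illustrative names, not the repo's actual API:

```python
import os

# Hypothetical sketch of the change described in step 2: since no
# command-line argument for the dataset root was found,
# raw_audio_text_video_dataset.py is edited to hard-code the root
# instead of reading it from argv.
DATA_ROOT = "/path/to/mosi_data"  # hard-coded dataset root

def resolve_split_path(split: str) -> str:
    """Return the directory holding a given split under the fixed root."""
    return os.path.join(DATA_ROOT, split)

print(resolve_split_path("train"))
```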

Is there something I am missing? I'd appreciate your help.

Thank you, this did help.

Another question: according to the paper, I need to change the following hyperparameters (among others) between MOSI and CMU-MOSEI (which I'm currently interested in):
Self-Attention Blocks, IMA Blocks, Self-Attention Heads, IMA Heads

Can you tell me which of the training command-line arguments these correspond to? Or guide me here? I looked everywhere in the code but couldn't find them.
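For what it's worth, fairseq-based models usually register architecture hyperparameters in a static `add_args` hook on the model class, so if this repo follows that convention the blocks/heads would map to flags defined somewhere like the sketch below. The flag names here are illustrative guesses, not confirmed from this repo:

```python
import argparse

# Illustrative sketch only: fairseq models typically expose architecture
# hyperparameters through an add_args(parser) hook. The flag names below
# are guesses at how blocks/heads might be registered, not this repo's
# actual arguments.
def add_model_args(parser: argparse.ArgumentParser) -> None:
    parser.add_argument("--encoder-layers", type=int, default=6,
                        help="number of self-attention blocks")
    parser.add_argument("--encoder-attention-heads", type=int, default=8,
                        help="number of self-attention heads per block")

parser = argparse.ArgumentParser()
add_model_args(parser)
args = parser.parse_args(["--encoder-layers", "4"])
```

Searching the code for an `add_args` method on the model class may be the quickest way to find the real flag names.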

I'd appreciate it!

Please check the README; I have added the training command line.
https://github.com/shamanez/Self-Supervised-Embedding-Fusion-Transformer#training-command

For MOSI and MOSEI, use --regression-target-mos
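So the invocation would look something like this — a sketch only, with every other argument kept exactly as documented in the README's training command:

```shell
# Hypothetical sketch: append the MOS regression flag to the training
# command given in the README; all other arguments stay as documented.
python train.py "$DATA_PATH" \
    --regression-target-mos
    # plus the remaining arguments from the README training command
```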