arjunmajum/vln-bert

How to perform the data preprocessing

Closed this issue · 0 comments

In your INSTALL.md, data preprocessing is summarized as a python script to download existing processed files.

However, as your VLN-BERT model is transferable, I would like to try this on other datasets and other models. Hence, how to pre-process the data is very important to me.

To be specific, I would like to learn about the settings of how to generate beam_[dataset].json and the scores_[dataset].json and which model is used to compute those beams (Is it just following this github repo? Have you ever changed anything?). As if we would like to combine VLN-BERT with Follower and Speaker, we have to use data from beam_[dataset].json.

Could you please release more details about that?

Thanks a lot!