RUCAIBox/TextBox

ModuleNotFoundError: No module named 'bert_score'

Howene opened this issue · 14 comments

Hi, thanks for providing such a powerful tool. After I cloned TextBox from source, I tried to run the command "python run_textbox.py", and it reported: ModuleNotFoundError: No module named 'bert_score'. Is this a bug, and how do I run the code correctly?

You need to install the packages in requirements.txt.
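
Typically that is just standard pip usage from the TextBox root directory:

pip install -r requirements.txt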

Hi, after I installed the packages from requirements.txt, it reported "ModuleNotFoundError: No module named 'files2rouge'".

You can run bash install.sh if you don't have files2rouge.
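
If install.sh is not convenient, files2rouge can also be installed manually; roughly, following the files2rouge README (steps may have changed since):

git clone https://github.com/pltrdy/files2rouge.git
cd files2rouge
python setup_rouge.py
python setup.py install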

Thanks. Then I downloaded the IMDB dataset from Google Drive, installed files2rouge, and ran the following script:

import argparse

from textbox.quick_start import run_textbox

if __name__ == '__main__':
    parser = argparse.ArgumentParser()
    parser.add_argument('--model', '-m', type=str, default='TransformerEncDec', help='name of models')
    parser.add_argument('--dataset', '-d', type=str, default='IMDB', help='name of datasets')
    parser.add_argument('--config_files', type=str, default=None, help='config files')

    args, _ = parser.parse_known_args()

    config_file_list = args.config_files.strip().split(' ') if args.config_files else None
    run_textbox(model=args.model, dataset=args.dataset, config_file_list=config_file_list, config_dict={})
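
(The traceback below shows this script was saved as run_IMDBTransformer.py, so since all the arguments have defaults it would be launched simply as:

python run_IMDBTransformer.py )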

it would report that:
Traceback (most recent call last):
  File "run_IMDBTransformer.py", line 18, in <module>
    run_textbox(model=args.model, dataset=args.dataset, config_file_list=config_file_list, config_dict={})
  File "/TextBox/textbox/quick_start/quick_start.py", line 82, in run_textbox
    best_valid_score, best_valid_result = trainer.fit(train_data, valid_data, saved=saved)
  File "TextBox/textbox/trainer/trainer.py", line 339, in fit
    train_loss = self._train_epoch(train_data, epoch_idx)
  File "TextBox/textbox/trainer/trainer.py", line 183, in _train_epoch
    losses = self.model(data, epoch_idx=epoch_idx)
  File "anaconda3/envs/torchforgpu/lib/python3.6/site-packages/torch/nn/modules/module.py", line 889, in _call_impl
    result = self.forward(*input, **kwargs)
  File "TextBox/textbox/model/Seq2Seq/transformerencdec.py", line 168, in forward
    source_text = corpus['source_idx']
KeyError: 'source_idx'

If you use the source code, please follow this instruction and run it from the command line.

If you want to use the API, please first run pip install -e . in the TextBox folder, and then follow this instruction.
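
For example, after the editable install, the API call mirrors the script above (a minimal sketch based on the run_textbox signature shown in this thread; RNN and IMDB are illustrative choices):

from textbox.quick_start import run_textbox

run_textbox(model='RNN', dataset='IMDB', config_file_list=None, config_dict={})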

I used the source code and followed the "Quick-Start" section. I guess that the "IMDB.yaml" file needs to be changed as follows:
max_vocab_size: 30000
max_seq_length: 100
split_strategy: "by_ratio"
split_ratio: [0.8,0.1,0.1]
overlength_strategy: "truncate"
language: "English"
task_type: "unconditional"
source_suffix: bin
target_suffix: bin

right?
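
As an aside, judging by the --model and --dataset flags used elsewhere in this thread, dataset settings like these can presumably also be overridden directly on the command line in the same --key=value style (an assumption, not something confirmed in this thread), e.g.:

python run_textbox.py --model=RNN --dataset=IMDB --max_seq_length=100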

If you use the source code, please follow this instruction and run it from the command line.

If you want to use the API, please first run pip install -e . in the TextBox folder, and then follow this instruction.

I am confused about the meaning of the parameter 'source_idx'. What should I do to make the "Quick-Start" work?

  1. Clone the latest repository.
  2. Download the IMDB dataset (raw data), and put corpus.txt in the folder TextBox/dataset/IMDB (see the layout sketch after this list).
  3. Run the command python run_textbox.py --model=TransformerEncDec --dataset=IMDB
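
For step 2, the expected layout would be as follows (only corpus.txt comes from the download; the rest of the tree already exists in the repo):

TextBox/
  dataset/
    IMDB/
      corpus.txt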

I've done all the above steps and it reported that:
File "run_textbox.py", line 18, in
run_textbox(model=args.model, dataset=args.dataset, config_file_list=config_file_list, config_dict={})
File "TextBox/textbox/quick_start/quick_start.py", line 82, in run_textbox
best_valid_score, best_valid_result = trainer.fit(train_data, valid_data, saved=saved)
File "TextBox/textbox/trainer/trainer.py", line 339, in fit
train_loss = self._train_epoch(train_data, epoch_idx)
File "/data/home/yangzuoxi/HPCC/TextBox/textbox/trainer/trainer.py", line 183, in _train_epoch
losses = self.model(data, epoch_idx=epoch_idx)
File "anaconda3/envs/tfgpu/lib/python3.6/site-packages/torch/nn/modules/module.py", line 1051, in _call_impl
return forward_call(*input, **kwargs)
File "TextBox/textbox/model/Seq2Seq/transformerencdec.py", line 168, in forward
source_text = corpus['source_idx']
KeyError: 'source_idx'

Sorry, I didn't notice before: IMDB is a dataset for unconditional generation, and we do not support unconditional generation with Transformer.
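
That also explains the KeyError above: a seq2seq model such as TransformerEncDec reads corpus['source_idx'], but an unconditional dataset like IMDB yields batches with no source side. A minimal sketch of the mismatch (the field names other than 'source_idx' are assumptions for illustration):

# Batch from an unconditional dataset: target side only (sketch; the
# 'target_idx' field name is an assumption for illustration).
batch = {'target_idx': [[2, 17, 9, 3]]}

# What a conditional model's forward() effectively does with such a batch:
source_text = batch['source_idx']  # raises KeyError: 'source_idx'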

I think Transformer could be used on IMDB. Do you plan to make it support unconditional generation with Transformer?

Yes, we plan to. For now, we only support unconditional generation with RNN.
Thanks for your suggestion.
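
So, for anyone hitting this, the unconditional quick-start that should work under the current constraint is (assuming the same CLI flags as above):

python run_textbox.py --model=RNN --dataset=IMDB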

And, if possible, I suggest providing a table showing which models can be used with which datasets.

OK, I will provide it in the next version. Thank you!