


Describe the bug

Steps to reproduce: --model=mBART --model_path=facebook/mbart-large-cc25 --dataset=wmt19-zh-en --src_lang=zh_CN --tgt_lang=en_XX

23 Apr 00:43 INFO Pretrain type: pretrain disabled
:1: SyntaxWarning: 'int' object is not callable; perhaps you missed a comma?
:1: SyntaxWarning: 'int' object is not callable; perhaps you missed a comma?
:1: SyntaxWarning: 'int' object is not callable; perhaps you missed a comma?
:1: SyntaxWarning: 'int' object is not callable; perhaps you missed a comma?
:1: SyntaxWarning: 'int' object is not callable; perhaps you missed a comma?
:1: SyntaxWarning: 'int' object is not callable; perhaps you missed a comma?
:1: SyntaxWarning: 'int' object is not callable; perhaps you missed a comma?
:1: SyntaxWarning: 'int' object is not callable; perhaps you missed a comma?
:1: SyntaxWarning: 'int' object is not callable; perhaps you missed a comma?
:1: SyntaxWarning: 'int' object is not callable; perhaps you missed a comma?
:1: SyntaxWarning: 'int' object is not callable; perhaps you missed a comma?
:1: SyntaxWarning: list indices must be integers or slices, not tuple; perhaps you missed a comma?
:1: SyntaxWarning: 'str' object is not callable; perhaps you missed a comma?
:1: SyntaxWarning: list indices must be integers or slices, not tuple; perhaps you missed a comma?
:1: SyntaxWarning: list indices must be integers or slices, not tuple; perhaps you missed a comma?
:1: SyntaxWarning: list indices must be integers or slices, not tuple; perhaps you missed a comma?
:1: SyntaxWarning: list indices must be integers or slices, not tuple; perhaps you missed a comma?
:1: SyntaxWarning: list indices must be integers or slices, not tuple; perhaps you missed a comma?
:1: SyntaxWarning: list indices must be integers or slices, not tuple; perhaps you missed a comma?
:1: SyntaxWarning: list indices must be integers or slices, not tuple; perhaps you missed a comma?
:1: SyntaxWarning: list indices must be integers or slices, not tuple; perhaps you missed a comma?
:1: SyntaxWarning: list indices must be integers or slices, not tuple; perhaps you missed a comma?
:1: SyntaxWarning: list indices must be integers or slices, not tuple; perhaps you missed a comma?
:1: SyntaxWarning: list indices must be integers or slices, not tuple; perhaps you missed a comma?
:1: SyntaxWarning: list indices must be integers or slices, not tuple; perhaps you missed a comma?
Token indices sequence length is longer than the specified maximum sequence length for this model (1776 > 1024). Running this sequence through the model will result in indexing errors
Traceback (most recent call last):
File "", line 15, in
run_textbox(model=args.model, dataset=args.dataset, config_file_list=args.config_files, config_dict={})
File "/hy-tmp/TextBox/textbox/quick_start/", line 20, in run_textbox
experiment = Experiment(model, dataset, config_file_list, config_dict)
File "/hy-tmp/TextBox/textbox/quick_start/", line 56, in __init__
self._init_data(self.get_config(), self.accelerator)
File "/hy-tmp/TextBox/textbox/quick_start/", line 82, in _init_data
train_data, valid_data, test_data = data_preparation(config, tokenizer)
File "/hy-tmp/TextBox/textbox/data/", line 24, in data_preparation
File "/hy-tmp/TextBox/textbox/data/", line 120, in tokenize
ids = tokenizer(
File "/usr/local/miniconda3/envs/TextBox/lib/python3.8/site-packages/transformers/", line 2538, in __call__
encodings = self._call_one(text=text, text_pair=text_pair, **all_kwargs)
File "/usr/local/miniconda3/envs/TextBox/lib/python3.8/site-packages/transformers/", line 2624, in _call_one
return self.batch_encode_plus(
File "/usr/local/miniconda3/envs/TextBox/lib/python3.8/site-packages/transformers/", line 2815, in batch_encode_plus
return self._batch_encode_plus(
File "/usr/local/miniconda3/envs/TextBox/lib/python3.8/site-packages/transformers/", line 428, in _batch_encode_plus
encodings = self._tokenizer.encode_batch(
TypeError: TextEncodeInput must be Union[TextInputSequence, Tuple[InputSequence, InputSequence]]
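
One possible cause of the TypeError above (an assumption, not confirmed in this thread) is that the batch handed to the tokenizer contains entries that are not plain strings, for example None or an already-split list. A minimal sketch for checking the raw text before tokenization; `find_invalid_inputs` and `sample` are hypothetical names used only for illustration and are not part of TextBox or transformers:

```python
from typing import Any, List, Tuple


def find_invalid_inputs(texts: List[Any]) -> List[Tuple[int, str]]:
    """Return (index, type name) for every entry that is not a plain string."""
    return [
        (i, type(t).__name__)
        for i, t in enumerate(texts)
        if not isinstance(t, str)
    ]


# Hypothetical sample standing in for the source-side lines of the dataset.
sample = ["今天天气很好。", None, ["already", "split"], 42, ""]
bad = find_invalid_inputs(sample)
print(bad)  # [(1, 'NoneType'), (2, 'list'), (3, 'int')]
```

If a check like this reports any entries for the real data, the problem probably lies in how the dataset is loaded rather than in the tokenizer itself.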


As a temporary workaround, you can comment out lines 27~34 in ; we will fix this as soon as possible.
